JP7743916B2

JP7743916B2 - Model analysis device, model analysis method, and program

Info

Publication number: JP7743916B2
Application number: JP2024509568A
Authority: JP
Inventors: 竜太松野; 智哉坂井; 啓太佐久間; 義男亀田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2022-03-24
Filing date: 2022-03-24
Publication date: 2025-09-25
Anticipated expiration: 2042-03-24
Also published as: JPWO2023181245A1; WO2023181245A1

Description

本開示は、機械学習モデルの分析に関する。 This disclosure relates to the analysis of machine learning models.

近年、様々な分野において、機械学習により得られた予測モデルが利用されている。特許文献１は、識別モデルの学習に使用する学習用データのカテゴリ毎に、モデルの学習に使用した学習用データの数と、学習により得られたモデルの識別精度との関係を可視化する手法を記載している。In recent years, predictive models obtained through machine learning have been used in a variety of fields. Patent Document 1 describes a method for visualizing the relationship between the number of training data items used to train a discriminative model and the discrimination accuracy of the model obtained through training, for each category of training data used to train the model.

国際公開ＷＯ２０２１／０８５１８８号公報International Publication No. WO2021/085188

予測モデルを用いて予測を行う場合、入力されるデータセットのカテゴリや属性によって、モデルの性能に差が生じることがある。この場合、予測に用いられるデータセットのカテゴリ数や属性数が多いと、モデルの性能の差を人手で確認することは困難となる。 When making predictions using a predictive model, differences in model performance can occur depending on the categories and attributes of the input dataset. In this case, if the dataset used for prediction has a large number of categories or attributes, it becomes difficult to manually check the differences in model performance.

本開示の１つの目的は、予測に使用されるデータセットのカテゴリの属性毎にモデルの性能を評価し、可視化することにある。 One objective of this disclosure is to evaluate and visualize the performance of a model for each categorical attribute of the dataset used for prediction.

本開示の一つの観点では、モデル分析装置は、
モデルを取得するモデル取得手段と、
属性の選択を受け取る選択手段と、
選択された属性に該当するデータセットを取得するデータセット取得手段と、
前記データセットの各カテゴリに対応する属性毎に、前記モデルの性能を算出する性能算出手段と、
算出された属性毎のモデルの性能を示す性能情報を出力する出力手段と、
を備え、
前記性能算出手段は、前記属性の選択の対象となったカテゴリ以外のカテゴリについて、当該カテゴリに対応する属性毎に、前記モデルの性能を算出する。
In one aspect of the present disclosure, a model analysis apparatus includes:
a model acquisition means for acquiring a model;
selection means for receiving a selection of attributes;
a dataset acquisition means for acquiring a dataset corresponding to the selected attribute ;
a performance calculation means for calculating the performance of the model for each attribute corresponding to each category of the data set;
an output means for outputting performance information indicating the calculated performance of the model for each attribute;
Equipped with
The performance calculation means calculates the performance of the model for each attribute corresponding to a category other than the category from which the attribute was selected .

本開示の他の観点では、コンピュータにより実行されるモデル分析方法は、
モデルを取得し、
属性の選択を受け取り、
選択された属性に該当するデータセットを取得し、
前記データセットの各カテゴリに対応する属性毎に、前記モデルの性能を算出し、
算出された属性毎のモデルの性能を示す性能情報を出力し、
前記モデルの性能は、前記属性の選択の対象となったカテゴリ以外のカテゴリについて、当該カテゴリに対応する属性毎に算出される。
In another aspect of the present disclosure, a computer-implemented model analysis method includes:
Get the model,
It takes a selection of attributes,
Retrieve the dataset corresponding to the selected attributes ,
Calculating the performance of the model for each attribute corresponding to each category of the dataset;
Outputting performance information showing the performance of the model for each calculated attribute;
The performance of the model is calculated for each attribute corresponding to a category other than the category for which the attribute was selected .

本発明のさらに他の観点では、プログラムは、
モデルを取得し、
属性の選択を受け取り、
選択された属性に該当するデータセットを取得し、
前記データセットの各カテゴリに対応する属性毎に、前記モデルの性能を算出し、
算出された属性毎のモデルの性能を示す性能情報を出力する処理をコンピュータに実行させ、
前記モデルの性能は、前記属性の選択の対象となったカテゴリ以外のカテゴリについて、当該カテゴリに対応する属性毎に算出される。 In yet another aspect of the invention, a program includes:
Get the model,
It takes a selection of attributes,
Retrieve the dataset corresponding to the selected attributes ,
Calculating the performance of the model for each attribute corresponding to each category of the dataset;
causing a computer to execute a process of outputting performance information indicating the calculated performance of the model for each attribute ;
The performance of the model is calculated for each attribute corresponding to a category other than the category for which the attribute was selected .

本開示によれば、予測に使用されるデータセットのカテゴリの属性毎にモデルの性能を評価し、可視化することが可能となる。 This disclosure makes it possible to evaluate and visualize the performance of a model for each categorical attribute of the dataset used for prediction.

第１実施形態に係るモデル生成システムの全体構成を示すブロック図である。1 is a block diagram showing the overall configuration of a model generation system according to a first embodiment. モデル生成装置のハードウェア構成を示すブロック図である。FIG. 2 is a block diagram showing a hardware configuration of the model generating device. 第１実施形態のモデル生成装置の機能構成を示すブロック図である。FIG. 2 is a block diagram showing the functional configuration of the model generation device according to the first embodiment. 属性別性能情報の表示例を示す。An example of displaying attribute-specific performance information is shown below. 属性別性能情報の他の表示例を示す。10 shows another example of displaying attribute-specific performance information. モデル分析処理のフローチャートである。10 is a flowchart of a model analysis process. サーバと端末装置を用いたモデル生成システムの概略構成を示すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of a model generation system using a server and a terminal device. 第２実施形態のモデル分析装置の機能構成を示すブロック図である。FIG. 10 is a block diagram showing the functional configuration of a model analysis apparatus according to a second embodiment. 第２実施形態のモデル分析装置による処理のフローチャートである。10 is a flowchart of a process performed by a model analysis device according to a second embodiment.

以下、図面を参照して、本開示の好適な実施形態について説明する。
＜第１実施形態＞
［全体構成］
図１は、第１実施形態に係るモデル生成システムの全体構成を示すブロック図である。モデル生成システム１は、モデル生成装置１００と、表示装置２と、入力装置３とを備える。モデル生成装置１００は、本開示のモデル分析装置を適用したものであり、例えばパーソナルコンピュータ（ＰＣ）などのコンピュータにより構成される。表示装置２は、例えば液晶表示装置などであり、モデル生成装置１００が生成した評価情報を表示する。入力装置３は、例えばマウス、キーボードなどであり、ユーザがモデルの修正時や評価情報の閲覧時に必要な指示、入力を行うために使用される。 Hereinafter, preferred embodiments of the present disclosure will be described with reference to the drawings.
First Embodiment
[Overall configuration]
1 is a block diagram showing the overall configuration of a model generation system according to a first embodiment. The model generation system 1 includes a model generation device 100, a display device 2, and an input device 3. The model generation device 100 is an application of the model analysis device of the present disclosure, and is configured by a computer such as a personal computer (PC). The display device 2 is, for example, a liquid crystal display device, and displays evaluation information generated by the model generation device 100. The input device 3 is, for example, a mouse, a keyboard, etc., and is used by the user to give instructions and input necessary when modifying a model or viewing evaluation information.

まず、モデル生成システム１の動作を概略的に説明する。モデル生成装置１００は、予め用意された訓練データを用いて、機械学習モデル（以下、単に「モデル」と呼ぶ。）を生成する。また、モデル生成装置１００は、生成したモデルの分析、評価を行う。具体的に、モデル生成装置１００は、評価用データなどを用いてモデルによる予測を行い、予測結果に基づいてモデルの予測性能を分析する。モデル生成装置１００は、評価用データのカテゴリの属性毎に、モデルの性能を示す値（以下、「性能値」とも呼ぶ。）を算出する。そして、モデル生成装置１００は、カテゴリの属性毎にモデルの性能を示す情報を評価情報としてユーザに提示する。ユーザは、評価情報を確認し、入力装置３を操作してモデルの修正のための修正情報を入力することができる。First, we will provide an overview of the operation of the model generation system 1. The model generation device 100 generates a machine learning model (hereinafter simply referred to as a "model") using training data prepared in advance. The model generation device 100 also analyzes and evaluates the generated model. Specifically, the model generation device 100 performs model predictions using evaluation data, etc., and analyzes the predictive performance of the model based on the prediction results. The model generation device 100 calculates a value indicating the performance of the model (hereinafter also referred to as a "performance value") for each attribute of a category of the evaluation data. The model generation device 100 then presents information indicating the performance of the model for each attribute of the category to the user as evaluation information. The user can check the evaluation information and operate the input device 3 to input correction information for correcting the model.

なお、「機械学習モデル」とは、説明変数と目的変数の関係を表す情報である。機械学習モデルは、例えば、説明変数に基づいて目的とする変数を算出することにより推定対象の結果を推定するためのコンポーネントである。機械学習モデルは、既に目的変数の値が得られている学習用データと任意のパラメータとを入力として、学習アルゴリズムを実行することにより生成される。機械学習モデルは、例えば、入力ｘを正解ｙに写像する関数ｃにより表されてもよい。機械学習モデルは、推定対象の数値を推定するものであってもよいし、推定対象のラベルを推定するものであってもよい。機械学習モデルは、目的変数の確率分布を記述する変数を出力してもよい。機械学習モデルは、「学習モデル」、「分析モデル」、「ＡＩ（ＡｒｔｉｆｉｃｉａｌＩｎｔｅｌｌｉｇｅｎｃｅ）モデル」または「予測式」などと記載されることもある。また、説明変数とは機械学習モデルにおいて入力として用いられる変数である。説明変数は、「特徴量」または「特徴」などと記載されることがある。 Note that a "machine learning model" is information that represents the relationship between explanatory variables and a response variable. A machine learning model is, for example, a component that estimates the results of an estimation target by calculating a response variable based on explanatory variables. A machine learning model is generated by executing a learning algorithm using training data, for which the response variable value has already been obtained, and arbitrary parameters as input. A machine learning model may be represented, for example, by a function c that maps an input x to a correct answer y. A machine learning model may estimate the numerical value of an estimation target, or may estimate the label of an estimation target. A machine learning model may output a variable that describes the probability distribution of the response variable. A machine learning model may also be referred to as a "learning model," "analysis model," "AI (Artificial Intelligence) model," or "prediction formula." An explanatory variable is a variable used as input in a machine learning model. An explanatory variable may also be referred to as a "feature" or "characteristic."

また、機械学習モデルを生成するための学習アルゴリズムは特に限定されず、既存の学習アルゴリズムでよい。例えば、学習アルゴリズムは、ランダムフォレスト、サポートベクターマシン、ナイーブベイズ、ＦＡＢ推論（ＦａｃｔｏｒｉｚｅｄＡｓｙｍｐｔｏｔｉｃＢａｙｅｓｉａｎＩｎｆｅｒｅｎｃｅ）を利用した区分線形モデル、またはニューラルネットワークであってよい。なお、ＦＡＢ推論を利用した区分線形モデルの手法は、例えば米国公開特許ＵＳ２０１４／０２２２７４１Ａ１号公報等に開示されている。 The learning algorithm for generating the machine learning model is not particularly limited and may be any existing learning algorithm. For example, the learning algorithm may be a piecewise linear model using random forest, support vector machine, naive Bayes, or FAB inference (Factorized Asymptotic Bayesian Inference), or a neural network. Note that a piecewise linear model technique using FAB inference is disclosed, for example, in U.S. Patent Publication US 2014/0222741 A1.

［ハードウェア構成］
図２は、モデル生成装置１００のハードウェア構成を示すブロック図である。図示のように、モデル生成装置１００は、インタフェース（Ｉ／Ｆ）１１１と、プロセッサ１１２と、メモリ１１３と、記録媒体１１４と、データベース（ＤＢ）１１５と、を備える。 [Hardware configuration]
2 is a block diagram showing the hardware configuration of the model generation device 100. As shown in the figure, the model generation device 100 includes an interface (I/F) 111, a processor 112, a memory 113, a recording medium 114, and a database (DB) 115.

Ｉ／Ｆ１１１は、外部装置との間でデータの入出力を行う。具体的に、モデルの生成に使用する訓練データ、評価用データ、及び、ユーザが入力装置３を用いて入力した指示や入力は、Ｉ／Ｆ１１１を通じてモデル生成装置１００に入力される。また、モデル生成装置１００が生成したモデルの評価情報は、Ｉ／Ｆ１１１を通じて表示装置２へ出力される。 I/F 111 inputs and outputs data to and from external devices. Specifically, training data and evaluation data used to generate the model, as well as instructions and input entered by the user using input device 3, are input to model generation device 100 via I/F 111. In addition, evaluation information for the model generated by model generation device 100 is output to display device 2 via I/F 111.

プロセッサ１１２は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）などのコンピュータであり、予め用意されたプログラムを実行することによりモデル生成装置１００の全体を制御する。なお、プロセッサ１１２は、ＧＰＵ（ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）、ＴＰＵ（ＴｅｎｓｏｒＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）、量子プロセッサまたはＦＰＧＡ（Ｆｉｅｌｄ－ＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）であってもよい。プロセッサ１１２は、後述するモデル分析処理を実行する。 Processor 112 is a computer such as a CPU (Central Processing Unit) that controls the entire model generation device 100 by executing a pre-prepared program. Processor 112 may also be a GPU (Graphics Processing Unit), TPU (Tensor Processing Unit), quantum processor, or FPGA (Field-Programmable Gate Array). Processor 112 executes the model analysis process described below.

メモリ１１３は、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）などにより構成される。メモリ１１３は、プロセッサ１１２による各種の処理の実行中に作業メモリとしても使用される。 Memory 113 is composed of ROM (Read Only Memory), RAM (Random Access Memory), etc. Memory 113 is also used as working memory while processor 112 is executing various processes.

記録媒体１１４は、ディスク状記録媒体、半導体メモリなどの不揮発性で非一時的な記録媒体であり、モデル生成装置１００に対して着脱可能に構成される。記録媒体１１４は、プロセッサ１１２が実行する各種のプログラムを記録している。モデル生成装置１００が各種の処理を実行する際には、記録媒体１１４に記録されているプログラムがメモリ１１３にロードされ、プロセッサ１１２により実行される。 The recording medium 114 is a non-volatile, non-transitory recording medium such as a disk-shaped recording medium or semiconductor memory, and is configured to be detachable from the model generation device 100. The recording medium 114 records various programs executed by the processor 112. When the model generation device 100 executes various processes, the programs recorded on the recording medium 114 are loaded into the memory 113 and executed by the processor 112.

ＤＢ１１５は、モデル生成装置１００が生成したモデル（以下、「既存モデル」と呼ぶ。）、及び、再訓練による修正後のモデル（以下、「修正後モデル」と呼ぶ。）に関する情報を記憶する。また、ＤＢ１１５は、必要に応じて、Ｉ／Ｆ１１１を通じて入力された訓練データ、評価用データ、ユーザが入力した修正情報などを記憶する。 DB115 stores information about the model generated by model generation device 100 (hereinafter referred to as the "existing model") and the model after correction through retraining (hereinafter referred to as the "corrected model"). DB115 also stores, as necessary, training data, evaluation data, correction information entered by the user, etc., input via I/F111.

（機能構成）
図３は、第１実施形態のモデル生成装置１００の機能構成を示すブロック図である。モデル生成装置１００は、機能的には、訓練データＤＢ１２１と、モデル訓練部１２２と、モデルＤＢ１２３と、評価用データＤＢ１２４と、分析部１２５と、を備える。 (Functional configuration)
3 is a block diagram showing the functional configuration of the model generation device 100 according to the first embodiment. Functionally, the model generation device 100 includes a training data DB 121, a model training unit 122, a model DB 123, an evaluation data DB 124, and an analysis unit 125.

訓練データＤＢ１２１は、モデルの生成に用いられる訓練データを記憶する。訓練データＤ１は、モデル訓練部１２２に入力される。なお、訓練データＤ１は、入力データと、その入力データに対する正解（正解値又は正解ラベル）との複数の組み合わせにより構成される。 The training data DB 121 stores training data used to generate a model. The training data D1 is input to the model training unit 122. The training data D1 is composed of multiple combinations of input data and correct answers (correct values or correct labels) for the input data.

モデル訓練部１２２は、訓練データＤ１を用いてモデルの訓練を行い、モデルを生成する。モデル訓練部１２２は、生成したモデルに対応するモデルデータＭをモデルＤＢ１２３及び分析部１２５へ出力する。なお、モデルデータＭは、モデルを構成する複数のパラメータ情報を含む。パラメータ情報は、例えば、モデルの入力として用いられる説明変数の情報、各説明変数に対する重みの情報、入力データを構成する各サンプルに対する重みの情報などを含む。 The model training unit 122 trains a model using the training data D1 and generates a model. The model training unit 122 outputs model data M corresponding to the generated model to the model DB 123 and the analysis unit 125. The model data M includes multiple parameter information constituting the model. The parameter information includes, for example, information on explanatory variables used as input to the model, information on weights for each explanatory variable, and information on weights for each sample constituting the input data.

また、モデル訓練部１２２は、既存モデルを再訓練して修正後モデルを生成する。この場合、モデル訓練部１２２は、ユーザが入力装置３を用いて入力した修正情報Ｄ３に基づいて、モデルを構成するパラメータを修正し、必要に応じて再訓練用の訓練データを用いてモデルの再訓練を行う。モデル訓練部１２２は、再訓練により得られた修正後モデルのモデルデータＭをモデルＤＢ１２３へ記憶するとともに、分析部１２５へ出力する。 The model training unit 122 also retrains the existing model to generate a modified model. In this case, the model training unit 122 modifies the parameters that make up the model based on the modification information D3 input by the user using the input device 3, and retrains the model using training data for retraining as necessary. The model training unit 122 stores the model data M of the modified model obtained by retraining in the model DB 123 and outputs it to the analysis unit 125.

評価用データＤＢ１２４は、生成されたモデルの評価に使用する評価用データを記憶する。評価用データは、例えば収集されたデータのうち、訓練データとして使用しなかった一部のデータや、新たに収集されたデータ、検証のために用意されたバリデーションデータなどである。なお、訓練データを評価用データとして使用してもよい。評価用データは、入力データと、その入力データに対する正解（正解値又は正解ラベル）との複数の組み合わせにより構成される。 The evaluation data DB 124 stores evaluation data used to evaluate the generated model. The evaluation data may be, for example, a portion of the collected data that was not used as training data, newly collected data, or validation data prepared for verification. Training data may also be used as evaluation data. The evaluation data consists of multiple combinations of input data and correct answers (correct values or correct labels) for the input data.

評価用データは、データセットの単位で記憶され、使用される。データセットに含まれる各データは、複数のカテゴリを含む。例えば、与信を判定する予測モデルの場合、評価用データとして多数の判定対象者のデータが用意され、各データは、例えば年齢層、性別、居住地域、家族構成、所得などのカテゴリを含む。また、各カテゴリは、複数の属性（グループ）を含む。 Evaluation data is stored and used in units of datasets. Each piece of data contained in a dataset contains multiple categories. For example, in the case of a predictive model for determining creditworthiness, data on a large number of individuals to be evaluated is prepared as evaluation data, and each piece of data contains categories such as age group, gender, residential area, family structure, and income. Furthermore, each category contains multiple attributes (groups).

分析部１２５は、評価用データを用いて、データセットの各カテゴリに対応する属性毎に、モデルの性能を分析する。例えば、あるカテゴリ「性別」が属性「男性」と「女性」を含むとする。この場合、分析部１２５は、属性「男性」及び「女性」のそれぞれについてモデルの性能を示す性能値を算出する。分析部１２５は、他のカテゴリについても、属性毎にモデルの性能値を算出する。なお、モデルの性能指標としては、例えば、モデルの予測精度、互換性など、モデルの性能を評価するために使用できる各種の指標を用いることができる。モデルの性能指標の他の例として、Ｆ１スコア、適合率、再現率を用いても良く、回帰タスクの場合には決定係数等を用いても良い。また、値が低いほど良いとされる誤差系の指標として、例えば平均二乗誤差や交差エントロピー等を用いても良い。そして、分析部１２５は、複数のカテゴリについて、属性毎にモデルの性能値を示す情報（以下、「属性別性能情報」とも呼ぶ。）を生成し、評価情報Ｄ２として表示装置２へ出力する。分析部１２５は、モデル取得手段、データセット取得手段、性能算出手段及び出力手段の一例である。The analysis unit 125 uses the evaluation data to analyze the performance of the model for each attribute corresponding to each category in the dataset. For example, suppose a category called "gender" includes the attributes "male" and "female." In this case, the analysis unit 125 calculates a performance value indicating the model's performance for each of the attributes "male" and "female." The analysis unit 125 also calculates model performance values for each attribute for other categories. Note that various indices that can be used to evaluate model performance, such as the model's predictive accuracy and compatibility, can be used as model performance indicators. Other examples of model performance indicators include the F1 score, precision, and recall, and in the case of regression tasks, the coefficient of determination, etc. Furthermore, error-based indices, in which the lower the value, the better, such as mean squared error and cross entropy, can also be used. The analysis unit 125 then generates information indicating the model's performance value for each attribute for multiple categories (hereinafter referred to as "attribute-specific performance information") and outputs this information to the display device 2 as evaluation information D2. The analysis unit 125 is an example of a model acquisition means, a data set acquisition means, a performance calculation means, and an output means.

表示装置２は、分析部１２５が出力した評価情報Ｄ２を表示装置２に表示する。ユーザは、表示された属性別性能情報を見ることにより、各カテゴリの属性毎のモデルの性能の差異を知ることができる。 The display device 2 displays the evaluation information D2 output by the analysis unit 125 on the display device 2. By looking at the displayed attribute-specific performance information, the user can learn the differences in model performance for each attribute in each category.

また、ユーザは、必要に応じて、現行モデルを修正するための修正情報Ｄ３を入力装置３に入力する。修正情報Ｄ３は、例えば、モデルの入力として用いられる説明変数の情報、各説明変数に対する重みの情報、入力データを構成する各サンプルに対する重みの情報などの修正に関する情報である。モデル訓練部１２２は、入力された修正情報Ｄ３を用いてモデルの再訓練を行うことにより、モデルの修正を行う。 In addition, the user inputs correction information D3 into the input device 3 as needed to correct the current model. The correction information D3 is information related to the correction, such as information on explanatory variables used as input to the model, information on weights for each explanatory variable, and information on weights for each sample that makes up the input data. The model training unit 122 corrects the model by retraining the model using the input correction information D3.

［属性別性能情報の表示例］
次に、属性別性能情報の表示例について説明する。
（第１の表示例）
（１）初期表示例
図４（Ａ）は、属性別性能情報の第１の表示例の初期表示例を示す。初期表示例４０では、予測に用いられるデータセット全体について、属性毎にモデルの性能値が表示される。なお、以下の例では、モデルの性能値としてモデルの精度を用いる。 [Example of attribute-specific performance information display]
Next, an example of displaying attribute-specific performance information will be described.
(First display example)
(1) Initial Display Example Fig. 4(A) shows an initial display example of the first display example of attribute-specific performance information. In the initial display example 40, model performance values are displayed for each attribute for the entire data set used for prediction. In the following example, model accuracy is used as the model performance value.

具体的に、初期表示例４０は、全体性能値欄４１と、カテゴリ欄４２と、性能値欄４３とを含む。全体性能値欄４１は、対象となるデータセット全体に対するモデルの性能値を示す。本例では、データセット全体に対するモデルの性能値として、「全体精度８５％」が示されている。カテゴリ欄４２は、モデルの性能値の表示対象となるカテゴリを示す。本例では、４つのカテゴリ「性別」、「年齢」、「居住地」及び「年収」についてモデルの性能値が表示されている。 Specifically, the initial display example 40 includes an overall performance value column 41, a category column 42, and a performance value column 43. The overall performance value column 41 shows the performance value of the model for the entire target dataset. In this example, "overall accuracy 85%" is shown as the performance value of the model for the entire dataset. The category column 42 shows the category for which the model performance value is displayed. In this example, the model performance values are displayed for the four categories "gender," "age," "place of residence," and "annual income."

性能値欄４３は、各カテゴリについて、各属性に対応する枠（以下、「属性枠」と呼ぶ。）により、属性毎にモデルの性能値を示している。例えば、カテゴリ「性別」については、属性「男性」に対する性能値が９５％、属性「女性」に対する性能値が６０％となっている。なお、性能値欄４３内の各属性名のかっこ内には、その属性の割合が示されている。例えば、図４（Ａ）の例では、カテゴリ「性別」について、データセット全体において男性の割合が６０％、女性の割合が４０％となっている。また、各属性枠の大きさ（横方向の長さ）は、各属性の割合に対応している。例えば、カテゴリ「性別」について、「男性」の属性枠と「女性」の属性枠の長さは３：２（６０：４０）になっている。 The performance value column 43 shows the model performance value for each attribute for each category, using a frame corresponding to each attribute (hereinafter referred to as an "attribute frame"). For example, for the category "gender," the performance value for the attribute "male" is 95%, and the performance value for the attribute "female" is 60%. Note that the percentage of each attribute is shown in parentheses next to each attribute name in the performance value column 43. For example, in the example of Figure 4 (A), for the category "gender," the percentage of males in the entire dataset is 60%, and the percentage of females is 40%. Furthermore, the size (horizontal length) of each attribute frame corresponds to the percentage of each attribute. For example, for the category "gender," the length of the attribute frame for "male" and the attribute frame for "female" is 3:2 (60:40).

カテゴリ「年齢」については、属性「子供」に対する性能値が３５％、属性「大人」に対する性能値が９４％、属性「高齢者」に対する性能値が９５％となっている。なお、カテゴリ「年齢」においては、属性「子供」の割合が２０％、属性「大人」の割合が６０％、属性「高齢者」の割合が２０％である。 For the category "age," the performance value for the attribute "children" is 35%, the performance value for the attribute "adults" is 94%, and the performance value for the attribute "elderly" is 95%. Furthermore, within the category "age," the proportion of attributes "children" is 20%, the proportion of attributes "adults" is 60%, and the proportion of attributes "elderly" is 20%.

カテゴリ「居住地」については、属性「関東」に対する性能値が８４％、属性「関西」に対する性能値が８６％となっている。なお、カテゴリ「居住地」においては、属性「関東」の割合が５５％、属性「関西」の割合が４５％である。 For the category "Residence," the performance value for the attribute "Kanto" is 84% and the performance value for the attribute "Kansai" is 86%. Furthermore, within the category "Residence," the proportion for the attribute "Kanto" is 55%, and the proportion for the attribute "Kansai" is 45%.

カテゴリ「年収」については、属性「４００万未満」に対する性能値が８３％、属性「４００万以上」に対する性能値が８７％となっている。なお、カテゴリ「年収」においては、属性「４００万未満」の割合が４８％、属性「４００万以上」の割合が５２％である。 For the category "Annual Income," the performance value for the attribute "Less than 4 million yen" is 83%, and the performance value for the attribute "4 million yen or more" is 87%. Furthermore, within the category "Annual Income," the proportion of those with the attribute "Less than 4 million yen" is 48%, and the proportion of those with the attribute "4 million yen or more" is 52%.

カテゴリ欄４２と性能値欄４３は、モデルの性能の公平性に基づいて塗分けられる。なお、本明細書では、図示の便宜上、色分けの代わりにハッチングによる区別を用いる。同一カテゴリ内の異なる属性間でモデルの性能値の差が小さいことを「モデルの性能が公平である」と言い、モデルの性能値の差が大きいことを「モデルの性能が不公平である」と言う。カテゴリ欄４２と性能値欄４３は、モデルの性能が公平であるか不公平であるかに応じて、ハッチングの種類により塗分けられている。 The category column 42 and performance value column 43 are color-coded based on the fairness of the model's performance. Note that in this specification, for ease of illustration, hatching is used instead of color coding. A small difference in model performance values between different attributes in the same category is said to be "fair model performance," and a large difference in model performance values is said to be "unfair model performance." The category column 42 and performance value column 43 are color-coded using different types of hatching depending on whether the model's performance is fair or unfair.

いま、モデルの性能を不公平と判定する際の閾値を「５％」と仮定し、精度の差が５％以上の場合にモデルの性能が不公平と判定するものとする。カテゴリ「性別」については、属性「男性」の精度が９５％、属性「女性」の精度が６０％であり、その差が閾値「５％」以上であるため、モデルの性能が不公平と判定される。よって、図４（Ａ）の凡例に示すように、性能値が高い「男性」の属性枠はパターンＰ１のハッチングで示され、性能値が低い「女性」の属性枠はパターンＰ２のハッチングで示されている。なお、実際の表示においては、例えば色分けにより、性能値が低い方の属性枠を「赤色」、性能値が高い方の属性枠を「黄色」などとして、性能値が低い方の属性枠をより目立つように表示することが好ましい。また、カテゴリ欄４２は、性能値が低い方の属性枠と同様にパターンＰ２のハッチングで示す。これにより、ユーザはカテゴリ欄４２を見ただけでも、そのカテゴリについてモデルの性能が不公平であることを容易に認識できる。Assume that the threshold for determining model performance as unfair is 5%, and that a difference in accuracy of 5% or greater is considered unfair. For the category "gender," the accuracy of the attribute "male" is 95%, while the accuracy of the attribute "female" is 60%. Since the difference is equal to or greater than the threshold of 5%, the model's performance is considered unfair. Therefore, as shown in the legend in Figure 4(A), the attribute frame for "male," with a high performance value, is indicated by hatching in pattern P1, and the attribute frame for "female," with a low performance value, is indicated by hatching in pattern P2. In actual display, it is preferable to display the attribute frame with the lower performance value more prominently, for example, by color-coding the attribute frame with the lower performance value as "red" and the attribute frame with the higher performance value as "yellow." Furthermore, the category column 42 is indicated by hatching in pattern P2, just like the attribute frame with the lower performance value. This allows the user to easily recognize that the model's performance for that category is unfair simply by looking at the category column 42.

カテゴリ「年齢」についても、属性「大人」及び「高齢者」の性能値に比べて、属性「子供」の性能値が上記の閾値以上低いので、モデルの性能が不公平であると判定される。その結果、カテゴリ欄４２と「子供」の属性枠とはパターンＰ２のハッチングで示され、「大人」と「高齢者」の属性枠はパターンＰ１のハッチングで示されている。 For the category "Age," the performance value of the attribute "Children" is lower than the performance values of the attributes "Adults" and "Elderly" by more than the threshold value, so the model's performance is determined to be unfair. As a result, the category column 42 and the attribute frame for "Children" are indicated by hatching in pattern P2, and the attribute frames for "Adults" and "Elderly" are indicated by hatching in pattern P1.

一方、カテゴリ「居住地」については、属性「関東」の性能値と属性「関西」の性能値の差が上記の閾値以下であるので、モデルの性能は公平であると判定される。よって、カテゴリ欄４２及び性能値欄４３は、いずれもモデルの性能が公平であることを示すパターンＰ３のハッチングで示されている。 On the other hand, for the category "Residence," the difference between the performance value for the attribute "Kanto" and the performance value for the attribute "Kansai" is below the threshold, so the model's performance is determined to be fair. Therefore, both the category column 42 and the performance value column 43 are shown with hatching in pattern P3, which indicates that the model's performance is fair.

カテゴリ「年収」についても、属性「４００万未満」の性能値と属性「４００万以上」の性能値の差が上記の閾値以下であるので、モデルの性能は公平であると判定される。よって、カテゴリ欄４２及び性能値欄４３は、いずれもモデルの性能が公平であることを示すパターンＰ３のハッチングで示されている。 For the category "Annual Income," the difference between the performance value for the attribute "Less than 4 million yen" and the performance value for the attribute "4 million yen or more" is below the threshold, so the model's performance is determined to be fair. Therefore, both the category column 42 and the performance value column 43 are shown with hatching in pattern P3, which indicates that the model's performance is fair.

以上のように、図４（Ａ）に示す属性別性能情報の表示によれば、各カテゴリについて、属性毎にモデルの性能値が表示される。よって、ユーザは、同じモデルであっても、予測に用いるデータセットに含まれるカテゴリの属性によって性能値が異なることを知ることができる。また、各カテゴリについて、属性毎の性能値に基づいてモデルの性能が公平であるか否かが判定され、モデルの性能が公平であるカテゴリと不公平であるカテゴリとが区別して表示されるので、ユーザは、カテゴリ毎にモデルの公平性を一目で知ることができる。 As described above, the display of attribute-specific performance information shown in Figure 4(A) displays model performance values for each attribute for each category. This allows the user to know that even for the same model, performance values differ depending on the attributes of the categories included in the dataset used for prediction. Furthermore, for each category, the fairness of the model's performance is determined based on the performance values for each attribute, and categories where the model's performance is fair and categories where it is unfair are displayed separately, allowing the user to see at a glance the fairness of the model for each category.

（２）属性選択時の表示例
図４（Ｂ）は、ユーザが属性を選択した際の属性別性能情報の表示例を示す。図４（Ａ）に示す初期表示例４０において、ユーザは、例えばカーソルＣなどを用いて、各カテゴリに対応する任意の属性を選択することができる。図４（Ｂ）に示す属性選択時の表示例４０ｘでは、ユーザは、カテゴリ「性別」について属性「男性」を選択し、カテゴリ「年齢」について属性「大人」を選択している。 (2) Display Example When Attribute is Selected Fig. 4(B) shows an example of how attribute-specific performance information is displayed when the user selects an attribute. In the initial display example 40 shown in Fig. 4(A), the user can select any attribute corresponding to each category using, for example, cursor C. In the display example 40x when an attribute is selected shown in Fig. 4(B), the user selects the attribute "male" for the category "gender" and the attribute "adult" for the category "age."

属性が選択されると、分析部１２５は、データセットのうち、選択された属性に該当するデータを抽出して部分データセットを作成し、その部分データセットについて属性別性能情報を生成して表示する。 When an attribute is selected, the analysis unit 125 extracts data from the dataset that corresponds to the selected attribute to create a partial dataset, and generates and displays attribute-specific performance information for the partial dataset.

具体的に、図４（Ｂ）において、全体性能値欄４１には、ユーザにより選択された属性、即ち、「男性」かつ「大人」に対する性能値が表示される。ユーザによる選択の対象となった「性別」及び「年齢」について、カテゴリ欄４２は白色（ハッチング無し）で表示される。また、ユーザにより選択された「性別」及び「年齢」の属性枠は、選択されたことを示す黒色で示され、それ以外の属性枠は、選択されていないことを示す白色で表示されている。さらに、各属性枠には、各属性の全体に対する割合のみが表示される。 Specifically, in Figure 4 (B), the overall performance value column 41 displays the performance values for the attributes selected by the user, i.e., "Male" and "Adult." The category column 42 displays the "Gender" and "Age" selected by the user in white (no hatching). The attribute boxes for "Gender" and "Age" selected by the user are displayed in black, indicating that they have been selected, while the other attribute boxes are displayed in white, indicating that they have not been selected. Furthermore, each attribute box displays only the percentage of each attribute relative to the total.

一方、ユーザによる選択の対象となっていないカテゴリ「居住地」及び「年収」については、図４（Ａ）の例と同様に、そのカテゴリについてモデルの性能が公平であるか否かを示すパターンのハッチングがカテゴリ欄４２及び性能値欄４３に表示される。具体的に、カテゴリ「居住地」については、モデルの性能が公平であるため、カテゴリ欄４２及び性能値欄４３がいずれもパターンＰ３のハッチングで表示されている。一方、カテゴリ「年収」については、属性「４００万未満」の性能値と属性「４００万以上」の性能値の差が上記の閾値５％以上であり、モデルの性能が不公平であるため、「４００万以上」の属性枠がパターンＰ１のハッチングで表示され、カテゴリ欄４２及び「４００万未満」の属性枠がパターンＰ２のハッチングで表示されている。 On the other hand, for the categories "Place of Residence" and "Annual Income," which are not subject to selection by the user, hatching patterns indicating whether the model's performance for that category is fair or not are displayed in the category column 42 and performance value column 43, similar to the example in Figure 4(A). Specifically, for the category "Place of Residence," the model's performance is fair, so both the category column 42 and performance value column 43 are displayed with hatching pattern P3. On the other hand, for the category "Annual Income," the difference between the performance value for the attribute "Less than 4 million yen" and the performance value for the attribute "4 million yen or more" is equal to or greater than the above-mentioned threshold of 5%, and the model's performance is unfair, so the attribute box for "4 million yen or more" is displayed with hatching pattern P1, and the category column 42 and the attribute box for "Less than 4 million yen" are displayed with hatching pattern P2.

このように、本実施形態では、属性別性能情報の表示画面において、ユーザが任意の属性を選択することにより、その属性に該当するデータについて、各カテゴリの属性毎の性能値を表示させることができる。ユーザは、属性を選択して属性別性能情報を表示することにより、初期表示例ではわからなかったモデル性能の不公平を見つけることができる。例えば、データセット全体に対する属性別性能情報を表示している図４（Ａ）の例では、カテゴリ「年収」についてモデルの性能は公平であると判定されている。しかし、図４（Ｂ）に示すように、データを「男性」かつ「大人」に限定すると、ユーザは、カテゴリ「年収」においてモデルの性能が不公平であることに気づくことができる。よって、ユーザは、任意の属性を選択して属性別性能情報を表示させることにより、様々な条件（即ち、属性の組み合わせ）についてモデルの公平性が担保できているかを確認することができる。In this embodiment, a user can select any attribute on the attribute-specific performance information display screen to display performance values for each attribute in each category for data corresponding to that attribute. By selecting an attribute and displaying attribute-specific performance information, the user can identify unfairness in model performance that was not apparent in the initial display example. For example, in the example shown in Figure 4(A), which displays attribute-specific performance information for the entire data set, the model's performance is determined to be fair for the "annual income" category. However, as shown in Figure 4(B), by limiting the data to "male" and "adult," the user can notice that the model's performance is unfair in the "annual income" category. Therefore, by selecting any attribute and displaying attribute-specific performance information, the user can confirm whether the model's fairness is guaranteed under various conditions (i.e., combinations of attributes).

（第２の表示例）
図５（Ａ）は、属性別性能情報の第２の表示例を示す。上述の第1の表示例は、１つのモデル及び１つのデータセットについて、各カテゴリの属性毎にモデルの性能を表示している。これに対し、第２の表示例は、複数のモデル及び１つのデータセットについて、各カテゴリの属性毎にモデルの性能を表示する。 (Second display example)
5A shows a second display example of attribute-specific performance information. The first display example above displays model performance for each attribute category for one model and one data set. In contrast, the second display example displays model performance for each attribute category for multiple models and one data set.

図示のように、第２の表示例５０は、モデル欄５１と、全体性能値欄５２と、カテゴリ欄５３と、属性欄５４と、性能値欄５５とを有する。モデル欄５１は、複数のモデルを示す。本例では、５つのモデルｈ１～ｈ５について、各カテゴリの属性毎にモデルの性能が示されている。全体性能値欄５２は、対象のデータセットに対するモデルの性能値を示す。本例では、対象のデータセットに対するモデルの精度がモデル毎に示されている。 As shown, the second display example 50 has a model column 51, an overall performance value column 52, a category column 53, an attribute column 54, and a performance value column 55. The model column 51 shows multiple models. In this example, for five models h1 to h5, the model performance is shown for each attribute in each category. The overall performance value column 52 shows the performance value of the model for the target dataset. In this example, the accuracy of the model for the target dataset is shown for each model.

カテゴリ欄５３は、モデルの性能値の表示対象となるカテゴリを示す。本例では、３つのカテゴリ「性別」、「年齢」、及び、「居住地」についてモデルの性能値が表示されている。属性欄５４は、各カテゴリに対応する属性を示す。性能値欄５５は、各カテゴリの属性毎に属性枠が設けられ、属性枠内にモデルの性能値が表示されている。また、各属性の属性枠は、第１の表示例と同様に、モデルの公平性を示すパターンＰ１～Ｐ３のいずれかのハッチングが付されている。 The category column 53 indicates the category for which the model's performance value is displayed. In this example, model performance values are displayed for three categories: "Gender," "Age," and "Place of Residence." The attribute column 54 indicates the attributes corresponding to each category. The performance value column 55 has an attribute frame for each attribute in each category, and the model's performance value is displayed within the attribute frame. Furthermore, as with the first display example, the attribute frame for each attribute is hatched with one of patterns P1 to P3, which indicates the fairness of the model.

第２の表示例５０によれば、カテゴリ毎のモデルの性能の公平性を、複数のモデルについて同時に表示することができる。よって、ユーザは、複数のモデルについてモデルの性能の公平性を比較検討することができる。例えば、時系列で更新された複数のモデルを対象とすることにより、モデルの性能の公平性という観点でモデルの性能の遷移を評価することができる。 According to the second display example 50, the fairness of model performance for each category can be displayed simultaneously for multiple models. This allows the user to compare and consider the fairness of model performance for multiple models. For example, by targeting multiple models updated over time, the transition in model performance can be evaluated from the perspective of the fairness of model performance.

なお、第２の表示例でも、ユーザが任意の属性を選択することにより、選択された属性に該当するデータセットに対する属性別性能情報が表示されるようにしてもよい。その場合、ユーザは、表示例５０とは別のウィンドウなどにおいて、任意の属性を選択すればよい。また、ユーザによる属性欄５４のクリックを受け付けるようにして、ユーザが任意の属性を選択できるようにしてもよい。 In the second display example, the user may also select an attribute, which will display attribute-specific performance information for the data set corresponding to the selected attribute. In this case, the user may select the attribute in a window separate from display example 50. Alternatively, the user may be able to select an attribute by clicking on attribute column 54.

（第３の表示例）
図５（Ｂ）は、属性別性能情報の第３の表示例を示す。上述の第1の表示例は、１つのモデル及び１つのデータセットについて、各カテゴリの属性毎にモデルの性能を表示している。これに対し、第３の表示例は、１つのモデル及び複数のデータセットについて、各カテゴリの属性毎にモデルの性能を表示する。 (Third display example)
5B shows a third display example of attribute-specific performance information. The first display example above displays model performance for each attribute category for one model and one data set. In contrast, the third display example displays model performance for each attribute category for one model and multiple data sets.

図示のように、第３の表示例６０は、データセット欄６１と、全体性能値欄６２と、カテゴリ欄６３と、属性欄６４と、性能値欄６５とを有する。データセット欄６１は、複数のデータセットを示す。本例では、５つのデータセットＤＳ１～ＤＳ５について、各カテゴリの属性毎にモデルの性能が示されている。全体性能値欄６２は、各データセットに対するモデルの性能値を示す。本例では、５つのデータセットＤＳ１～ＤＳ５のそれぞれに対するモデルの精度が示されている。カテゴリ欄６３、属性欄６４及び性能値欄６５は、基本的に第２の表示例５０と同様である。 As shown, the third display example 60 has a dataset column 61, an overall performance value column 62, a category column 63, an attribute column 64, and a performance value column 65. The dataset column 61 shows multiple datasets. In this example, the model performance is shown for each attribute in each category for five datasets DS1 to DS5. The overall performance value column 62 shows the performance value of the model for each dataset. In this example, the accuracy of the model for each of the five datasets DS1 to DS5 is shown. The category column 63, attribute column 64, and performance value column 65 are basically the same as those in the second display example 50.

第３の表示例５０によれば、カテゴリ毎のモデルの性能の公平性を、複数のデータセットについて同時に表示することができる。よって、ユーザは、複数のデータセットについて、モデルの性能の公平性を比較検討することができる。 According to the third display example 50, the fairness of model performance for each category can be displayed simultaneously for multiple datasets. This allows the user to compare and consider the fairness of model performance for multiple datasets.

なお、第３の表示例でも、ユーザが任意の属性を選択することにより、選択された属性に該当するデータセットに対する属性別性能情報が表示されるようにしてもよい。その場合、ユーザは、表示例６０とは別のウィンドウなどにおいて、任意の属性を選択すればよい。また、ユーザによる属性欄６４のクリックを受け付けるようにして、ユーザが任意の属性を選択できるようにしてもよい。 In the third display example, the user may also select an attribute, which will display attribute-specific performance information for the data set corresponding to the selected attribute. In this case, the user may select the attribute in a window separate from display example 60. Alternatively, the user may be able to select an attribute by clicking on attribute column 64.

［モデル分析処理］
次に、モデル生成装置１００によるモデル分析処理について説明する。図６は、モデル生成装置１００によるモデル分析処理のフローチャートである。モデル分析処理は、上記のように、各カテゴリの属性毎にモデルの性能を算出し、属性別性能情報を表示する処理である。この処理は、図２に示すプロセッサ１１２が予め用意されたプログラムを実行し、主として図３に示す分析部１２５として動作することにより実現される。 [Model analysis processing]
Next, the model analysis process performed by the model generating device 100 will be described. FIG. 6 is a flowchart of the model analysis process performed by the model generating device 100. As described above, the model analysis process is a process of calculating the performance of a model for each attribute in each category and displaying attribute-specific performance information. This process is realized by the processor 112 shown in FIG. 2 executing a program prepared in advance and operating mainly as the analysis unit 125 shown in FIG. 3.

まず、分析部１２５は、対象となるデータセットを取得する（ステップＳ１０）。次に、分析部１２５は、カテゴリ毎にデータセットを分割して部分データセットを作成し、モデルを用いて予測を行うことにより、各カテゴリについて属性毎にモデルの性能値を算出する（ステップＳ１１）。そして、分析部１２５は、図４、図５などに例示するように、属性別性能情報を表示装置２に表示する（ステップＳ１２）。First, the analysis unit 125 acquires the target dataset (step S10). Next, the analysis unit 125 divides the dataset by category to create partial datasets, and calculates the model's performance value for each attribute in each category by making predictions using the model (step S11). Then, the analysis unit 125 displays attribute-specific performance information on the display device 2 (step S12), as illustrated in Figures 4 and 5.

次に、分析部１２５は、入力装置３を用いてユーザが属性を選択したか否かを判定する（ステップＳ１３）。属性が選択された場合（ステップＳ１３：Ｙｅｓ）、分析部１２５は、選択された属性に対応するデータセットを取得する（ステップＳ１４）。そして、分析部１２５は、選択された属性に対応するデータセットについて、ステップＳ１１～Ｓ１２を繰り返す。こうして、ユーザが属性を選択するたびに、選択された属性に対応するデータセットについて属性別性能情報が生成され、表示される。 Next, the analysis unit 125 determines whether the user has selected an attribute using the input device 3 (step S13). If an attribute has been selected (step S13: Yes), the analysis unit 125 acquires a dataset corresponding to the selected attribute (step S14). The analysis unit 125 then repeats steps S11 to S12 for the dataset corresponding to the selected attribute. In this way, each time the user selects an attribute, attribute-specific performance information is generated and displayed for the dataset corresponding to the selected attribute.

一方、属性が選択されていない場合（ステップＳ１３：Ｎｏ）、分析部１２５は、ユーザにより終了指示が入力されたか否かを判定する（ステップＳ１５）。終了指示が入力されていない場合（ステップＳ１５：Ｎｏ）、処理はステップＳ１３へ戻る。一方、終了指示が入力された場合（ステップＳ１５：Ｙｅｓ）、処理は終了する。 On the other hand, if no attribute has been selected (step S13: No), the analysis unit 125 determines whether the user has input an end instruction (step S15). If an end instruction has not been input (step S15: No), the process returns to step S13. On the other hand, if an end instruction has been input (step S15: Yes), the process ends.

ステップＳ１１で属性毎にモデルの性能値を算出する際、データセットに含まれるデータが数値データなどの連続変数である場合、分析部１２５は連続変数をカテゴリに変換することにより属性を生成する。具体的に、分析部１２５は、年齢、年収などの連続変数を、順序付きカテゴリに変換する。例えば、分析部１２５は、連続変数の取る最小値と最大値の間を２つ以上の属性（グループ）に等分割し、カテゴリに変換する。 When calculating the model performance value for each attribute in step S11, if the data included in the dataset is a continuous variable such as numerical data, the analysis unit 125 generates attributes by converting the continuous variables into categories. Specifically, the analysis unit 125 converts continuous variables such as age and annual income into ordered categories. For example, the analysis unit 125 divides the range between the minimum and maximum values of the continuous variable equally into two or more attributes (groups) and converts them into categories.

なお、分析部１２５は、業務知識に基づいて分割点を用意できる場合は、その分割点に応じて、連続変数をカテゴリに変換してもよい。例えば、年齢の例では、分析部１２５は、業務知識に基づき、「子供」、「大人」、「高齢者」というように３つの属性を作成している。また、分析部１２５は、各属性のデータサンプル数が同じになるように、パーセンタイルに応じて分割点を決定し、連続変数をカテゴリに変換してもよい。また、分析部１２５は、データセットに含まれるデータ（変数）のクラスタリングにより複数の属性を作成してもよい。なお、データセットに含まれる変数が例えば、性別、居住地など、もともとカテゴリである場合は、上記の処理は不要である。 Note that, if a dividing point can be prepared based on business knowledge, the analysis unit 125 may convert a continuous variable into a category according to that dividing point. For example, in the case of age, the analysis unit 125 creates three attributes, such as "child," "adult," and "elderly," based on business knowledge. The analysis unit 125 may also determine a dividing point according to percentiles so that the number of data samples for each attribute is the same, and convert the continuous variable into a category. The analysis unit 125 may also create multiple attributes by clustering the data (variables) included in the dataset. Note that if the variables included in the dataset are originally categorical, such as gender or place of residence, the above processing is not necessary.

［変形例］
（変形例１）
図４（Ｂ）の例では、ユーザが任意のカテゴリの属性を選択すると、分析部１２５は、選択された属性に該当するデータセットについて属性別性能情報を生成し、表示している。その代わりに、分析部１２５は、モデルの性能が不公平となる属性の組み合わせを自動的に計算し、ユーザに提示してもよい。具体的には、まず、前述のように、モデルの性能を不公平と判定するための性能値の差の閾値を予め決定しておく。分析部１２５は、複数のカテゴリに対応する属性の全ての組み合わせについて属性別性能情報を計算し、モデルの性能が不公平となるカテゴリを抽出する。そして、分析部１２５は、抽出されたカテゴリと、そのカテゴリに対応する属性毎の性能値を含む情報を出力、表示すればよい。これによれば、ユーザが属性の選択を繰り返す必要なく、モデルの性能が不公平となる属性の組み合わせを抽出して表示させることが可能となる。 [Modification]
(Variation 1)
In the example of FIG. 4(B), when a user selects an attribute in an arbitrary category, the analysis unit 125 generates and displays attribute-specific performance information for a dataset corresponding to the selected attribute. Alternatively, the analysis unit 125 may automatically calculate attribute combinations that result in unfair model performance and present the information to the user. Specifically, as described above, a threshold value for the difference in performance values for determining that model performance is unfair is first determined in advance. The analysis unit 125 calculates attribute-specific performance information for all combinations of attributes corresponding to multiple categories and extracts categories in which model performance is unfair. The analysis unit 125 then outputs and displays information including the extracted categories and the performance values for each attribute corresponding to the categories. This makes it possible to extract and display attribute combinations that result in unfair model performance without the user having to repeatedly select attributes.

（変形例２）
上記の実施形態では、モデル生成装置１００をＰＣなどの独立した装置として構成しているが、その代わりに、モデル生成装置をサーバと端末装置により構成してもよい。図７は、サーバと端末装置を用いたモデル生成システム１ｘの概略構成を示すブロック図である。図７において、サーバ１００ｘは、図３に示すモデル生成装置１００の構成を備える。また、ユーザが使用する端末装置７の表示装置２ｘ及び入力装置３ｘを、図３に示す表示装置２及び入力装置３として使用する。 (Variation 2)
In the above embodiment, the model generation device 100 is configured as an independent device such as a PC, but instead, the model generation device may be configured by a server and a terminal device. FIG. 7 is a block diagram showing a schematic configuration of a model generation system 1x using a server and a terminal device. In FIG. 7, the server 100x has the configuration of the model generation device 100 shown in FIG. 3. Furthermore, the display device 2x and input device 3x of the terminal device 7 used by the user are used as the display device 2 and input device 3 shown in FIG. 3.

＜第２実施形態＞
図８は、第２実施形態のモデル分析装置７０の機能構成を示すブロック図である。モデル分析装置７０は、モデル取得手段７１と、データセット取得手段７２と、性能算出手段７３と、出力手段７４と、を備える。 Second Embodiment
8 is a block diagram showing the functional configuration of a model analysis device 70 according to the second embodiment. The model analysis device 70 includes a model acquisition unit 71, a data set acquisition unit 72, a performance calculation unit 73, and an output unit 74.

図９は、第２実施形態のモデル分析装置７０による処理のフローチャートである。モデル取得手段７１は、モデルを取得する（ステップＳ７１）。データセット取得手段７２は、データセットを取得する（ステップＳ７２）。性能算出手段７３は、データセットの各カテゴリに対応する属性毎に、モデルの性能を算出する（ステップＳ７３）。出力手段７４は、算出された属性毎のモデルの性能を示す性能情報を出力する（ステップＳ７４）。 Figure 9 is a flowchart of processing by the model analysis device 70 of the second embodiment. The model acquisition means 71 acquires a model (step S71). The dataset acquisition means 72 acquires a dataset (step S72). The performance calculation means 73 calculates the performance of the model for each attribute corresponding to each category of the dataset (step S73). The output means 74 outputs performance information indicating the calculated performance of the model for each attribute (step S74).

第２実施形態のモデル分析装置７０によれば、データセットのカテゴリの属性毎にモデルの性能を評価することが可能となる。 The model analysis device 70 of the second embodiment makes it possible to evaluate the performance of a model for each attribute of a dataset category.

上記の実施形態の一部又は全部は、以下の付記のようにも記載されうるが、以下には限られない。 Some or all of the above embodiments may also be described as, but are not limited to, the following notes:

（付記１）
モデルを取得するモデル取得手段と、
データセットを取得するデータセット取得手段と、
前記データセットの各カテゴリに対応する属性毎に、前記モデルの性能を算出する性能算出手段と、
算出された属性毎のモデルの性能を示す性能情報を出力する出力手段と、
を備えるモデル分析装置。 (Appendix 1)
a model acquisition means for acquiring a model;
a dataset acquisition means for acquiring a dataset;
a performance calculation means for calculating the performance of the model for each attribute corresponding to each category of the data set;
an output means for outputting performance information indicating the calculated performance of the model for each attribute;
A model analysis device comprising:

（付記２）
前記属性の選択を受け取る選択手段を備え、
前記データセット取得手段は、選択された属性に該当するデータセットを取得し、
前記性能算出手段は、前記属性の選択の対象となったカテゴリ以外のカテゴリについて、当該カテゴリに対応する属性毎に、前記モデルの性能を算出する付記１に記載のモデル分析装置。 (Appendix 2)
selection means for receiving a selection of the attribute;
the data set acquisition means acquires a data set corresponding to the selected attribute,
2. The model analysis device according to claim 1, wherein the performance calculation means calculates the performance of the model for each attribute corresponding to a category other than the category from which the attribute was selected.

（付記３）
前記性能情報は、前記属性毎のモデルの性能の差が所定値以上であるカテゴリと、前記所定値未満であるカテゴリとを区別して示す付記１又は２に記載のモデル分析装置。 (Appendix 3)
3. The model analysis device according to claim 1, wherein the performance information distinguishes between categories in which the difference in model performance for each attribute is equal to or greater than a predetermined value and categories in which the difference is less than the predetermined value.

（付記４）
前記性能情報は、カテゴリに対応する属性毎のデータ数の割合を示す情報を含む付記１乃至３のいずれか一項に記載のモデル分析装置。 (Appendix 4)
4. The model analysis device according to claim 1, wherein the performance information includes information indicating a ratio of the number of data items for each attribute corresponding to a category.

（付記５）
前記モデル取得手段は、複数のモデルを取得し、
前記性能情報は、前記複数のモデルについての性能情報を含む付記１乃至４のいずれか一項に記載のモデル分析装置。 (Appendix 5)
the model acquisition means acquires a plurality of models;
5. The model analysis apparatus according to claim 1, wherein the performance information includes performance information about the plurality of models.

（付記６）
前記データ取得手段は、複数のデータセットを取得し、
前記性能情報は、前記複数のデータセットについての性能情報を含む付記１乃至４のいずれか一項に記載のモデル分析装置。 (Appendix 6)
the data acquisition means acquires a plurality of data sets;
5. The model analysis apparatus according to claim 1, wherein the performance information includes performance information for the plurality of data sets.

（付記７）
前記性能算出手段は、前記データセットに含まれるデータのクラスタリングにより、前記カテゴリに対応する属性を決定する付記１乃至６のいずれか一項に記載のモデル分析装置。 (Appendix 7)
7. The model analysis apparatus according to claim 1, wherein the performance calculation means determines attributes corresponding to the categories by clustering data included in the dataset.

（付記８）
前記性能算出手段は、パーセンタイル分割により、各属性に含まれるデータサンプル数が等しくなるように、前記カテゴリに対応する属性を決定する付記１乃至６のいずれか一項に記載のモデル分析装置。 (Appendix 8)
7. The model analysis device according to claim 1, wherein the performance calculation means determines attributes corresponding to the categories by percentile division so that the number of data samples included in each attribute is equal.

（付記９）
モデルを取得し、
データセットを取得し、
前記データセットの各カテゴリに対応する属性毎に、前記モデルの性能を算出し、
算出された属性毎のモデルの性能を示す性能情報を出力するモデル分析方法。 (Appendix 9)
Get the model,
Get the dataset,
Calculating the performance of the model for each attribute corresponding to each category of the dataset;
A model analysis method that outputs performance information indicating the performance of a model for each calculated attribute.

（付記１０）
モデルを取得し、
データセットを取得し、
前記データセットの各カテゴリに対応する属性毎に、前記モデルの性能を算出し、
算出された属性毎のモデルの性能を示す性能情報を出力する処理をコンピュータに実行させるプログラムを記録した記録媒体。 (Appendix 10)
Get the model,
Get the dataset,
Calculating the performance of the model for each attribute corresponding to each category of the dataset;
A recording medium storing a program for causing a computer to execute a process for outputting performance information indicating the calculated performance of a model for each attribute.

以上、実施形態及び実施例を参照して本開示を説明したが、本開示は上記実施形態及び実施例に限定されるものではない。本開示の構成や詳細には、本開示のスコープ内で当業者が理解し得る様々な変更をすることができる。 The present disclosure has been described above with reference to embodiments and examples, but the present disclosure is not limited to the above embodiments and examples. Various modifications that would be understood by a person skilled in the art can be made to the configuration and details of the present disclosure within the scope of the present disclosure.

１、１ｘモデル生成システム
２、２ｘ表示装置
３、３ｘ入力装置
７端末装置
１００モデル生成装置
１１２プロセッサ
１２１訓練データＤＢ
１２２モデル訓練部
１２３モデルＤＢ
１２４評価用データＤＢ
１２５分析部 1, 1x Model generation system 2, 2x Display device 3, 3x Input device 7 Terminal device 100 Model generation device 112 Processor 121 Training data DB
122 Model Training Department 123 Model DB
124 Evaluation data DB
125 Analysis Department

Claims

a model acquisition means for acquiring a model;
selection means for receiving a selection of attributes;
a dataset acquisition means for acquiring a dataset corresponding to the selected attribute ;
a performance calculation means for calculating the performance of the model for each attribute corresponding to each category of the data set;
an output means for outputting performance information indicating the calculated performance of the model for each attribute;
Equipped with
The performance calculation means is a model analysis device that calculates the performance of the model for each attribute corresponding to a category other than the category for which the attribute was selected .

The model analysis device of claim 1, wherein the performance information distinguishes between categories where the difference in model performance for each attribute is equal to or greater than a predetermined value and categories where the difference is less than the predetermined value.

The model analysis device according to claim 1 , wherein the performance information includes information indicating a ratio of the number of data items for each attribute corresponding to a category.

the model acquisition means acquires a plurality of models;
The model analysis device according to claim 1 , wherein the performance information includes performance information about the plurality of models.

the data set acquisition means acquires a plurality of data sets;
The model analysis apparatus according to claim 1 , wherein the performance information includes performance information about the plurality of data sets.

The model analysis device according to claim 1 , wherein the performance calculation means determines attributes corresponding to the categories by clustering data included in the data set.

The model analysis device according to claim 1 , wherein the performance calculation means determines attributes corresponding to the categories by percentile division so that the number of data samples included in each attribute is equal.

1. A computer-implemented method for model analysis, comprising:
Get the model,
It takes a selection of attributes,
Retrieve the dataset corresponding to the selected attributes ,
Calculating the performance of the model for each attribute corresponding to each category of the dataset;
Outputting performance information showing the performance of the model for each calculated attribute;
A model analysis method in which the performance of the model is calculated for each attribute corresponding to a category other than the category for which the attribute was selected .

Get the model,
It takes a selection of attributes,
Retrieve the dataset corresponding to the selected attributes ,
Calculating the performance of the model for each attribute corresponding to each category of the dataset;
causing a computer to execute a process of outputting performance information indicating the calculated performance of the model for each attribute ;
The performance of the model is calculated for each attribute corresponding to a category other than the category for which the attribute was selected .