JP7731228B2

JP7731228B2 - Skin image analysis method

Info

Publication number: JP7731228B2
Application number: JP2021111192A
Authority: JP
Inventors: 顕西野
Original assignee: Kao Corp
Current assignee: Kao Corp
Priority date: 2020-11-13
Filing date: 2021-07-05
Publication date: 2025-08-29
Anticipated expiration: 2041-07-05
Also published as: JP2022078936A

Description

特許法第３０条第２項適用［公開の事実１］掲載年月日：令和３年１月２２日掲載アドレス：ｈｔｔｐｓ：／／ｗｗｗ．ｋａｏ．ｃｏｍ／ｊｐ／ｃｏｒｐｏｒａｔｅ／ｎｅｗｓ／ｒｄ／２０２１／２０２１０１２２－００１／［公開の事実２］開催日：令和３年１月２２日集会名：日本視覚学会２０２１年冬季大会Article 30, paragraph 2 of the Patent Act applies. [Public Fact 1] Date of publication: January 22, 2021. Posting address: https://www.kao.com/jp/corporate/news/rd/2021/20210122-001/ [Public Fact 2] Date of holding: January 22, 2021. Meeting name: Japan Vision Society 2021 Winter Meeting

本発明は、人の肌画像を機械学習により得られた判別モデルを用いて分析する肌画像分析技術に関する。 The present invention relates to skin image analysis technology that analyzes human skin images using a discriminant model obtained through machine learning.

下記特許文献１には、人物画像から顔領域を検出し、検出された顔領域内のシワが発生し易い領域の画像特徴量（勾配ヒストグラム）を抽出し、その特徴量と学習器（ＳＶＭ（Support Vector Machine））とを用いて人物属性（年齢や性別）の区分を特定する手法が開示されている。
下記特許文献２には、畳み込みニューラルネットワーク（ＣＮＮ）を使用してユーザの画像を解析させてユーザの年齢を予測すると共に、目標の肌年齢を達成するために肌の領域に塗布する製品を推奨する手法が開示されている。
下記特許文献３には、人間の顔を含むデジタル画像に対して顔の位置特定及び巨視的特徴（目、鼻、口等）のマスキングの処理を行った上で畳み込みニューラルネットワーク（ＣＮＮ）を使用してその画像を解析させて、見掛け肌年齢を判定及び表示する手法が開示されている。 The following Patent Document 1 discloses a method for detecting a face region from a person's image, extracting image features (gradient histograms) of areas within the detected face region where wrinkles are likely to occur, and identifying classifications of person attributes (age and gender) using the features and a learning device (SVM (Support Vector Machine)).
Patent Document 2 listed below discloses a method for predicting the age of a user by analyzing an image of the user using a convolutional neural network (CNN) and recommending a product to be applied to an area of the skin to achieve a target skin age.
Patent Document 3 listed below discloses a method for determining and displaying the apparent skin age by performing processes for identifying the position of a face and masking macroscopic features (eyes, nose, mouth, etc.) on a digital image containing a human face, and then analyzing the image using a convolutional neural network (CNN).

特開２０１３－５８０６０号公報JP 2013-58060 A 特表２０１９－５１２７９７号公報Special table 2019-512797 publication 特表２０２０－５２２８１０号公報Special Publication No. 2020-522810

上述の手法のように、ユーザの顔画像全体或いは評価したい部位の画像をニューラルネットワークや学習器に入力した場合には、入力された画像全体に関する肌の属性や肌年齢が予測されることになる。しかしながら、肌の状態は局所的に異なる場合がある。例えば、化粧料を塗布する場合に顔の全体に均一的に塗布するのは困難であり、例え均一に塗布できたとしても部分的に見え方が異なる場合がある。
このような観点において、上述の各手法は、主に評価対象となる肌領域の全体を評価しているに過ぎない点で改良の余地が残る。
本発明は、肌の状態又は属性の指標値を肌の小区画ごとに取得する肌画像分析技術を提供する。 As in the above-described method, when the entire face image of a user or an image of a region to be evaluated is input into a neural network or a learning device, the skin attributes and skin age for the entire input image are predicted. However, the condition of the skin may vary locally. For example, when applying cosmetics, it is difficult to apply them evenly to the entire face, and even if they are applied evenly, the appearance may vary in parts.
From this perspective, there is still room for improvement in each of the above-mentioned methods, in that they mainly only evaluate the entire skin region that is the subject of evaluation.
The present invention provides a skin image analysis technique that obtains skin condition or attribute indices for each skin patch.

本発明によれば、教師用肌画像の被写肌共通の属性又は状態を示す正解情報とその教師用肌画像から複数抽出される所定画像サイズの教師用肌パッチ画像群との複数の組合せを含む複数の教師データに基づいて機械学習されている学習済みの判別モデルを利用可能な一以上のプロセッサが、被評価者の肌が写る評価用肌画像を取得する工程と、前記取得された評価用肌画像から所定画像サイズの肌パッチ画像群を取得する工程と、前記取得された各肌パッチ画像に対してそれぞれ画素値の正規化を行う工程と、正規化された各肌パッチ画像をそれぞれ前記判別モデルに入力することで、各肌パッチ画像に関する肌属性又は肌状態の指標値をそれぞれ取得する工程とを実行する肌画像分析方法が提供され得る。
また、本発明によれば、所定画像サイズの肌パッチ画像の入力に応じてその肌パッチ画像の肌属性又は肌状態を判別する判別モデルの学習方法であって、教師用肌画像とその教師用肌画像の被写肌共通の属性又は状態を示す正解情報との組合せを複数取得する工程と、前記取得された教師用肌画像から肌領域が所定の割合以上含まれる所定画像サイズの教師用肌パッチ画像群を取得する工程と、前記取得された各教師用肌パッチ画像に対してそれぞれ画素値の正規化を行う工程と、各教師用肌パッチ画像が取得された元の前記教師用肌画像に対応する前記正解情報と、前記正規化された教師用肌パッチ画像群の各々とを関連付けた教師データ群を生成する工程と、前記教師データ群を用いて前記判別モデルを学習させる行程とを含む判別モデルの学習方法が提供され得る。
また、上述の一以上のプロセッサ及びメモリを少なくとも備える肌画像分析装置であって上述の肌画像分析方法を実行可能な肌画像分析装置、上述の判別モデルの学習方法を実行可能な判別モデル学習装置なども提供され得る。 According to the present invention, a skin image analysis method can be provided in which one or more processors that can use a trained discrimination model that has been machine-learned based on multiple training data including multiple combinations of correct answer information indicating common attributes or conditions of the subject skin in a training skin image and a group of training skin patch images of a predetermined image size extracted from the training skin image perform the following steps: acquiring an evaluation skin image showing the skin of a subject; acquiring a group of skin patch images of a predetermined image size from the acquired evaluation skin image; normalizing the pixel values of each of the acquired skin patch images; and inputting each normalized skin patch image into the discrimination model to acquire an index value for the skin attribute or skin condition for each skin patch image.
Furthermore, according to the present invention, there can be provided a method for training a discriminant model that discriminates the skin attributes or skin condition of an input skin patch image of a predetermined image size, the method including the steps of: acquiring a plurality of combinations of teacher skin images and correct answer information indicating common attributes or conditions of the subject skin of the teacher skin images; acquiring a group of teacher skin patch images of a predetermined image size from the acquired teacher skin images, each of which contains a skin area at a predetermined rate or more; normalizing the pixel values of each of the acquired teacher skin patch images; generating a group of teacher data that associates the correct answer information corresponding to the original teacher skin image from which each teacher skin patch image was acquired with each of the normalized teacher skin patch images; and training the discriminant model using the group of teacher data.
In addition, a skin image analysis device that is equipped with at least one or more of the above-mentioned processors and memories and is capable of executing the above-mentioned skin image analysis method, and a discriminant model learning device that is capable of executing the above-mentioned discriminant model learning method may also be provided.

上記態様によれば、肌の状態又は属性の指標値を肌の小区画ごとに取得する肌画像分析技術を提供することができる。 The above aspect provides a skin image analysis technology that acquires index values for skin condition or attributes for each small area of skin.

本実施形態に係る肌画像分析方法及び判別モデルの学習方法を実行可能な情報処理装置のハードウェア構成例を概念的に示す図である。FIG. 1 is a diagram conceptually illustrating an example of the hardware configuration of an information processing device capable of executing a skin image analysis method and a discriminant model learning method according to an embodiment of the present invention. 本実施形態に係る肌画像分析方法の処理の流れの概念図である。4 is a conceptual diagram illustrating a processing flow of a skin image analysis method according to the present embodiment. 本実施形態に係る肌画像分析方法のフローチャートである。1 is a flowchart of a skin image analysis method according to the present embodiment. 肌パッチ画像群の指標値分布情報としてのカラーマップ画像の例を示す図である。FIG. 10 is a diagram showing an example of a color map image as index value distribution information of a group of skin patch images. 本実施形態に係る判別モデルの学習方法の処理の流れの概念図である。FIG. 2 is a conceptual diagram illustrating a processing flow of a discriminant model learning method according to the present embodiment. 本実施形態に係る判別モデルの学習方法のフローチャートである。10 is a flowchart of a method for learning a discriminant model according to the present embodiment. 肌領域サイズと判別モデルの判別精度との関係を示すグラフである。10 is a graph showing the relationship between the size of a skin region and the discrimination accuracy of a discriminant model. 人の目視評価で得られた化粧感スコアと本実施形態に係る肌画像分析方法で得られた化粧感スコアとの関係を示すグラフである。10 is a graph showing the relationship between the makeup feel score obtained by human visual evaluation and the makeup feel score obtained by the skin image analysis method according to the present embodiment. 化粧料塗布態様が異なる８種の評価用肌画像の各々に関して本実施形態に係る肌画像分析方法で取得された化粧感の代表指標値を示すグラフである。10 is a graph showing representative index values of makeup appearance obtained by the skin image analysis method according to the present embodiment for each of eight types of skin images for evaluation that differ in the manner in which cosmetics are applied. 化粧料塗布態様が異なる８種の評価用肌画像の各々に関する肌パッチ画像群の指標値分布を示すカラーマップ画像を示す図である。FIG. 10 is a diagram showing a color map image showing the index value distribution of a group of skin patch images for each of eight types of skin images for evaluation that have different cosmetic application patterns. 本実施形態に係る肌画像分析方法で得られる推定年齢帯と実年齢帯との関係を示すグラフである。10 is a graph showing the relationship between estimated age ranges and actual age ranges obtained by the skin image analysis method according to the present embodiment. ５名の被評価者の評価用肌画像と肌パッチ画像群の指標値分布を示すカラーマップ画像との関係を示す図である。FIG. 10 is a diagram showing the relationship between the evaluation skin images of five subjects and a color map image showing the index value distribution of a group of skin patch images. 二種類のファンデーション化粧料の塗布直後と塗布から４時間、８時間及び１０時間の各経過後とにおける各評価用肌画像に関するカラーマップ画像を示す図である。FIG. 10 is a diagram showing color map images of skin images for evaluation taken immediately after application of two types of foundation cosmetics and 4 hours, 8 hours, and 10 hours after application. 本実施形態に係る判別モデルの学習方法におけるエポック数と判別モデルの判別精度との関係を示すグラフである。10 is a graph showing the relationship between the number of epochs and the discrimination accuracy of the discriminant model in the discriminant model learning method according to the present embodiment. 顔画像を入力して肌の目視評価スコアを推定する学習済みモデルと肌パッチ画像を入力して肌の目視評価スコアを推定する学習済みモデルとの推定精度を比較するためのグラフである。This is a graph comparing the estimation accuracy of a trained model that estimates a visual skin assessment score by inputting a face image and a trained model that estimates a visual skin assessment score by inputting a skin patch image. 実施例８における肌パッチ型判別モデルから得られる目視評価スコアの推定値と年齢との相関、及び化粧料塗布に伴う目視評価スコアの推定値の変化を示すレーダーチャートである。10 is a radar chart showing the correlation between the estimated visual evaluation score obtained from the skin patch type discrimination model in Example 8 and age, and the change in the estimated visual evaluation score following the application of a cosmetic material.

以下、本発明の好ましい実施形態の例（以降、本実施形態と表記する）について説明する。以下に挙げる実施形態は例示であり、本発明は以下の実施形態の構成に限定されない。 The following describes a preferred embodiment of the present invention (hereinafter referred to as the present embodiment). The embodiment described below is merely an example, and the present invention is not limited to the configuration of the following embodiment.

本実施形態に係る肌画像分析方法（以下、本分析方法と表記する）及び判別モデルの学習方法（以下、本学習方法と表記する）は、一台以上の情報処理装置が備える一以上のプロセッサにより実行される。
図１は、本分析方法及び本学習方法を実行可能な情報処理装置１０のハードウェア構成例を概念的に示す図である。
情報処理装置１０は、いわゆるコンピュータであり、ＣＰＵ１１、メモリ１２、入出力インタフェース（Ｉ／Ｆ）１３、通信ユニット１４等を有する。情報処理装置１０は、据え置き型のＰＣ（Personal Computer）であってもよいし、携帯型のＰＣ、スマートフォン、タブレット等のような携帯端末であってもよいし、専用コンピュータであってもよい。 The skin image analysis method (hereinafter referred to as the present analysis method) and the discriminant model learning method (hereinafter referred to as the present learning method) according to this embodiment are executed by one or more processors provided in one or more information processing devices.
FIG. 1 is a diagram conceptually illustrating an example of the hardware configuration of an information processing device 10 that can execute the present analysis method and the present learning method.
The information processing device 10 is a so-called computer, and includes a CPU 11, a memory 12, an input/output interface (I/F) 13, a communication unit 14, etc. The information processing device 10 may be a desktop personal computer (PC), a portable terminal such as a portable PC, a smartphone, or a tablet, or a dedicated computer.

ＣＰＵ１１は、いわゆるプロセッサであり、一般的なＣＰＵ（Central Processing Unit）に加えて、特定用途向け集積回路（ＡＳＩＣ）、ＤＳＰ（Digital Signal Processor）、ＧＰＵ（Graphics Processing Unit）等も含まれ得る。メモリ１２は、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、補助記憶装置（ハードディスク等）である。
入出力Ｉ／Ｆ１３は、表示装置１５、入力装置１６等のユーザインタフェース装置と接続可能である。表示装置１５は、ＬＣＤ（Liquid Crystal Display）やＣＲＴ（Cathode Ray Tube）ディスプレイのような、ＣＰＵ１１等により処理された描画データに対応する画面を表示する装置である。入力装置１６は、キーボード、マウス等のようなユーザ操作の入力を受け付ける装置である。表示装置１５及び入力装置１６は一体化され、タッチパネルとして実現されてもよい。
通信ユニット１４は、通信網を介した他のコンピュータとの通信や、プリンタ等の他の機器との信号のやりとり等を行う。通信ユニット１４には、可搬型記録媒体等も接続され得る。 The CPU 11 is a so-called processor, and may include not only a general CPU (Central Processing Unit) but also an application specific integrated circuit (ASIC), a DSP (Digital Signal Processor), a GPU (Graphics Processing Unit), etc. The memory 12 is a RAM (Random Access Memory), a ROM (Read Only Memory), an auxiliary storage device (such as a hard disk), etc.
The input/output I/F 13 can be connected to user interface devices such as a display device 15 and an input device 16. The display device 15 is a device such as an LCD (Liquid Crystal Display) or a CRT (Cathode Ray Tube) display that displays a screen corresponding to drawing data processed by the CPU 11 or the like. The input device 16 is a device such as a keyboard or mouse that accepts input of user operations. The display device 15 and the input device 16 may be integrated and realized as a touch panel.
The communication unit 14 communicates with other computers via a communication network and exchanges signals with other devices such as a printer, etc. The communication unit 14 can also be connected to a portable recording medium, etc.

情報処理装置１０のハードウェア構成は、図１の例に制限されない。情報処理装置１０は、図示されていない他のハードウェア要素を含んでもよい。また、各ハードウェア要素の数も、図１の例に制限されない。例えば、情報処理装置１０は、複数のＣＰＵ１１を有していてもよい。また、情報処理装置１０は、複数の筐体からなる複数台のコンピュータにより実現されていてもよい。 The hardware configuration of the information processing device 10 is not limited to the example shown in Figure 1. The information processing device 10 may include other hardware elements not shown. Furthermore, the number of each hardware element is not limited to the example shown in Figure 1. For example, the information processing device 10 may have multiple CPUs 11. Furthermore, the information processing device 10 may be realized by multiple computers consisting of multiple housings.

情報処理装置１０は、ＣＰＵ１１によりメモリ１２に格納されたコンピュータプログラムが実行されることにより、本分析方法及び本学習方法を実行することができる。このコンピュータプログラムは、例えば、ＣＤ（Compact Disc）、メモリカード等のような可搬型記録媒体やネットワーク上の他のコンピュータから入出力Ｉ／Ｆ１３又は通信ユニット１４を介してインストールされ、メモリ１２に格納される。
本明細書では、説明の便宜のために、本分析方法及び本学習方法が共通の情報処理装置１０で実行される例を挙げるが、本分析方法及び本学習方法は、同じ又は異なるハードウェア構成をそれぞれ有する別々の情報処理装置で実行されてもよい。また、本分析方法及び本学習方法はそれぞれ別々のコンピュータプログラムで実装されてもよい。 The information processing device 10 can execute the present analysis method and the present learning method by having the CPU 11 execute a computer program stored in the memory 12. This computer program is installed from a portable recording medium such as a CD (Compact Disc) or a memory card, or from another computer on a network, via the input/output I/F 13 or the communication unit 14, and stored in the memory 12.
For ease of explanation, this specification provides an example in which the analysis method and the learning method are executed on a common information processing device 10, but the analysis method and the learning method may also be executed on separate information processing devices having the same or different hardware configurations. Furthermore, the analysis method and the learning method may each be implemented as separate computer programs.

情報処理装置１０（ＣＰＵ１１）は、教師データに基づいて機械学習されている学習済みの判別モデルを利用可能であり、更に、本学習方法を実行してその判別モデルを学習することもできる。
ここでの「判別モデル」は、教師データを用いた機械学習、即ち教師あり学習により得られたモデルであり、ＡＩ（Artificial Intelligence）モデル、機械学習（Machine Learning（ＭＬ））モデル等と表記可能である。
本実施形態で利用される判別モデルは、肌パッチ画像の入力に応じてその肌パッチ画像の肌属性又は肌状態を判別し得る情報を出力することができればよく、それのデータ構造や学習アルゴリズム等は限定されない。なお、判別モデルに入力される肌パッチ画像や判別モデルからの出力により判別可能となる肌属性及び肌状態については後述する。 The information processing device 10 (CPU 11) can use a trained discrimination model that has been machine-learned based on training data, and can also train the discrimination model by executing the present learning method.
The "discrimination model" here is a model obtained by machine learning using training data, i.e., supervised learning, and can be expressed as an AI (Artificial Intelligence) model, a machine learning (ML) model, etc.
The discriminant model used in this embodiment is only required to be able to output information capable of discriminating the skin attributes or skin conditions of a skin patch image in response to the input of the skin patch image, and there are no limitations on its data structure, learning algorithm, etc. Note that the skin attributes and skin conditions that can be discriminated based on the skin patch images input to the discriminant model and the output from the discriminant model will be described later.

当該判別モデルは、例えば、回帰分析で得られる回帰式であってもよいし、ディープニューラルネットワークを含む畳み込みニューラルネットワークで構成されていてもよく、コンピュータプログラムとパラメータとの組合せ、複数の関数とパラメータとの組合せなどにより実現される。また、判別モデルは、ニューラルネットワークで構築される場合で、かつ、入力層、中間層及び出力層を一つのニューラルネットワークの単位と捉えた場合に、一つのニューラルネットワークを指してもよいし、複数のニューラルネットワークの組合せを指してもよい。また、判別モデルは、複数の重回帰式の組合せで構成されてもよいし、一つの重回帰式で構成されてもよい。また、判別モデルは、ニューラルネットワークと回帰式や判別式等のような他のモデルとの組合せで構成され、ニューラルネットワークの出力として得られた値を当該他のモデルに入力して得られる値を出力とするものであってもよい。本実施形態では、当該判別モデルがディープニューラルネットワークで構成されている例を挙げる。 The discriminant model may be, for example, a regression equation obtained by regression analysis, or may be composed of a convolutional neural network including a deep neural network, and may be realized by a combination of a computer program and parameters, or a combination of multiple functions and parameters. Furthermore, when the discriminant model is constructed using a neural network, and the input layer, intermediate layer, and output layer are considered as units of a single neural network, the discriminant model may refer to a single neural network or a combination of multiple neural networks. The discriminant model may also be composed of a combination of multiple multiple regression equations, or a single multiple regression equation. The discriminant model may also be composed of a combination of a neural network and another model, such as a regression equation or a discriminant equation, and may output a value obtained by inputting a value obtained as an output of the neural network into the other model. This embodiment provides an example in which the discriminant model is composed of a deep neural network.

この判別モデルは、情報処理装置１０内のメモリ１２に格納されていてもよいし、情報処理装置１０が通信でアクセス可能な他のコンピュータのメモリに格納されていてもよい。
このように情報処理装置１０は、当該判別モデルを利用可能な装置であって本分析方法を実行可能な肌画像分析装置と表記可能であるし、当該判別モデルを学習可能な判別モデルの学習装置と表記することもできる。
以降の説明では、本分析方法及び本学習方法の実行主体をＣＰＵ１１として説明する。 This discrimination model may be stored in the memory 12 within the information processing device 10, or may be stored in the memory of another computer that the information processing device 10 can access via communication.
In this way, the information processing device 10 can be described as a skin image analysis device that can use the discriminant model and execute the present analysis method, and can also be described as a discriminant model learning device that can learn the discriminant model.
In the following description, the CPU 11 will be described as the entity that executes the present analysis method and the present learning method.

［肌画像分析方法（本分析方法）］
ここで、本分析方法について図２及び図３を用いて説明する。
図２は、本実施形態に係る肌画像分析方法の処理の流れの概念図であり、図３は、本実施形態に係る肌画像分析方法のフローチャートである。本分析方法は、図１に例示されるような情報処理装置１０（ＣＰＵ１１）により実行される。
ＣＰＵ１１は、上述したとおり、所定画像サイズの肌パッチ画像の入力に応じてその肌パッチ画像の肌属性又は肌状態を判別する学習済みの判別モデルを利用可能である。この判別モデルは、教師用肌画像の被写肌共通の属性又は状態を示す正解情報とその教師用肌画像から複数抽出される所定画像サイズの教師用肌パッチ画像群との複数の組合せを含む複数の教師データに基づいて機械学習されている。なお、この判別モデルは本学習方法により学習されればよく、本学習方法については後述する。 [Skin image analysis method (this analysis method)]
Here, this analysis method will be explained with reference to FIGS.
Fig. 2 is a conceptual diagram of the processing flow of the skin image analysis method according to this embodiment, and Fig. 3 is a flowchart of the skin image analysis method according to this embodiment. This analysis method is executed by an information processing device 10 (CPU 11) as exemplified in Fig. 1.
As described above, the CPU 11 can use a trained discrimination model that discriminates the skin attributes or skin conditions of a skin patch image of a predetermined image size in response to the input of the skin patch image. This discrimination model is machine-trained based on multiple training data sets that include multiple combinations of correct answer information indicating common attributes or conditions of the subject skin of the training skin images and a group of training skin patch images of a predetermined image size extracted from the training skin images. Note that this discrimination model can be trained using the present training method, which will be described later.

教師用肌画像は、教師データとして利用される画像であって人の肌が写る画像である。教師用肌画像に写る肌は、肌属性又は肌状態の判別対象となる肌を含んでいればよく、例えば、人の顔の肌、人の首の肌、人の腕の肌、人の手の甲或いは平の肌、脚の肌等である。教師用肌画像には、目、鼻の孔、口等のような判別対象となる肌以外の人体部位が写っていてもよい。
肌の属性は、例えば、年齢、性別等のようなその肌を持つ人の属性、その肌が属する人体部位（頬、おでこ、顔、首、脚等）等である。
肌の状態は、例えば、素肌か化粧肌かを示す状態、化粧崩れしているか否かを示す状態、肌に潤いがあるか否かを示す状態、肌が乾燥しているか否かを示す状態等である。本実施形態で対象とされる肌の状態は、シミやシワ等のような局所的特徴の存在を示すものよりは、全体的な見た目の印象を示すものであることが好ましい。
教師用肌パッチ画像群は、一つの教師用肌画像から抽出される複数の肌パッチ画像を含む画像集合である。
肌パッチ画像とは、肌領域（肌を示す画像領域）が所定の割合以上含まれる小区画画像である。所定の割合は少なくとも９０％以上とされる。肌パッチ画像は、予め決められた形状とされ、処理の容易性から矩形であることが好ましいが、円形、楕円形等、その形状は限定されない。
肌パッチ画像の画像サイズは予め決められており、例えば、２２４ピクセル×２２４ピクセルのサイズに決められる。但し、肌パッチ画像のサイズはこのような例に限定されず、判別モデルの構成等に応じて適宜決められればよい。 The teacher skin image is an image used as teacher data and shows human skin. The skin shown in the teacher skin image may include skin that is the target of skin attribute or skin condition discrimination, such as the skin of a human face, the skin of a human neck, the skin of a human arm, the skin of the back or palm of a human hand, the skin of a human leg, etc. The teacher skin image may also show other body parts other than the skin that is the target of discrimination, such as the eyes, nostrils, mouth, etc.
The attributes of the skin include, for example, attributes of the person who has the skin, such as age and sex, and the body part to which the skin belongs (cheek, forehead, face, neck, leg, etc.).
The skin condition may be, for example, a condition indicating whether the skin is bare or made up, a condition indicating whether the makeup has come off, a condition indicating whether the skin is moisturized, a condition indicating whether the skin is dry, etc. The skin condition targeted in this embodiment is preferably one that indicates the overall impression of appearance rather than one that indicates the presence of local features such as blemishes or wrinkles.
The teacher skin patch image group is an image collection including a plurality of skin patch images extracted from one teacher skin image.
A skin patch image is a small section image that contains a skin region (an image region showing skin) at a predetermined rate or more. The predetermined rate is set to be at least 90%. The skin patch image has a predetermined shape, and is preferably rectangular for ease of processing, but the shape is not limited to a circle, ellipse, or the like.
The image size of the skin patch image is predetermined, and is set to, for example, 224 pixels x 224 pixels. However, the size of the skin patch image is not limited to this example and may be determined appropriately depending on the configuration of the discriminant model, etc.

本分析方法において、まず、ＣＰＵ１１は、評価用肌画像を取得する（Ｓ３１）。
評価用肌画像は、被評価者の肌が写る画像であればよく、肌以外の部位が写っていてもよい。評価用肌画像の画像サイズは、その評価用肌画像から複数の肌パッチ画像を抽出することができれば、何ら制限されない。
工程（Ｓ３１）で取得される評価用肌画像は、例えば、ＪＰＥＧ（Joint Photographic Experts Group）形式、ＢＭＰ（Bitmap image）形式、ＴＩＦＦ（Tagged Image File Format）形式、ＧＩＦ（Graphic Interchange Format）形式等の画像データとして取得される。但し、評価用肌画像のデータ形式は制限されない。また、取得される評価用肌画像は、肌の見た目の状態又は属性を判別するために、可視光を撮像するカメラで撮像されたカラー画像であることが好ましい。
ＣＰＵ１１は、評価用肌画像を撮像したカメラからその画像を取得してもよいし、他のコンピュータや可搬型記録媒体からその画像を取得してもよい。 In this analysis method, first, the CPU 11 acquires a skin image for evaluation (S31).
The evaluation skin image may be an image showing the skin of the subject, or may show other parts of the subject. The image size of the evaluation skin image is not limited as long as multiple skin patch images can be extracted from the evaluation skin image.
The skin image for evaluation acquired in step (S31) is acquired as image data in, for example, JPEG (Joint Photographic Experts Group) format, BMP (Bitmap image) format, TIFF (Tagged Image File Format), GIF (Graphic Interchange Format), or other format. However, the data format of the skin image for evaluation is not limited. Furthermore, the acquired skin image for evaluation is preferably a color image captured with a camera that captures visible light in order to determine the appearance or attributes of the skin.
The CPU 11 may acquire the skin image for evaluation from a camera that captured the image, or may acquire the image from another computer or a portable recording medium.

次に、ＣＰＵ１１は、工程（Ｓ３１）で取得された評価用肌画像から肌パッチ画像群を取得する（Ｓ３２）。肌パッチ画像群に含まれる肌パッチ画像の数は、後述する指標値の取得数に対応するため多いことが好ましいが、特に制限されない。
また、工程（Ｓ３２）で取得される各肌パッチ画像の所定画像サイズは、上述した教師用肌パッチ画像群を形成する各肌パッチ画像の画像サイズと同じであることが好ましいが、その画像サイズとは必ずしも一致しなくてもよい。 Next, the CPU 11 acquires a group of skin patch images from the evaluation skin images acquired in step S31 (S32). The number of skin patch images included in the group of skin patch images is preferably large because it corresponds to the number of acquired index values described below, but is not particularly limited.
In addition, it is preferable that the predetermined image size of each skin patch image obtained in step (S32) is the same as the image size of each skin patch image forming the above-mentioned teacher skin patch image group, but it does not necessarily have to be the same as that image size.

工程（Ｓ３２）で取得される肌パッチ画像群に含まれる個々の肌パッチ画像及び教師データの教師用肌パッチ画像群に含まれる個々の教師用肌パッチ画像に写る被写体の肌領域は、被写体の１ｃｍ^２以上２０ｃｍ^２以下の肌領域であることが好ましく、３ｃｍ^２以上１９ｃｍ^２以下の肌領域であることが更に好ましい。実施例として後述するように、本発明者らは、教師用肌パッチ画像に写る被写体の肌領域のサイズを様々に変えて判別モデルによる判別精度を検証したところ、当該肌領域のサイズが判別モデルの判別精度に影響を与えることを新たに見出すと共に、判別モデルの判別精度を高くするのに上述のような範囲の肌領域サイズを用いることを新たに見出した。
そのため、個々の肌パッチ画像及び個々の教師用肌パッチ画像の取得元となる評価用肌画像及び教師用肌画像は、所定画像サイズを有する肌パッチ画像及び教師用肌パッチ画像が上述のような範囲の肌領域を示すような解像度等の撮影条件にて撮像されることが好ましい。また、被写体の肌に所定サイズ（例えば１ｃｍ角）の色票を貼った状態で撮影することで、評価用肌画像及び教師用肌画像上での画素サイズと被写体の肌の実サイズとの関係を把握することができる。 The skin area of the subject appearing in each skin patch image included in the skin patch image group acquired in step (S32) and each teacher skin patch image included in the teacher skin patch image group of the teacher data is preferably a skin area of the subject of 1 cm ² or more and 20 cm ² or less, and more preferably a skin area of 3 cm ² or more and 19 cm ² or less. As will be described later as an example, the inventors have verified the discrimination accuracy of the discriminant model by varying the size of the skin area of the subject appearing in the teacher skin patch image. As a result, they have newly discovered that the size of the skin area affects the discrimination accuracy of the discriminant model, and have newly discovered that using a skin area size in the above-mentioned range can improve the discrimination accuracy of the discriminant model.
Therefore, it is preferable that the evaluation skin images and teacher skin images from which the individual skin patch images and individual teacher skin patch images are obtained are captured under shooting conditions such as resolution that the skin patch images and teacher skin patch images having a predetermined image size show the skin area in the above-mentioned range. Furthermore, by capturing an image with a color chart of a predetermined size (e.g., 1 cm square) attached to the subject's skin, it is possible to grasp the relationship between the pixel size on the evaluation skin images and teacher skin images and the actual size of the subject's skin.

工程（Ｓ３２）において肌パッチ画像群は様々な手法で取得可能である。
例えば、評価用肌画像を所定画像サイズの矩形小区画画像に分割し、矩形小区画画像群のうち肌領域（肌を示す画像領域）が所定の割合以上含まれる矩形小区画画像すべてを肌パッチ画像として切り出す手法がある。この手法において、最終的な画像サイズよりも大きい画像サイズで小区画画像を切り出してから、その小区画画像から最終的な画像サイズの肌パッチ画像を更に切り出すようにしてもよい。
また、評価用肌画像上で所定画像サイズの切り出し窓を移動させながら、その窓内において肌領域が所定の割合以上含まれるタイミングでその窓内の小区画画像を切り出すことで、肌パッチ画像群を取得する手法もある。この手法では、肌パッチ画像間において重複して同じ肌領域が部分的に含まれるようにしてもよい。この場合、前回切り出した位置から所定量以上切り出し窓を移動させた際に肌領域の判定を行うようにすれば、肌パッチ画像間において同じ肌領域が大部分を占めるのを避けることができる。
また、他の手法で肌パッチ画像群が取得されてもよく、肌パッチ画像群の取得手法は制限されない。 In step (S32), the set of skin patch images can be obtained by various methods.
For example, one method is to divide the evaluation skin image into rectangular small-partition images of a predetermined image size, and then cut out all rectangular small-partition images that contain a skin region (image region showing skin) at a predetermined ratio or more from the group of rectangular small-partition images as skin patch images. In this method, small-partition images may be cut out at an image size larger than the final image size, and then skin patch images of the final image size may be further cut out from the small-partition images.
Another method involves acquiring a group of skin patch images by moving a cropping window of a predetermined image size on the evaluation skin image and cropping small section images within the window when a predetermined percentage or more of the skin region is included within the window. In this method, the same skin region may be partially included in overlapping skin patch images. In this case, determining the skin region when the cropping window is moved by a predetermined amount or more from the previously cropped position can prevent the same skin region from occupying a large portion of the skin patch images.
Furthermore, the skin patch image group may be acquired by other methods, and the method for acquiring the skin patch image group is not limited.

次に、ＣＰＵ１１は、工程（Ｓ３２）で取得された各肌パッチ画像に対してそれぞれ画素値の正規化を行う（Ｓ３５）。例えば、ＣＰＵ１１は、一つの肌パッチ画像において画素値（ＲＧＢ値）の平均値及び分散値を算出し、その肌パッチ画像の各画素値をその平均値及び分散値を用いてそれぞれ線形演算することで、その肌パッチ画像の画素値を正規化することができる。このとき、工程（Ｓ３２）で取得された全ての肌パッチ画像を対象にして算出された画素値の平均値及び分散値が利用されてもよい。また、正規化手法についても、平均値をゼロに分散値を１にするような一般的な正規化手法のみならず、判別モデルごとに決められる正規化パラメータに基づく正規化手法が利用されてもよい。工程（Ｓ３２）における正規化手法には公知の様々な手法が利用できる。 Next, CPU 11 normalizes the pixel values of each skin patch image acquired in step (S32) (S35). For example, CPU 11 can normalize the pixel values of one skin patch image by calculating the mean value and variance of the pixel values (RGB values) for that skin patch image and performing linear calculations on each pixel value of that skin patch image using the mean value and variance. In this case, the mean value and variance of the pixel values calculated for all skin patch images acquired in step (S32) may be used. Furthermore, as for the normalization method, not only general normalization methods that set the mean value to zero and the variance to 1 may be used, but also normalization methods based on normalization parameters determined for each discriminant model. Various known methods can be used for the normalization method in step (S32).

続いて、ＣＰＵ１１は、正規化された各肌パッチ画像をそれぞれ判別モデルに入力し（Ｓ３７）、各肌パッチ画像に関する肌属性又は肌状態の指標値をそれぞれ取得する（Ｓ３８）。
取得される指標値が示す肌属性又は肌状態は、利用する判別モデルによって決まる。例えば、教師用肌画像の被写肌共通の肌状態として素肌又は化粧肌のいずれか一方を示す正解情報で学習されているような判別モデルが利用される場合には、（Ｓ３８）では、各肌パッチ画像に関して素肌感（素肌らしさ）又は化粧感（化粧肌らしさ）の指標値が肌状態の指標値としてそれぞれ取得される。また、教師用肌画像の被写肌共通の属性として男性又は女性のいずれか一方を示す正解情報で学習されているような判別モデルが利用される場合には、各肌パッチ画像に関して男性肌らしさ又は女性肌らしさの指標値が肌属性の指標値としてそれぞれ取得される。また、教師用肌画像の被写肌共通の属性として被写体の実年齢或いは実年齢帯（年代）を示す正解情報で学習されているような判別モデルが利用される場合には、各肌パッチ画像に関して肌の見た目年齢若しくは肌の見た目年齢帯を示す指標値又は肌の見た目年齢帯ごとのそれらしさの度合を示す指標値が肌属性の指標値としてそれぞれ取得される。また、教師用肌画像の被写肌共通の状態として化粧料塗布直後又は化粧料塗布から所定時間経過後のいずれか一方を示す正解情報で学習されているような判別モデルが利用される場合には、各肌パッチ画像に関して化粧崩れ度が肌状態の指標値としてそれぞれ取得される。また、教師用肌画像の被写肌共通の状態としてスキンケア剤塗布前又はスキンケア剤塗布後のいずれか一方を示す正解情報で学習されているような判別モデルが利用される場合には、各肌パッチ画像に関して肌のうるおい感の指標値が肌状態の指標値としてそれぞれ取得される。 Next, the CPU 11 inputs each normalized skin patch image into a discriminant model (S37), and obtains index values of skin attributes or skin conditions for each skin patch image (S38).
The skin attributes or skin conditions indicated by the acquired index values depend on the discriminant model used. For example, when a discriminant model trained with correct answer information indicating either bare skin or made-up skin as the common skin condition of the subject skin of the teacher skin images is used, in (S38), an index value for bare skin feel (bare skin-likeness) or made-up feel (made-up skin-likeness) is acquired as the skin condition index value for each skin patch image. When a discriminant model trained with correct answer information indicating either male or female as the common attribute of the subject skin of the teacher skin images is used, an index value for male skin-likeness or female skin-likeness is acquired as the skin attribute index value for each skin patch image. When a discriminant model trained with correct answer information indicating the subject's actual age or actual age range (age group) as the common attribute of the subject skin of the teacher skin images is used, an index value for the skin attribute indicating the skin's apparent age or skin's apparent age range, or an index value for the skin's apparent age range, is acquired as the skin attribute index value for each skin patch image. Furthermore, when a discrimination model trained with correct answer information indicating either immediately after application of a cosmetic product or a predetermined time after application of a cosmetic product as a common skin condition for all subject skin images is used, the degree of makeup smearing is acquired as a skin condition index value for each skin patch image. Furthermore, when a discrimination model trained with correct answer information indicating either before application of a skin care product or after application of a skin care product is used as a common skin condition for all subject skin images, the index value of skin moisturization is acquired as a skin condition index value for each skin patch image.

このように工程（Ｓ３８）において各肌パッチ画像に関してそれぞれ取得される指標値は、男性肌らしさ又は女性肌らしさを示す度合の第一指標値、素肌らしさ又は化粧肌らしさを示す度合の第二指標値、肌の各年齢帯らしさを示す度合の第三指標値、化粧崩れの度合を示す第四指標値、又は、肌の潤い感の度合を示す第五指標値のいずれか一つを含む。
但し、判別モデルの出力となる指標値はこのような例のみに限定されず、肌属性又は肌状態の指標となり得る値であればよい。例えば、（Ｓ３８）では、或る肌状態又は或る肌属性であるか否かの２値が当該指標値として取得されてもよい。但し、本実施形態において得られる指標値は、シミやシワ等のような局所的特徴の存在を示すものよりは、全体的な見た目の印象を示すものであることが好ましい。 In this way, the index values obtained for each skin patch image in step (S38) include one of a first index value indicating the degree of male or female skin-likeness, a second index value indicating the degree of bare skin-likeness or made-up skin-likeness, a third index value indicating the degree of skin-likeness for each age group, a fourth index value indicating the degree of makeup breakdown, or a fifth index value indicating the degree of moisturized skin.
However, the index value output from the discriminant model is not limited to this example, and may be any value that can be an index of a skin attribute or skin condition. For example, in (S38), a binary value indicating whether or not a certain skin condition or certain skin attribute exists may be acquired as the index value. However, in this embodiment, it is preferable that the index value obtained indicates an overall appearance impression rather than indicating the presence of local features such as blemishes or wrinkles.

図３には、複数の工程（処理）が順番に表されているが、本分析方法の各工程の実行順序は、図３の例のみに限定されない。例えば、肌パッチ画像群の全ての肌パッチ画像に対して正規化が完了した後に工程（Ｓ３７）及び工程（Ｓ３８）を実行するのではなく、個々の肌パッチ画像に関して、正規化（Ｓ３５）、判別モデルへの入力（Ｓ３７）及び指標値の取得（Ｓ３８）をそれぞれ一つの流れで実行するようにしてもよい。また、工程（Ｓ３２）において肌パッチ画像群をすべて取得した後に工程（Ｓ３５）以降を実行するのではなく、工程（Ｓ３２）で一つの肌パッチ画像が取得される度に工程（Ｓ３５）以降を実行するようにしてもよい。 While Figure 3 shows multiple steps (processes) in sequence, the order in which the steps of this analysis method are performed is not limited to the example of Figure 3. For example, rather than performing steps (S37) and (S38) after normalization has been completed for all skin patch images in the skin patch image group, normalization (S35), input to the discriminant model (S37), and acquisition of index values (S38) may each be performed in a single flow for each individual skin patch image. Furthermore, rather than performing steps (S35) and subsequent steps after acquiring all of the skin patch image group in step (S32), steps (S35) and subsequent steps may be performed each time a skin patch image is acquired in step (S32).

本分析方法では、上述のように、教師用肌画像の被写肌共通の属性又は状態を示す正解情報とその教師用肌画像から複数抽出された教師用肌パッチ画像群とからなる教師データ群を用いて機械学習されている学習済みの判別モデルに対して、評価用肌画像から取得された所定画像サイズの肌パッチ画像群を入力することで、各肌パッチ画像に関してそれぞれ肌属性又は肌状態の指標値が取得される。このように肌全体の共通の属性又は状態を示す正解情報を用いて学習された判別モデル用いて、各肌パッチ画像それぞれについて肌属性又は肌状態の指標値を取得することで、照明等のような評価用肌画像の撮影環境のばらつきや顔の局所部位の特徴差による影響を抑えて、評価用肌画像に写る被写肌全体から得られる見た目の肌状態又は肌属性を分析することができる。 In this analysis method, as described above, a group of skin patch images of a predetermined image size obtained from the evaluation skin images is input into a trained discrimination model that has been machine-trained using a training data set consisting of ground truth information indicating the common attributes or conditions of the subject skin in the training skin images and a group of training skin patch images extracted from the training skin images, thereby obtaining index values for the skin attributes or skin conditions for each skin patch image. By obtaining index values for the skin attributes or skin conditions for each skin patch image using a discrimination model trained using ground truth information indicating the common attributes or conditions of the entire skin, it is possible to analyze the apparent skin conditions or skin attributes obtained from the entire subject skin captured in the evaluation skin images, while minimizing the effects of variations in the shooting environment of the evaluation skin images, such as lighting, and differences in the characteristics of local areas of the face.

ＣＰＵ１１は、工程（Ｓ３８）で取得される各肌パッチ画像の指標値を次のように加工することができる。
例えば、ＣＰＵ１１は、各肌パッチ画像の指標値の、各肌パッチ画像の被写肌領域の位置に応じた分布を示す指標値分布情報を生成することができる。この指標値分布情報は、例えば、各肌パッチ画像の被写肌領域の位置に応じて指標値が並べられたグラフやマップとされてもよい。この指標値分布情報によれば、肌状態又は肌属性の指標値の分布を把握することができる。 The CPU 11 can process the index values of each skin patch image acquired in step (S38) as follows.
For example, the CPU 11 can generate index value distribution information that indicates the distribution of index values of each skin patch image according to the position of the skin region of each skin patch image. This index value distribution information may be, for example, a graph or map in which index values are arranged according to the position of the skin region of each skin patch image. This index value distribution information allows the distribution of index values of skin conditions or skin attributes to be understood.

図４は、肌パッチ画像群の指標値分布情報としてのカラーマップ画像の例を示す図である。
ＣＰＵ１１は、図４に示されるように、工程（Ｓ３１）で取得された評価用肌画像に各肌パッチ画像の指標値を各肌パッチ画像の被写肌領域の位置に応じてマッピングさせたカラーマップ画像を当該指標値分布情報として生成することができる。
図４は、図面制約により二値画像とせざるを得ず分かりづらいが、黒に近づく程、肌状態又は肌属性の指標値が高くなり、白に近づく程、その指標値が低くなることを示している。図４では白黒で示されているが、当該カラーマップ画像では、評価用肌画像上にて各肌パッチ画像の位置がその指標値に対応する色とされる。このカラーマップ画像によれば、視覚的に指標値の分布を把握することができると共に、評価用肌画像上でその分布が示されるため、肌の部位ごとの肌状態又は肌属性の指標を直感的に把握することができる。 FIG. 4 is a diagram showing an example of a color map image as index value distribution information of a group of skin patch images.
As shown in Figure 4, the CPU 11 can generate a color map image as the index value distribution information in which the index values of each skin patch image are mapped to the evaluation skin image acquired in step (S31) according to the position of the subject skin area of each skin patch image.
Although Fig. 4 is difficult to understand due to the constraints of the drawing, it is necessary to use a binary image, which indicates that the closer to black the color, the higher the index value of the skin condition or skin attribute, and the closer to white the color, the lower the index value. While Fig. 4 is shown in black and white, in the color map image, the position of each skin patch image on the evaluation skin image is represented by a color corresponding to its index value. This color map image allows users to visually grasp the distribution of index values, and because the distribution is displayed on the evaluation skin image, users can intuitively grasp the index of the skin condition or skin attribute for each skin region.

また、ＣＰＵ１１は、取得されて各肌パッチ画像の指標値に基づいて、被評価者の肌属性又は肌状態の代表評価値を算出することもできる。工程（Ｓ３１）で取得された評価用肌画像に関して一つの代表評価値が算出されてもよいし、その評価用肌画像の被写肌領域を各肌パッチ画像よりも大きいいくつかの部位（例えば頬、額、口の周り、鼻等）に分け、各部位に属する肌パッチ画像の指標値を用いて各部位の代表評価値が算出されてもよい。代表評価値は、例えば、各肌パッチ画像の指標値の平均、標準偏差、分散等の統計値とされる。
このように各肌パッチ画像の指標値から代表評価値を得ることで、評価用肌画像に写る被評価者の肌全体或いは部位ごとの肌状態又は肌属性の総合評価を容易に把握することができる。 The CPU 11 can also calculate a representative evaluation value for the skin attributes or skin condition of the person being evaluated based on the index values of each acquired skin patch image. A single representative evaluation value may be calculated for the evaluation skin image acquired in step (S31), or the subject skin area of the evaluation skin image may be divided into several parts (e.g., cheeks, forehead, around the mouth, nose, etc.) that are larger than each skin patch image, and a representative evaluation value for each part may be calculated using the index values of the skin patch images belonging to each part. The representative evaluation value may be, for example, a statistical value such as the average, standard deviation, or variance of the index values of each skin patch image.
By obtaining a representative evaluation value from the index values of each skin patch image in this way, it is possible to easily grasp the overall evaluation of the skin condition or skin attributes of the entire skin or each part of the skin of the person being evaluated that appears in the evaluation skin image.

更に、上述の代表評価値を算出する肌領域を指定可能とすることもできる。この場合、ＣＰＵ１１は、工程（Ｓ３１）で取得された評価用肌画像中の所望の画像領域を指定する領域指定情報を取得する工程を更に実行し、その工程で取得された領域指定情報が示す画像領域に対応する肌パッチ画像群に関して取得された指標値に基づいて、その画像領域の肌属性又は肌状態の代表評価値を算出すればよい。ＣＰＵ１１は、当該評価用肌画像を表示装置１５に表示し、表示された評価用肌画像に対して入力装置１６を用いて指定された画像領域を特定することで、当該領域指定情報を得ることができる。ＣＰＵ１１は、このような領域指定情報を他のコンピュータから通信により取得することもできる。
このようにすれば、被評価者やその他のユーザにとって関心のある肌領域に関して肌状態又は肌属性の代表評価値を得ることができる。 Furthermore, it is also possible to specify a skin region for which the representative evaluation value is to be calculated. In this case, the CPU 11 further executes a step of acquiring region designation information for designating a desired image region in the evaluation skin image acquired in step (S31), and calculates a representative evaluation value for the skin attribute or skin condition of the image region based on index values acquired for a group of skin patch images corresponding to the image region indicated by the region designation information acquired in that step. The CPU 11 displays the evaluation skin image on the display device 15 and identifies a designated image region on the displayed evaluation skin image using the input device 16, thereby obtaining the region designation information. The CPU 11 can also acquire such region designation information from another computer via communication.
In this way, a representative evaluation value of the skin condition or skin attribute can be obtained for a skin area that is of interest to the person being evaluated or other users.

［判別モデルの学習方法（本学習方法）］
次に、上述した本分析方法で利用される判別モデルの学習方法について図５及び図６を用いて説明する。
図５は、本実施形態に係る判別モデルの学習方法の処理の流れの概念図であり、図６は、本実施形態に係る判別モデルの学習方法のフローチャートである。本学習方法は、図１に例示されるような情報処理装置１０（ＣＰＵ１１）により実行される。
本実施形態では、上述したとおり、ディープニューラルネットワークで構成された判別モデルであって、所定画像サイズの肌パッチ画像の入力に応じてその肌パッチ画像の肌属性又は肌状態を判別する判別モデルが利用される。本学習方法が実行されるにあたり、この判別モデルは、情報処理装置１０内のメモリ１２に格納されていてもよいし、情報処理装置１０が通信でアクセス可能な他のコンピュータのメモリに格納されていてもよい。 [Discrimination model learning method (this learning method)]
Next, a method for learning the discriminant model used in the above-described analysis method will be described with reference to FIGS.
Fig. 5 is a conceptual diagram of the process flow of the discriminant model training method according to this embodiment, and Fig. 6 is a flowchart of the discriminant model training method according to this embodiment. This training method is executed by an information processing device 10 (CPU 11) as exemplified in Fig. 1.
As described above, this embodiment utilizes a discriminant model configured with a deep neural network that discriminates the skin attributes or skin condition of a skin patch image of a predetermined image size in response to the input of the skin patch image. When this learning method is executed, this discriminant model may be stored in the memory 12 within the information processing device 10, or may be stored in the memory of another computer that the information processing device 10 can access via communication.

まず、ＣＰＵ１１は、教師用肌画像とその教師用肌画像の被写肌共通の属性又は状態を示す正解情報との組合せを複数取得する（Ｓ６１）。
この教師用肌画像は、所定画像サイズを有する教師用肌パッチ画像が上述のように被写体の１ｃｍ^２以上２０ｃｍ^２以下の肌領域を示すような解像度等の撮影条件にて撮像されることが好ましい。また、教師用肌画像の被写体の肌に所定サイズ（例えば１ｃｍ角）の色票を貼った状態で撮影することで、教師用肌画像上での画素サイズと被写体の肌の実サイズとの関係を把握することができる。
ここでの「教師用肌画像」、被写肌共通の「属性」及び「状態」については上述したとおりである。例えば、教師用肌画像の被写肌共通の状態を示す正解情報は、素肌又は化粧肌のいずれか一方を示す。肌状態の正解情報の他の例としては、化粧塗布直後又は化粧塗布から所定時間経過後のいずれか一方の状態や、スキンケア剤塗布前又はスキンケア剤塗布後のいずれか一方の状態などを示す。また、肌属性の正解情報の例としては、男性又は女性のいずれか一方や、被写体の実年齢又は実年齢帯を示す。
なお、「被写肌共通」とは、一つの教師用肌画像に対してその教師用肌画像の被写肌の属性又は状態を示す一つの正解情報が付され、その教師用肌画像から後述するように取得される教師用肌パッチ画像群に対してその一つの正解情報が共通して関連付けられることを意味する。 First, the CPU 11 acquires a plurality of combinations of teacher skin images and correct answer information indicating attributes or states common to the subject skin of the teacher skin images (S61).
It is preferable that this teacher skin image be captured under shooting conditions such as resolution such that the teacher skin patch image having a predetermined image size shows a skin area of the subject of 1 ^cm2 or more and 20 cm2 or ^less , as described above. In addition, by capturing an image with a color chart of a predetermined size (for example, 1 cm square) attached to the skin of the subject of the teacher skin image, it is possible to grasp the relationship between the pixel size on the teacher skin image and the actual size of the subject's skin.
The "teacher skin image" and the "attributes" and "conditions" common to all subject skins are as described above. For example, the correct answer information indicating the common condition of all subject skins in the teacher skin image indicates either bare skin or made-up skin. Other examples of correct answer information for skin conditions include either the condition immediately after makeup application or a predetermined time after makeup application, or either the condition before or after application of a skin care product. Furthermore, examples of correct answer information for skin attributes include either male or female, or the actual age or actual age range of the subject.
Note that "common to subject skin" means that one correct answer information indicating the attributes or state of the subject skin of one teacher skin image is assigned to one teacher skin image, and that one correct answer information is commonly associated with a group of teacher skin patch images obtained from that teacher skin image as described below.

続いて、ＣＰＵ１１は、工程（Ｓ６１）で取得された教師用肌画像と正解情報との組合せごとに、工程（Ｓ６３）から工程（Ｓ６７）をそれぞれ実行する。 Next, the CPU 11 executes steps (S63) to (S67) for each combination of the teacher skin image and correct answer information obtained in step (S61).

ＣＰＵ１１は、対象の組合せに係る教師用肌画像から肌領域が所定の割合以上含まれる所定画像サイズの教師用肌パッチ画像群を取得する（Ｓ６３）。一つの教師用肌画像から取得される教師用肌パッチ画像の数は、教師データ数を増やすという観点から多い程好ましいが、特に制限されない。
また、肌パッチ画像群を形成する各肌パッチ画像の所定画像サイズは、学習対象となる判別モデルの構成に応じて決められる。
ここで、工程（Ｓ６３）で取得される教師用肌パッチ画像群に含まれる個々の教師用肌パッチ画像に写る被写体の肌領域は、被写体の１ｃｍ^２以上２０ｃｍ^２以下の肌領域であることが好ましく、３ｃｍ^２以上１９ｃｍ^２以下の肌領域であることが更に好ましい。このような教師用肌パッチ画像に写る被写体の肌領域の範囲は上述したとおりである。 The CPU 11 acquires a group of teacher skin patch images of a predetermined image size that contain a predetermined ratio or more of skin regions from the teacher skin image related to the target combination (S63). The number of teacher skin patch images acquired from one teacher skin image is preferably as large as possible from the viewpoint of increasing the number of teacher data, but is not particularly limited.
Furthermore, the predetermined image size of each skin patch image forming the skin patch image group is determined according to the configuration of the discriminant model to be trained.
Here, the skin area of the subject appearing in each of the teacher skin patch images included in the teacher skin patch image group acquired in step (S63) is preferably a skin area of the subject of 1 ^cm to 20 ^cm , more preferably a skin area of 3 ^cm to 19 ^cm . The range of the skin area of the subject appearing in such teacher skin patch images is as described above.

工程（Ｓ６３）における教師用肌画像から教師用肌パッチ画像群を取得する手法は、本分析方法における工程（Ｓ３２）と同じであってもよいし、異なっていてもよい。教師用肌パッチ画像群の取得手法においても、工程（Ｓ３２）で述べた肌パッチ画像群の取得手法と同様に様々な手法が採用可能である。各取得手法の例については、工程（Ｓ３２）で述べたとおりであるため、ここでは説明を省略する。教師用肌パッチ画像群の取得手法は制限されない。 The method for acquiring the teacher skin patch image group from the teacher skin image in step (S63) may be the same as or different from step (S32) in this analysis method. As with the method for acquiring the skin patch image group described in step (S32), various methods can be used to acquire the teacher skin patch image group. Examples of each acquisition method are as described in step (S32), and therefore will not be described here. There are no restrictions on the method for acquiring the teacher skin patch image group.

続いて、ＣＰＵ１１は、工程（Ｓ６３）で取得された各教師用肌パッチ画像に対してそれぞれ画素値の正規化を行う（Ｓ６５）。例えば、ＣＰＵ１１は、一つの教師用肌パッチ画像において画素値（ＲＧＢ値）の平均値及び分散値を算出し、その教師用肌パッチ画像の各画素値をその平均値及び分散値を用いてそれぞれ線形演算することで、その教師用肌パッチ画像の画素値を正規化することができる。工程（Ｓ６５）における正規化手法は、本分析方法の工程（Ｓ３５）で行われる肌パッチ画像に対する正規化手法と同じであることが好ましい。工程（Ｓ６５）における正規化手法においても公知の様々な手法が利用できる。 Next, CPU 11 normalizes the pixel values of each teacher skin patch image acquired in step (S63) (S65). For example, CPU 11 can normalize the pixel values of a teacher skin patch image by calculating the average value and variance of the pixel values (RGB values) for one teacher skin patch image and then performing linear calculations on each pixel value of the teacher skin patch image using the average value and variance. The normalization method used in step (S65) is preferably the same as the normalization method used for skin patch images in step (S35) of this analysis method. Various well-known methods can also be used for the normalization method used in step (S65).

工程（Ｓ６３）で取得された教師用肌パッチ画像群に含まれる全ての教師用肌画像に関して正規化（Ｓ６５）が完了すると、ＣＰＵ１１は、対象の組合せに係る教師用肌画像に対応する正解情報を工程（Ｓ６５）で正規化された教師用肌パッチ画像に対して共通して関連付けた教師データを生成する（Ｓ６７）。 Once normalization (S65) is complete for all teacher skin images included in the group of teacher skin patch images obtained in step (S63), the CPU 11 generates teacher data (S67) that commonly associates the correct answer information corresponding to the teacher skin images related to the target combination with the teacher skin patch images normalized in step (S65).

このように、工程（Ｓ６１）で取得された教師用肌画像と正解情報との全ての組合せに関して工程（Ｓ６３）から工程（Ｓ６７）が実行されることで、各教師用肌パッチ画像が取得された元の教師用肌画像に対応する正解情報と、正規化された教師用肌パッチ画像群の各々とが関連付けられた教師データ群が生成される。
ＣＰＵ１１は、生成された教師データ群を用いて当該判別モデルを学習させる（Ｓ６９）。本実施形態ではディープラーニングにより当該判別モデルの学習が行われる。但し、判別モデルに対する具体的な学習アルゴリズムは何ら限定されない。 In this way, by executing steps (S63) to (S67) for all combinations of teacher skin images and correct answer information obtained in step (S61), a group of teacher data is generated in which correct answer information corresponding to the original teacher skin image from which each teacher skin patch image was obtained is associated with each of the normalized teacher skin patch images.
The CPU 11 trains the discriminant model using the generated training data group (S69). In this embodiment, the discriminant model is trained by deep learning. However, the specific learning algorithm for the discriminant model is not limited in any way.

図６には、複数の工程（処理）が順番に表されているが、本学習方法の各工程の実行順序は、図６の例のみに限定されない。例えば、教師用肌パッチ画像群の全ての教師用肌パッチ画像に対して正規化が完了した後に工程（Ｓ６７）を実行するのではなく、個々の教師用肌パッチ画像に関して、正規化（Ｓ６５）及び正解情報との関連付け（Ｓ６７）をそれぞれ一つの流れで実行するようにしてもよい。また、工程（Ｓ６３）において教師用肌パッチ画像群をすべて取得した後に工程（Ｓ６５）以降を実行するのではなく、工程（Ｓ６３）で一つの教師用肌パッチ画像が取得される度に工程（Ｓ６５）以降を実行するようにしてもよい。 While Figure 6 shows multiple steps (processing) in sequence, the order in which the steps of this learning method are performed is not limited to the example of Figure 6. For example, rather than performing step (S67) after normalization has been completed for all teacher skin patch images in the teacher skin patch image group, normalization (S65) and association with correct answer information (S67) may be performed in a single flow for each individual teacher skin patch image. Also, rather than performing step (S65) and subsequent steps after acquiring all of the teacher skin patch image group in step (S63), step (S65) and subsequent steps may be performed each time a teacher skin patch image is acquired in step (S63).

本分析方法では、上述のように、一つの教師用肌画像から取得される教師用肌パッチ画像群に対してその教師用肌画像の被写肌の属性又は状態を示す正解情報が共通して関連付けられた教師データを用いて、判別モデルが学習される。
これにより、判別モデルの精度を高めるためには大量の教師データが必要とされるところ、少数の教師用肌画像から多数の教師用肌パッチ画像群を取得することができるため、効率よく高精度の判別モデルを構築することができる。
また、各肌パッチ画像に関連付けられる正解情報が元の教師用肌画像に写る肌全体の共通の属性又は状態を示す正解情報とされることで、照明等のような教師用肌画像の撮影環境のばらつきや顔の局所部位の特徴差による影響を抑えて、高精度な判別モデルを構築することができる。 In this analysis method, as described above, a discriminant model is trained using training data in which correct answer information indicating the attributes or condition of the subject skin of a group of training skin patch images obtained from a single training skin image is commonly associated with the group of training skin patch images obtained from the training skin image.
This allows for the efficient construction of a highly accurate discrimination model, since a large number of training skin patch images can be obtained from a small number of training skin images, whereas a large amount of training data is required to improve the accuracy of the discrimination model.
Furthermore, by making the correct answer information associated with each skin patch image correct answer information that indicates the common attributes or state of the entire skin appearing in the original teacher skin image, it is possible to suppress the effects of variations in the shooting environment of the teacher skin image, such as lighting, and differences in the characteristics of local areas of the face, thereby constructing a highly accurate discrimination model.

上記の実施形態及び変形例の一部又は全部は、次のようにも特定され得る。但し、上述の実施形態及び変形例が以下の記載に制限されるものではない。 Some or all of the above-described embodiments and variations may also be specified as follows. However, the above-described embodiments and variations are not limited to the following descriptions.

＜１＞教師用肌画像の被写肌共通の属性又は状態を示す正解情報と該教師用肌画像から複数抽出される所定画像サイズの教師用肌パッチ画像群との複数の組合せを含む複数の教師データに基づいて機械学習されている学習済みの判別モデルを利用可能な一以上のプロセッサが、
被評価者の肌が写る評価用肌画像を取得する工程と、
前記取得された評価用肌画像から所定画像サイズの肌パッチ画像群を取得する工程と、
前記取得された各肌パッチ画像に対してそれぞれ画素値の正規化を行う工程と、
正規化された各肌パッチ画像をそれぞれ前記判別モデルに入力することで、各肌パッチ画像に関する肌属性又は肌状態の指標値をそれぞれ取得する工程と、
を実行する肌画像分析方法。 <1> One or more processors that can use a trained discrimination model that has been machine-learned based on a plurality of training data including a plurality of combinations of correct answer information indicating a common attribute or state of subject skin of a training skin image and a group of training skin patch images of a predetermined image size extracted from the training skin image,
A step of acquiring an evaluation skin image showing the skin of the person to be evaluated;
acquiring a group of skin patch images of a predetermined image size from the acquired skin images for evaluation;
normalizing pixel values of each of the acquired skin patch images;
inputting each normalized skin patch image into the discriminant model to obtain a skin attribute or skin condition index value for each skin patch image;
Perform skin image analysis method.

＜２＞前記一以上のプロセッサが、
前記取得された各指標値の、各肌パッチ画像の被写肌領域の位置に応じた分布を示す、指標値分布情報を生成する工程
を更に実行する＜１＞に記載の肌画像分析方法。
＜３＞前記取得された評価用肌画像に前記取得された各指標値を各肌パッチ画像の被写肌領域の位置に応じてマッピングさせたカラーマップ画像を前記指標値分布情報として生成する、
＜２＞に記載の肌画像分析方法。
＜４＞前記一以上のプロセッサが、
前記取得された肌パッチ画像ごとの指標値に基づいて、前記被評価者の肌属性又は肌状態の代表評価値を算出する工程、
を更に実行する＜１＞から＜３＞のいずれか一つに記載の肌画像分析方法。
＜５＞前記一以上のプロセッサが、
前記取得された評価用肌画像中の所望の画像領域を指定する領域指定情報を取得する工程と、
前記取得された領域指定情報が示す画像領域に対応する肌パッチ画像群に関して取得された前記指標値に基づいて、該画像領域の肌属性又は肌状態の代表評価値を算出する工程と、
を更に実行する＜１＞から＜４＞のいずれか一つに記載の肌画像分析方法。
＜６＞前記教師データに含まれる前記教師用肌画像の被写肌共通の状態を示す前記正解情報は、素肌又は化粧肌のいずれか一方を示し、
前記各肌パッチ画像に関して素肌感又は化粧感の指標値が肌状態の指標値としてそれぞれ取得される、
＜１＞から＜５＞のいずれか一つに記載の肌画像分析方法。
＜７＞前記教師データに含まれる前記教師用肌画像の被写肌共通の属性を示す前記正解情報は、男性又は女性のいずれか一方を示し、
前記各肌パッチ画像に関して男性肌らしさ又は女性肌らしさの指標値が肌属性の指標値としてそれぞれ取得される、
＜１＞から＜５＞のいずれか一つに記載の肌画像分析方法。
＜８＞前記教師データに含まれる前記教師用肌画像の被写肌共通の属性を示す前記正解情報は、該教師用肌画像の被写体の実年齢又は実年齢帯を示し、
前記各肌パッチ画像に関して、肌の見た目年齢帯ごとの指標値又は見た目年齢が肌属性の指標値としてそれぞれ取得される、
＜１＞から＜５＞のいずれか一つに記載の肌画像分析方法。
＜９＞前記教師データに含まれる前記教師用肌画像の被写肌共通の状態を示す前記正解情報は、化粧塗布直後又は化粧塗布から所定時間経過後のいずれか一方の状態を示し、
前記各肌パッチ画像に関して化粧崩れ度が肌状態の指標値としてそれぞれ取得される、
＜１＞から＜５＞のいずれか一つに記載の肌画像分析方法。
＜１０＞前記教師データに含まれる前記教師用肌画像の被写肌共通の状態を示す前記正解情報は、スキンケア剤塗布前又はスキンケア剤塗布後のいずれか一方の状態を示し、
前記各肌パッチ画像に関して肌のうるおい感の指標値が肌状態の指標値としてそれぞれ取得される、
＜１＞から＜５＞のいずれか一つに記載の肌画像分析方法。
＜１１＞前記肌パッチ画像に写る肌領域は、被写体の１ｃｍ^２以上２０ｃｍ^２以下の肌領域である、
＜１＞から＜１０＞のいずれか一つに記載の肌画像分析方法。
＜１２＞前記判別モデルは、ディープニューラルネットワークを含む畳み込みニューラルネットワークで構成されている、
＜１＞から＜１１＞のいずれか一つに記載の肌画像分析方法。 <2> The one or more processors
The skin image analysis method according to <1> further comprises: generating index value distribution information indicating a distribution of the acquired index values according to the position of the subject skin region of each skin patch image.
<3> A color map image is generated as the index value distribution information by mapping each of the acquired index values onto the acquired skin image for evaluation according to the position of the skin region of each skin patch image.
The skin image analysis method according to <2>.
<4> The one or more processors
A step of calculating a representative evaluation value of the skin attributes or skin condition of the subject based on the index value for each of the acquired skin patch images;
<4> The skin image analysis method according to any one of <1> to <3>, further comprising:
<5> The one or more processors
acquiring area designation information that designates a desired image area in the acquired skin image for evaluation;
calculating a representative evaluation value of a skin attribute or a skin condition of an image region based on the index values acquired for a group of skin patch images corresponding to the image region indicated by the acquired region designation information;
<4> The skin image analysis method according to any one of <1> to <4>, further comprising:
<6> The correct answer information indicating a common state of subject skin of the teacher skin image included in the teacher data indicates either bare skin or made-up skin,
a bare skin feel or a makeup feel index value is acquired for each of the skin patch images as a skin condition index value;
<5> The skin image analysis method according to any one of <1> to <5>.
<7> The correct answer information indicating a common attribute of subject skin of the teacher skin image included in the teacher data indicates either male or female,
An index value of male skin-likeness or female skin-likeness is acquired for each of the skin patch images as an index value of a skin attribute.
<5> The skin image analysis method according to any one of <1> to <5>.
<8> The correct answer information indicating a common attribute of the subject's skin of the teacher skin image included in the teacher data indicates an actual age or an actual age range of the subject of the teacher skin image,
For each of the skin patch images, an index value for each skin appearance age range or an appearance age is acquired as a skin attribute index value.
<5> The skin image analysis method according to any one of <1> to <5>.
<9> The correct answer information indicating a common state of the subject skin of the teacher skin image included in the teacher data indicates either a state immediately after makeup application or a state after a predetermined time has elapsed since makeup application,
A makeup smearing degree is acquired for each of the skin patch images as an index value of skin condition.
<5> The skin image analysis method according to any one of <1> to <5>.
<10> The correct answer information indicating a common state of the subject skin of the teacher skin image included in the teacher data indicates either a state before application of a skin care agent or a state after application of a skin care agent,
A skin moisturizing index value is acquired for each of the skin patch images as a skin condition index value.
<5> The skin image analysis method according to any one of <1> to <5>.
<11> The skin area shown in the skin patch image is a skin area of the subject that is 1 ^cm2 or more and 20 ^cm2 or less.
<10> The skin image analysis method according to any one of <1> to <10>.
<12> The discrimination model is configured by a convolutional neural network including a deep neural network.
<12> The skin image analysis method according to any one of <1> to <11>.

＜１３＞所定画像サイズの肌パッチ画像の入力に応じて該肌パッチ画像の肌属性又は肌状態を判別する判別モデルの学習方法であって、
教師用肌画像と該教師用肌画像の被写肌共通の属性又は状態を示す正解情報との組合せを複数取得する工程と、
前記取得された教師用肌画像から肌領域が所定の割合以上含まれる所定画像サイズの教師用肌パッチ画像群を取得する工程と、
前記取得された各教師用肌パッチ画像に対してそれぞれ画素値の正規化を行う工程と、
各教師用肌パッチ画像が取得された元の前記教師用肌画像に対応する前記正解情報と、前記正規化された教師用肌パッチ画像群の各々とを関連付けた教師データ群を生成する工程と、
前記教師データ群を用いて前記判別モデルを学習させる行程と、
を含む判別モデルの学習方法。
＜１４＞
前記一以上のプロセッサ及びメモリを少なくとも備える肌画像分析装置であって、
＜１＞から＜１２＞のいずれか一つに記載の肌画像分析方法を実行可能な肌画像分析装置。
＜１５＞＜１３＞に記載の判別モデルの学習方法を実行可能な判別モデル学習装置。 <13> A method for learning a discriminant model that discriminates a skin attribute or a skin condition of a skin patch image in response to an input of the skin patch image of a predetermined image size, the method comprising:
obtaining a plurality of combinations of teacher skin images and correct answer information indicating attributes or states common to the subject skin of the teacher skin images;
a step of acquiring a group of teacher skin patch images of a predetermined image size, each containing a skin region at a predetermined ratio or more from the acquired teacher skin images;
normalizing pixel values of each of the acquired teacher skin patch images;
generating a group of teacher data in which the correct answer information corresponding to the teacher skin image from which each teacher skin patch image was obtained is associated with each of the normalized teacher skin patch images;
a step of training the discriminant model using the training data group;
A method for training discriminant models including:
<14>
A skin image analysis device comprising at least the one or more processors and a memory,
A skin image analysis device capable of executing the skin image analysis method according to any one of <1> to <12>.
<15> A discriminant model learning device capable of executing the discriminant model learning method according to <13>.

以下に複数の実施例を挙げ、上述の内容を更に詳細に説明する。但し、以下の各実施例の記載は、上述の内容に何ら限定を加えるものではない。
以下の各実施例では、ＶＧＧ１６と呼ばれる１６層からなる畳み込みニューラルネットワークモデルをベースにして転移学習（重みの微調整を含むファインチューニング）により構築された判別モデルが利用された。ＶＧＧ１６は、２２４×２２４ピクセルの矩形画像を入力して約１０００カテゴリに分類するモデルであるので、それが転移学習により肌属性又は肌状態を示す２つのカテゴリに分類するように調整された。但し、上述したとおり、判別モデルの構成はこのような例に限定されない。 The above content will be explained in more detail below with reference to several examples, but the description of each example below does not impose any limitations on the above content.
In the following examples, a discriminant model constructed by transfer learning (fine tuning including fine adjustment of weights) based on a 16-layer convolutional neural network model called VGG16 was used. VGG16 is a model that inputs rectangular images of 224 x 224 pixels and classifies them into approximately 1,000 categories, and was adjusted by transfer learning to classify the images into two categories indicating skin attributes or skin conditions. However, as mentioned above, the configuration of the discriminant model is not limited to this example.

実施例１では、教師用肌パッチ画像に写る被写体の肌領域のサイズ（以降、肌領域サイズと表記する）を様々に変えて判別モデルの判別精度が検証された。実施例１では、各教師用肌パッチ画像及び各評価用肌パッチ画像は２２４×２２４ピクセルの矩形画像とされ、それの肌領域サイズとして４．５ｍｍ角（ｍｍ四方）、８．９ｍｍ角、１１．１ｍｍ角、１３．４ｍｍ角、１６．０ｍｍ角、１７．８ｍｍ角、２６．７ｍｍ角、３５．６ｍｍ角、４４．５ｍｍ角の９種が用いられた。
各教師用肌パッチ画像がこのような９種の肌領域サイズとなるような撮影条件で２６９名分の教師用肌画像が撮像され、各教師用肌画像からそれぞれ教師用肌パッチ画像群が取得され、上述の学習方法により判別モデルが学習された。即ち、９種の肌領域サイズに対応する９種の教師データ群を用いて学習された９種の判別モデルが構築された。
一方で、同じ２６９名分の評価用肌画像が４種類の異なる照明条件で撮像され、各評価用肌画像からそれぞれ評価用肌パッチ画像群が取得され、各評価用肌パッチ画像を学習済みの判別モデルに入力し、その出力が評価された。具体的には、９種の肌領域サイズに対応する各肌パッチ画像が、対応する肌領域サイズの教師データで学習された判別モデルにそれぞれ入力された。 In Example 1, the size of the skin area of the subject appearing in the teacher skin patch image (hereinafter referred to as skin area size) was varied to verify the discrimination accuracy of the discrimination model. In Example 1, each teacher skin patch image and each evaluation skin patch image was a rectangular image of 224 x 224 pixels, and nine types of skin area sizes were used: 4.5 mm square (mm square), 8.9 mm square, 11.1 mm square, 13.4 mm square, 16.0 mm square, 17.8 mm square, 26.7 mm square, 35.6 mm square, and 44.5 mm square.
Teacher skin images of 269 people were captured under shooting conditions such that each teacher skin patch image had these nine skin region sizes, and teacher skin patch image groups were obtained from each teacher skin image, and discriminant models were trained using the above-mentioned training method. That is, nine discriminant models were constructed that were trained using nine teacher data groups corresponding to the nine skin region sizes.
On the other hand, evaluation skin images of the same 269 people were captured under four different lighting conditions, and a group of evaluation skin patch images was obtained from each evaluation skin image. Each evaluation skin patch image was input into a trained discrimination model, and the output was evaluated. Specifically, each skin patch image corresponding to nine skin region sizes was input into a discrimination model trained with training data for the corresponding skin region size.

図７は、肌領域サイズと判別モデルの判別精度との関係を示すグラフである。図７に示されるように、実施例１によれば、肌領域サイズが判別モデルの判別精度に影響を与えることが実証された。特に、肌領域サイズが１７．８ｍｍ角とされる場合に、最も高い判別精度を示すことが確認された。
また、肌領域サイズが８．９ｍｍ角（約０．７９ｃｍ^２）より大きければ、概ね６０％以上の判別精度を得ることができ、１６．０ｍｍ角（約２．５６ｃｍ^２）以上かつ４４．５ｍｍ角（約１９．８ｃｍ^２）以下の範囲であれば、９０％以上の判別精度をえることができることが実証された。
これにより、評価用肌パッチ画像及び教師用肌パッチ画像に写る被写体の肌領域は、被写体の１ｃｍ^２以上２０ｃｍ^２以下の肌領域であることが好ましく、３ｃｍ^２以上１９ｃｍ^２以下の肌領域であることが更に好ましいといえる。 7 is a graph showing the relationship between the size of the skin region and the discrimination accuracy of the discrimination model. As shown in FIG. 7, Example 1 demonstrated that the size of the skin region affects the discrimination accuracy of the discrimination model. In particular, it was confirmed that the highest discrimination accuracy was achieved when the skin region size was 17.8 mm square.
It was also demonstrated that if the skin area size is larger than 8.9 mm square (approximately 0.79 cm ² ), a discrimination accuracy of approximately 60% or more can be achieved, and if the skin area size is in the range of 16.0 mm square (approximately 2.56 cm ² ) or more and 44.5 mm square (approximately 19.8 cm ² ) or less, a discrimination accuracy of 90% or more can be achieved.
Therefore, it can be said that the skin area of the subject appearing in the evaluation skin patch image and the teacher skin patch image is preferably a skin area of the subject of 1 ^cm2 or more and 20 ^cm2 or less, and more preferably a skin area of 3 ^cm2 or more and 19 ^cm2 or less.

実施例２では、人の目視評価による判別精度と比較することで、本分析方法で得られる各肌パッチ画像に関する化粧感の指標値の精度について検証された。
目視評価では、５名の訓練された判定者が顔画像を見てベースメイク化粧料が塗布されているか否かが判定された。顔画像としては、２６９名の被写体に関してベースメイク化粧料が塗布された正面顔とベースメイク化粧料が塗布されていない（素肌の）正面顔とがそれぞれ撮像され、合計５３８枚の正面顔画像が準備された。
一方で、その５３８枚の正面顔画像を教師用肌画像とし各教師用肌画像に対してベースメイク化粧料が塗布されているか否かの正解情報を付し、教師用肌画像ごとに肌パッチ画像群とその正解情報とを関連付けた教師データ群を用いて学習することで学習済みの判別モデルが構築された。
当該判別モデルは、素肌らしさ及び化粧肌らしさ（ベースメイク化粧料が塗布されているらしさ）の二つの指標値を出力するモデルとされた。
これに対して、当該２６９名の被検体に関して教師データとして用いていない左右横向きの顔画像についてベースメイク化粧料が塗布されたものとそうでないものとがそれぞれ撮像され、合計１０７６枚の横顔画像が評価用肌画像として準備された。そして、その評価用肌画像ごとに学習済みの判別モデルを用いた本分析方法を実行して各肌パッチ画像の素肌らしさ及び化粧肌らしさの指標値を取得し、被写体ごとに平均指標値が最大となる肌状態（素肌又は化粧肌）が本分析方法における推定肌状態（素肌又は化粧肌）に決定された。 In Example 2, the accuracy of the makeup feel index values for each skin patch image obtained by this analysis method was verified by comparing with the discrimination accuracy based on human visual evaluation.
In the visual evaluation, five trained judges viewed the facial images and judged whether or not base makeup was applied. For the facial images, frontal faces of 269 subjects were photographed both with and without base makeup (bare skin), resulting in a total of 538 frontal facial images.
On the other hand, the 538 frontal face images were used as training skin images, and correct answer information on whether or not base makeup cosmetics had been applied was attached to each training skin image.A trained discrimination model was constructed by learning using a training data group in which a group of skin patch images and their correct answer information were associated for each training skin image.
The discriminant model was a model that outputs two index values: bare skin resemblance and made-up skin resemblance (resemblance that base makeup cosmetic material has been applied).
In contrast, for the 269 subjects, left and right side-facing facial images not used as training data were captured with and without base makeup applied, resulting in a total of 1,076 side-facing facial images being prepared as evaluation skin images. Then, for each evaluation skin image, the present analysis method using the trained discriminant model was executed to obtain index values for bare skin-likeness and made-up skin-likeness for each skin patch image, and the skin condition (bare skin or made-up skin) with the largest average index value for each subject was determined as the estimated skin condition (bare skin or made-up skin) in the present analysis method.

結果、５名の判定者の判別精度では９３．１％が最高であったのに対して、本分析方法で得られる指標値を用いた判別精度は９７．２％であった。これにより、人の目視評価による判別精度よりも本分析方法で得られる各肌パッチ画像に関する素肌らしさ又は化粧肌らしさ（化粧感）の指標値の精度のほうが高いことが実証された。 As a result, the highest discrimination accuracy among the five judges was 93.1%, while the discrimination accuracy using the index values obtained with this analysis method was 97.2%. This demonstrates that the accuracy of the index values for bare skin or made-up skin (make-up look) for each skin patch image obtained with this analysis method is higher than the discrimination accuracy achieved by human visual evaluation.

実施例２では、更に、人の目視評価の判別と本分析方法で得られる指標値を用いた判別との相関性について検証された。目視評価において各判定者に化粧感を６段階のスコア（素肌を０とし、化粧感（化粧肌らしさの度合）を１から５の５段階で示す）で評定させ、本分析方法で得られる素肌らしさ又は化粧肌らしさ（化粧感）の指標値（－５から２０の範囲の値）をそのまま化粧感スコアとした。そして５名の判定者の化粧感スコアの平均値と、本分析方法で得られる化粧感スコアとの相関が分析された。
図８は、人の目視評価で得られた化粧感スコアと本分析方法で得られた化粧感スコアとの関係を示すグラフである。図８では、縦軸に本分析方法で得られた化粧感スコアが示されており、横軸に人の目視評価で得られた化粧感スコアが示されている。
結果、図８に示されるように、５名の判定者の化粧感スコアの平均値と本分析方法で得られる化粧感スコアとの間には相関係数０．７２で相関が認められた。 In Example 2, the correlation between the judgment based on human visual evaluation and the judgment based on the index value obtained by this analysis method was further verified. In the visual evaluation, each judge rated the makeup feel using a 6-point score (bare skin being 0, and the makeup feel (degree of makeup-likeness) being indicated on a 5-point scale from 1 to 5). The index value (a value ranging from -5 to 20) for bare skin-likeness or makeup-likeness (makeup feel) obtained by this analysis method was used directly as the makeup feel score. The correlation between the average makeup feel scores of the five judges and the makeup feel score obtained by this analysis method was then analyzed.
8 is a graph showing the relationship between the makeup feel score obtained by human visual evaluation and the makeup feel score obtained by the present analysis method. In Fig. 8, the vertical axis shows the makeup feel score obtained by the present analysis method, and the horizontal axis shows the makeup feel score obtained by human visual evaluation.
As a result, as shown in FIG. 8, a correlation was observed between the average makeup feel scores of the five judges and the makeup feel score obtained by this analysis method, with a correlation coefficient of 0.72.

実施例３では、本分析方法で得られる肌パッチ画像ごとの化粧感の指標値が様々な化粧料塗布態様に適応可能であるかについて検証された。
評価用肌画像として、被評価者の素肌の顔画像に加えて７種の化粧料塗布態様の化粧顔画像が準備された。７種の化粧料塗布態様の化粧顔画像とは、スキンケア化粧料塗布後の化粧顔画像（スキンケアと表記）、スキンケア化粧料及びサンスクリーン化粧料を塗布した後の化粧顔画像（＋サンスクリーンと表記）、スキンケア化粧料・サンスクリーン化粧料・第一ベースメイク化粧料を塗布した後の化粧顔画像（＋下地と表記）、スキンケア化粧料・サンスクリーン化粧料・第一ベースメイク化粧料・パウダーファンデーション化粧料を塗布した後の化粧顔画像（＋ＰＦＤと表記）、スキンケア化粧料及び第二ベースメイク化粧料を塗布した後の化粧顔画像（＋下地２と表記）、スキンケア化粧料・第二ベースメイク化粧料・クリームファンデーション化粧料を塗布した後の化粧顔画像（＋クリームＦＤと表記）、スキンケア化粧料・第二ベースメイク化粧料・クリームファンデーション化粧料・フェイスパウダー化粧料を塗布した後の化粧顔画像（＋ルースＰＤと表記）である。
そして、素肌顔画像を加えた８種の評価用肌画像の各々について肌パッチ画像群を取得し、各肌パッチ画像を当該判別モデルにそれぞれ入力することで、各肌パッチ画像の化粧感の指標値がそれぞれ取得された。最終的に、元の評価用肌画像ごとに各肌パッチ画像の化粧感の代表指標値（平均値）が算出された。 In Example 3, it was verified whether the index value of the makeup feel for each skin patch image obtained by this analysis method is applicable to various cosmetic application modes.
As skin images for evaluation, in addition to a facial image of the subject's bare skin, makeup facial images of seven different cosmetic application modes were prepared. The makeup facial images of the seven cosmetic application modes are a makeup facial image after application of a skin care cosmetic (denoted as "skin care"), a makeup facial image after application of a skin care cosmetic and a sunscreen cosmetic (denoted as "+sunscreen"), a makeup facial image after application of a skin care cosmetic, a sunscreen cosmetic, and a first base makeup cosmetic (denoted as "base"), a makeup facial image after application of a skin care cosmetic, a sunscreen cosmetic, a first base makeup cosmetic, and a powder foundation cosmetic (denoted as "+PFD"), a makeup facial image after application of a skin care cosmetic and a second base makeup cosmetic (denoted as "base 2"), a makeup facial image after application of a skin care cosmetic, a second base makeup cosmetic, and a cream foundation cosmetic (denoted as "+cream FD"), and a makeup facial image after application of a skin care cosmetic, a second base makeup cosmetic, a cream foundation cosmetic, and a face powder cosmetic (denoted as "loose PD").
Then, a group of skin patch images was obtained for each of the eight evaluation skin images, including the bare skin face image, and each skin patch image was input into the discriminant model to obtain an index value for the makeup look of each skin patch image. Finally, a representative index value (average value) for the makeup look of each skin patch image was calculated for each of the original evaluation skin images.

図９は、化粧料塗布態様が異なる８種の評価用肌画像の各々に関して本分析方法で取得された化粧感の代表指標値を示すグラフである。
図９では、素肌の顔画像に関して算出された化粧感の代表指標値が最も低く、スキンケア化粧料塗布後の顔画像に関して算出された化粧感の代表指標値が次に低くなっている。その他の化粧料塗布態様の化粧顔画像に関する化粧感の代表指標値については、＋サンスクリーンと表記される化粧顔画像、＋下地２と表記される化粧顔画像、＋下地と表記される化粧顔画像、＋ＰＦＤと表記される化粧顔画像、＋クリームＦＤと表記される化粧顔画像、＋ルースＰＤと表記される化粧顔画像の順に、化粧感の代表指標値が高くなっている。
この結果は、各化粧料塗布態様の化粧顔に関する見た目の印象と概ね一致しており、本分析方法によれば、スキンケア化粧料やサンスクリーン化粧料、ベースメイク化粧料の塗布に伴う化粧感も検出することができ、更には、化粧料の重ね塗りによる化粧感の増加についても適切に検出することができるといえる。即ち、本分析方法で得られる肌パッチ画像ごとの化粧感の指標値が様々な化粧料塗布態様に適応可能であることが実証された。 FIG. 9 is a graph showing representative index values of the makeup feel obtained by this analysis method for each of eight types of skin images for evaluation that have different cosmetic application patterns.
9, the representative index value of the makeup feel calculated for the bare skin face image is the lowest, and the representative index value of the makeup feel calculated for the face image after the skin care cosmetic has been applied is next lowest. With regard to the representative index values of the makeup feel calculated for the makeup face images with other cosmetic application modes, the representative index values of the makeup feel increase in the order of the makeup face image labeled +Sunscreen, the makeup face image labeled +Base 2, the makeup face image labeled +Base, the makeup face image labeled +PFD, the makeup face image labeled +Cream FD, and the makeup face image labeled +Loose PD.
These results were generally consistent with the visual impression of the made-up face for each cosmetic application mode, and it can be said that this analysis method can detect the makeup feel associated with the application of skin care cosmetics, sunscreen cosmetics, and base makeup cosmetics, and can also appropriately detect the increase in makeup feel caused by the application of multiple cosmetics. In other words, it was demonstrated that the makeup feel index values obtained for each skin patch image using this analysis method can be applied to various cosmetic application modes.

図１０は、化粧料塗布態様が異なる８種の評価用肌画像の各々に関する肌パッチ画像群の指標値分布を示すカラーマップ画像を示す図である。図１０は、図面制約により二値画像とせざるを得ず分かりづらいが、黒に近づく程、化粧感の指標値が高くなり（化粧肌らしさが大きくなり）、白に近づく程、化粧感の指標値が低くなる（素肌らしさが大きくなる）ことを示している。
このように図１０で示されるカラーマップ画像によっても、本分析方法で得られる肌パッチ画像ごとの化粧感の指標値が様々な化粧料塗布態様に適応可能であることが分かる。更に言えば、肌パッチ画像群の指標値分布を示すカラーマップ画像によれば、顔全体の中で化粧感が高い部位と低い部位とを細かい小区画領域ごとに把握し易く提示することができる。 10 is a diagram showing a color map image showing the index value distribution of a group of skin patch images for each of eight types of evaluation skin images with different cosmetic application patterns. Fig. 10 is difficult to understand because it is a binary image due to drawing restrictions, but it shows that the closer to black the image, the higher the makeup index value (the more similar it is to made-up skin), and the closer to white the image, the lower the makeup index value (the more similar it is to natural skin).
10 also shows that the makeup look index values for each skin patch image obtained by this analysis method are applicable to various cosmetic application patterns. Furthermore, the color map image showing the index value distribution of the skin patch images makes it easy to grasp which areas of the face have a high and low makeup look for each small, divided region.

実施例４では、本分析方法で得られる各肌パッチ画像の肌の見た目年齢帯ごとの指標値の精度が検証された。
実施例４では、２０代から７０代までの各年齢帯に属する５１２名の女性を被写体とし素肌の顔画像とファンデーション化粧料塗布後の顔画像とがそれぞれ撮像され、各顔画像からそれぞれ肌パッチ画像群が取得され、合計４３０００個の肌パッチ画像が取得された。それら肌パッチ画像群の７割が教師用肌パッチ画像群として判別モデルの学習に用いられ、残りの３割が評価用肌パッチ画像群として当該判別モデルを用いた本分析方法に用いられた。
当該判別モデルは、２０代から７０代までの６つの各年齢帯の指標値をそれぞれ出力するモデルとされた。本分析方法では、当該判別モデルにより当該評価用肌パッチ画像群に含まれる各肌パッチ画像に関して肌の見た目年齢帯ごとの指標値がそれぞれ取得された。本実施例では更に、当該判別モデルから得られた各指標値を独立変数とした線形回帰により肌パッチごとの推定年齢が取得され、被写体ごとに得られた全肌パッチの推定年齢の平均が本分析方法における推定年齢に決定された。 In Example 4, the accuracy of the index values for each skin appearance age range of each skin patch image obtained by this analysis method was verified.
In Example 4, 512 women in various age groups from their 20s to 70s were photographed as subjects, with bare skin images and facial images after application of foundation cosmetics, and a group of skin patch images was obtained from each facial image, resulting in a total of 43,000 skin patch images. 70% of the group of skin patch images were used as a training group of skin patch images for training a discriminant model, and the remaining 30% were used as an evaluation group of skin patch images for this analysis method using the discriminant model.
The discriminant model was a model that output index values for each of six age ranges from 20s to 70s. In this analysis method, the discriminant model was used to obtain index values for each skin appearance age range for each skin patch image included in the group of evaluation skin patch images. In this example, an estimated age for each skin patch was obtained by linear regression using each index value obtained from the discriminant model as an independent variable, and the average estimated age for all skin patches obtained for each subject was determined as the estimated age in this analysis method.

図１１は、本分析方法で得られる推定年齢と実年齢との関係を示すグラフである。図１１において縦に延びる線は、実年齢帯ごとに本分析方法で得られた推定年齢の範囲を示し、横に延びる線は、実年齢帯ごとの被写体の実年齢の範囲を示し、丸印は、実年齢帯ごとの被写体の実年齢の平均値と本分析方法で得られた推定年齢の平均値を示す。
結果、本分析方法で得られる推定年齢と実年齢とは相関係数０．７７で相関があることが確認され、予測標準誤差が１０．７７であることが確認された。
即ち、本分析方法で得られる各肌パッチ画像の肌の見た目年齢帯ごとの指標値及び推定年齢が高精度で得られていることが実証された。 11 is a graph showing the relationship between estimated age and actual age obtained by this analysis method. In Fig. 11, the vertical lines indicate the range of estimated ages obtained by this analysis method for each actual age band, the horizontal lines indicate the range of actual ages of the subjects for each actual age band, and the circles indicate the average actual ages of the subjects for each actual age band and the average estimated age obtained by this analysis method.
As a result, it was confirmed that there was a correlation between the estimated age obtained by this analysis method and the actual age with a correlation coefficient of 0.77, and the standard error of prediction was confirmed to be 10.77.
In other words, it was demonstrated that the index values and estimated ages for each skin appearance age range of each skin patch image obtained by this analysis method can be obtained with high accuracy.

実施例５では、本分析方法で得られる各肌パッチ画像の男性肌らしさ又は女性肌らしさの指標値の精度が検証された。
実施例５では、男性２６名、女性２６名の合計５２名を被写体として素肌の顔画像がそれぞれ撮像され、各顔画像からそれぞれ肌パッチ画像群が取得され、合計１２００個の肌パッチ画像が取得された。それら肌パッチ画像群の７割が教師用肌パッチ画像群として判別モデルの学習に用いられ、残りの３割が評価用肌パッチ画像群として当該判別モデルを用いた本分析方法に用いられた。
当該判別モデルは、男性肌らしさ及び女性肌らしさの２つの指標値をそれぞれ出力するモデルとされた。本分析方法では、当該評価用肌パッチ画像群に含まれる各肌パッチ画像に関して男性肌らしさ及び女性肌らしさの指標値がそれぞれ取得され、指標値が大きいほうの性別が各肌パッチ画像の推定性別に決定された。 In Example 5, the accuracy of the index values of male skin-likeness or female skin-likeness of each skin patch image obtained by this analysis method was verified.
In Example 5, bare skin facial images were captured for a total of 52 subjects, 26 men and 26 women, and a group of skin patch images was obtained from each facial image, resulting in a total of 1,200 skin patch images. 70% of the group of skin patch images were used as a training group of skin patch images for training a discriminant model, and the remaining 30% were used as an evaluation group of skin patch images for this analysis method using the discriminant model.
The discriminant model was a model that outputs two index values, one for male skin-likeness and one for female skin-likeness. In this analysis method, the index values for male skin-likeness and one for female skin-likeness were obtained for each skin patch image included in the group of evaluation skin patch images, and the gender with the larger index value was determined as the estimated gender for each skin patch image.

結果、本分析方法で肌パッチ画像ごとに得られる推定性別が９７．１％の判別精度を示した。即ち、本分析方法で得られる各肌パッチ画像の男性肌らしさ又は女性肌らしさの指標値が高い精度を示すことが実証された。
図１２は、５名の被評価者の評価用肌画像の各々に関する肌パッチ画像群の指標値分布を示すカラーマップ画像を示す図である。図１２は、図面制約により二値画像とせざるを得ず分かりづらいが、黒に近づく程、男性肌らしさの指標値が高くなり（女性肌らしさの指標値が低くなり）、白に近づく程、男性肌らしさの指標値が低くなる（女性肌らしさの指標値が高くなる）ことを示している。
このように図１２で示されるカラーマップ画像によれば、顔全体の中で男性肌らしさ又は女性肌らしさが高い部位と低い部位とを細かい小区画領域ごとに把握し易く提示することができる。 As a result, the estimated gender obtained for each skin patch image using this analysis method showed a discrimination accuracy of 97.1%, demonstrating that the index value of the male skin-likeness or female skin-likeness of each skin patch image obtained using this analysis method shows high accuracy.
12 is a diagram showing a color map image showing the index value distribution of the skin patch image group for each of the evaluation skin images of the five evaluators. Although Fig. 12 is difficult to understand due to the constraints of the drawing, which necessitates a binary image, it shows that the closer to black the color, the higher the index value of male skin-likeness (the lower the index value of female skin-likeness), and the closer to white the color, the lower the index value of male skin-likeness (the higher the index value of female skin-likeness).
In this way, the color map image shown in FIG. 12 makes it possible to easily grasp the areas of the entire face that are highly masculine or feminine in appearance and the areas that are less masculine in appearance for each small divided area.

実施例６では、本分析方法で得られる各肌パッチ画像の化粧崩れ度の精度が検証された。
実施例６では、６０名の女性を被写体としてファンデーション化粧料塗布直後の顔画像とファンデーション化粧料塗布から６から１０時間経過後の顔画像とがそれぞれ撮像され、各顔画像からそれぞれ肌パッチ画像群が取得され、合計５０００個の肌パッチ画像が取得された。それら肌パッチ画像群の７割が教師用肌パッチ画像群として判別モデルの学習に用いられ、残りの３割が評価用肌パッチ画像群として当該判別モデルを用いた本分析方法に用いられた。
当該判別モデルは、化粧料塗布直後及び化粧料塗布から所定時間経過後の二つの指標値を出力するモデルとされた。本分析方法では、当該評価用肌パッチ画像群に含まれる各肌パッチ画像に関して化粧料塗布直後及び化粧料塗布から所定時間経過後の二つの指標値がそれぞれ取得され、指標値が大きいほうの状態（化粧料塗布直後又は化粧料塗布から所定時間経過後）が各肌パッチ画像の推定化粧料塗布状態に決定された。 In Example 6, the accuracy of the degree of makeup smearing of each skin patch image obtained by this analysis method was verified.
In Example 6, facial images of 60 women were taken immediately after application of a foundation cosmetic and 6 to 10 hours after application of the foundation cosmetic, and a group of skin patch images was obtained from each facial image, resulting in a total of 5,000 skin patch images. 70% of the group of skin patch images were used as a training group of skin patch images for training a discriminant model, and the remaining 30% were used as an evaluation group of skin patch images for this analysis method using the discriminant model.
The discriminant model was a model that output two index values: one immediately after cosmetic application and one after a predetermined time had elapsed since cosmetic application. In this analysis method, two index values were obtained for each skin patch image included in the group of evaluation skin patch images: one immediately after cosmetic application and one after a predetermined time had elapsed since cosmetic application, and the state with the larger index value (either immediately after cosmetic application or the predetermined time had elapsed since cosmetic application) was determined to be the estimated cosmetic application state for each skin patch image.

結果、本分析方法で肌パッチ画像ごとに得られる推定化粧料塗布状態が７８．４％の判別精度を示した。化粧崩れは、顔全体に一律に生じる場合もあるが、部分的に生じる場合も多い。このことを考慮すれば、この判別精度の値（７８．４％）は十分に高い精度を示すといえる。即ち、本分析方法で得られる各肌パッチ画像の化粧料塗布直後及び化粧料塗布から所定時間経過後の二つの指標値が高い精度を示すことが実証された。また、この二つの指標値は化粧崩れ度を示すといえる。
図１３は、二種類のファンデーション化粧料の塗布直後と塗布から４時間、８時間及び１０時間の各経過後とにおける各評価用肌画像に関する肌パッチ画像群の指標値分布を示すカラーマップ画像を示す図である。同一女性を被写体としており、図１３（ａ）と図１３（ｂ）とで利用したファンデーション化粧料が異なっている。
図１３は、図面制約により二値画像とせざるを得ず分かりづらいが、黒に近づく程、塗布から所定時間経過後を示す指標値が高くなり（化粧崩れ度が高くなり）、白に近づく程、塗布から所定時間経過後を示す指標値が低くなる（化粧崩れ度が低くなる）ことを示している。
図１３によれば、時間経過に伴う化粧崩れ度の変化が、図１３（ａ）で用いられたファンデーション化粧料のほうが図１３（ｂ）で用いられたファンデーション化粧料よりも小さいことが分かる。
このように図１３で示されるカラーマップ画像によれば、化粧料の崩れ特性を客観的に評価することができる。また、このカラーマップ画像によれば、顔全体の中で化粧崩れ度が高い部位と低い部位とを細かい小区画領域ごとに把握し易く提示することができる。 As a result, the estimated cosmetic application state obtained for each skin patch image using this analysis method demonstrated a discrimination accuracy of 78.4%. While makeup smearing can occur uniformly across the entire face, it is often the case that it occurs only in parts. Considering this, this discrimination accuracy value (78.4%) can be said to be sufficiently high. In other words, it was demonstrated that the two index values obtained for each skin patch image obtained using this analysis method, immediately after cosmetic application and after a predetermined time has passed since cosmetic application, demonstrate high accuracy. Furthermore, these two index values can be said to indicate the degree of makeup smearing.
13A and 13B are diagrams showing color map images showing the index value distribution of the skin patch image group for each evaluation skin image immediately after application of two types of foundation cosmetics and 4, 8, and 10 hours after application. The same woman was used as the subject, but different foundation cosmetics were used in Fig. 13A and Fig. 13B.
Figure 13 is difficult to understand because it has to be a binary image due to drawing restrictions, but it shows that the closer to black the color, the higher the index value indicating a predetermined time after application (the greater the degree of makeup smearing), and the closer to white the color, the lower the index value indicating a predetermined time after application (the less the degree of makeup smearing).
13A and 13B, it can be seen that the change in the degree of makeup smearing over time is smaller for the foundation cosmetic used in FIG. 13A than for the foundation cosmetic used in FIG. 13B.
In this way, the cosmetic smearing characteristics can be objectively evaluated using the color map image shown in Fig. 13. Furthermore, this color map image makes it easy to grasp the areas of the face that are more likely to smear and the areas that are less likely to smear, divided into small regions.

実施例７では、本分析方法で得られる各肌パッチ画像の肌のうるおい感の指標値の精度が検証された。
実施例７では、８名の女性を被写体としてスキンケア剤塗布前の顔画像とスキンケア剤塗布後の顔画像とがそれぞれ撮像され、各顔画像からそれぞれ肌パッチ画像群が取得され、合計２３１個の肌パッチ画像が取得された。それら肌パッチ画像群の７割が教師用肌パッチ画像群として判別モデルの学習に用いられ、残りの３割が評価用肌パッチ画像群として当該判別モデルを用いた本分析方法に用いられた。
当該判別モデルは、スキンケア剤塗布前及びスキンケア剤塗布後の二つの指標値を出力するモデルとされた。本分析方法では、当該評価用肌パッチ画像群に含まれる各肌パッチ画像に関してスキンケア剤塗布前及びスキンケア剤塗布後の二つの指標値がそれぞれ取得され、指標値が大きいほうの状態（スキンケア剤塗布前又はスキンケア剤塗布後）が各肌パッチ画像の推定スキンケア塗布状態に決定された。 In Example 7, the accuracy of the index values of the skin moistness of each skin patch image obtained by this analysis method was verified.
In Example 7, facial images of eight women were captured before and after application of a skin care agent, and a group of skin patch images was obtained from each facial image, resulting in a total of 231 skin patch images. 70% of the group of skin patch images were used as a training group of skin patch images for training a discriminant model, and the remaining 30% were used as an evaluation group of skin patch images for this analysis method using the discriminant model.
The discriminant model was a model that outputs two index values, one before application of a skin care agent and one after application of a skin care agent. In this analysis method, two index values, one before application of a skin care agent and one after application of a skin care agent, were obtained for each skin patch image included in the group of skin patch images for evaluation, and the state with the larger index value (before application of a skin care agent or after application of a skin care agent) was determined to be the estimated skin care application state of each skin patch image.

図１４は、本学習方法におけるエポック数と判別モデルの判別精度との関係を示すグラフである。判別モデルの判別精度は、上述のように肌パッチ画像ごとに得られる推定スキンケア塗布状態の精度を示す。
図１４には、教師用肌パッチ画像の入力に応じた判別モデルの判別精度が教師データと表記する折れ線で示され、評価用肌パッチ画像の入力に応じた判別モデルの判別精度が検証用データと表記する折れ線で示されている。
結果、当該判別モデルが最適なエポック数（６４）で学習された場合にスキンケア剤塗布前又は塗布後の判別精度が８８．６％となることが確認された。即ち、本分析方法で得られる各肌パッチ画像のスキンケア剤塗布前及びスキンケア剤塗布後の二つの指標値が高い精度を示すことが実証された。この二つの指標値は、肌のうるおい感の指標値と呼ぶことができる。 14 is a graph showing the relationship between the number of epochs and the discrimination accuracy of the discriminant model in this learning method. The discrimination accuracy of the discriminant model indicates the accuracy of the estimated skin care application state obtained for each skin patch image, as described above.
In Figure 14, the discrimination accuracy of the discriminant model in response to the input of a teacher skin patch image is shown by a broken line labeled "teacher data," and the discrimination accuracy of the discriminant model in response to the input of an evaluation skin patch image is shown by a broken line labeled "verification data."
As a result, it was confirmed that when the discriminant model was trained using the optimal number of epochs (64), the discrimination accuracy before or after application of a skin care agent was 88.6%. In other words, it was demonstrated that the two index values obtained from each skin patch image before and after application of a skin care agent exhibit high accuracy. These two index values can be called index values of skin moisturization.

実施例８では、肌の目視評価スコアを推定する学習済みモデルを判別モデルとして用いて、本分析方法で得られる各肌パッチ画像の肌の目視評価スコアの精度が検証された。
まず、２０歳から３９歳の日本人女性８７名の素肌の顔全体が写る３４１枚の顔画像が準備された。この３４１枚の顔画像は、８７名の各被写体（一部を除く）を時期を変えてそれぞれ最大４回撮影することで取得された。
更に、１０名の判定者が目視により３４１枚の顔画像の肌を官能評価し、評価結果として１０項目の各評価項目について‐３ポイントから＋３ポイントの７段階の評価スコア（１０名の判定者の平均値）がそれぞれ付与され、最終的に、顔画像ごとに付与された１０個の各評価項目の評価スコアが正解情報とされた。
評価項目は、総合評価、肌の白さ、肌の黄み、潤い、乾燥感、ハリ、つや、透明感、肌表面の滑らかさ、及びキメの整いの１０項目である。
これにより、顔画像と、それの正解情報としての１０個の目視評価スコアとの３４１組の組合せが取得された。ここで、正解情報として取得された１０個の目視評価スコアは、評価項目ごとに平均０、分散１となるようにそれぞれ正規化された。 In Example 8, the accuracy of the visual skin evaluation score of each skin patch image obtained by this analysis method was verified using a trained model that estimates the visual skin evaluation score as a discriminant model.
First, 341 facial images were prepared, showing the entire bare face of 87 Japanese women aged between 20 and 39. These 341 facial images were obtained by photographing each of the 87 subjects (with some exceptions) up to four times at different times.
Furthermore, 10 judges visually evaluated the skin of 341 facial images, and as evaluation results, a 7-level evaluation score ranging from -3 points to +3 points (average value of the 10 judges) was assigned to each of the 10 evaluation items.Finally, the evaluation scores assigned to each facial image for each of the 10 evaluation items were determined to be the correct answer information.
The evaluation items were 10 items: overall evaluation, whiteness of the skin, yellowness of the skin, moisture, dryness, firmness, gloss, transparency, smoothness of the skin surface, and evenness of the skin texture.
As a result, 341 combinations of face images and 10 visual evaluation scores as correct answer information were obtained. Here, the 10 visual evaluation scores obtained as correct answer information were normalized so that the average and variance for each evaluation item were 0 and 1, respectively.

次に、当該顔画像に対して顔検出及び肌色検出を行い、肌領域が９０％以上含まれる所定画像サイズの肌パッチ画像群が各顔画像からそれぞれ取得された。本実施例では、各肌パッチ画像に写る被写体の肌領域は、被写体の１７．８ｍｍ四方の肌領域とされ、全ての顔画像から９３０６枚の肌パッチ画像が取得された。
更に、各肌パッチ画像の画素値が正規化され、正規化された各肌パッチ画像に対して、その肌パッチ画像の元となった顔画像の１０個の目視評価スコアがラベル（正解情報）として割り当てられた。
本実施例では、このように取得された肌パッチ画像群とそれのラベルとして割り当てられた１０個の目視評価スコアとの３４１組の組合せの中の７割が学習用の教師データとされ、残り３割が検証用のデータとされた。 Next, face detection and skin color detection were performed on the face images, and a group of skin patch images of a predetermined image size, each containing 90% or more of the skin area, was obtained from each face image. In this example, the skin area of the subject in each skin patch image was set to a 17.8 mm square skin area of the subject, and 9,306 skin patch images were obtained from all face images.
Furthermore, the pixel values of each skin patch image were normalized, and for each normalized skin patch image, the 10 visual evaluation scores of the face image from which the skin patch image was derived were assigned as labels (correct answer information).
In this example, 70% of the 341 combinations of the skin patch images thus obtained and the 10 visual evaluation scores assigned as their labels were used as training data, and the remaining 30% were used as verification data.

実施例８で用いられる判別モデル（以降、肌パッチ型モデルと表記）は、肌パッチ画像を入力し、１０項目の目視評価スコアの推定値を要素として持つ１０次元ベクトルを出力するモデルである。この判別モデルは、上述した学習用の教師データを用いて、出力ベクトルの要素である１０項目すべての推定値と実際の目視評価スコアとの二乗平均平方根誤差（ＲＭＳＥ）の評価項目間平均値が最小化されるよう学習された。
また、比較例として、上述の顔画像を入力し、１０項目の目視評価スコアの推定値を要素として持つ１０次元ベクトルを出力する比較用判別モデル（以降、全顔型モデルと表記）も生成された。 The discriminant model used in Example 8 (hereinafter referred to as the skin patch model) is a model that inputs a skin patch image and outputs a 10-dimensional vector having estimated values of visual evaluation scores for 10 items as elements. This discriminant model was trained using the training data described above so as to minimize the average value between the root mean square errors (RMSE) of all 10 items, which are elements of the output vector, and the actual visual evaluation scores across evaluation items.
As a comparative example, a comparative discriminant model (hereinafter referred to as the full face model) was also generated, which inputs the above-mentioned face image and outputs a 10-dimensional vector having estimated values of the visual evaluation scores of 10 items as elements.

図１５は、顔画像を入力して肌の目視評価スコアを推定する学習済みモデル（全顔型判別モデル）と肌パッチ画像を入力して肌の目視評価スコアを推定する学習済みモデル（肌パッチ型判別モデル）との推定精度を比較するためのグラフである。図１５には、１０個の目視評価項目ごとに、肌パッチ型判別モデル又は全顔型判別モデルにより出力された目視評価スコアの推定値と検証用データの正解情報（目視評価スコア）との相関係数が示されている。
図１５によれば、肌パッチ型判別モデルは、１０個の評価項目すべてにおいて相関係数０．６を超える推定精度を示すのに対して、比較例としての全顔型判別モデルは、肌の色及び肌の黄みという二つの評価横目について大きく推定精度が低下していることが分かる。
以上より、肌の目視評価スコアを推定する判別モデルを用いる場合においても、肌パッチ画像を用いることの有効性が実証された。 15 is a graph for comparing the estimation accuracy of a trained model (all-face type discriminant model) that inputs a face image to estimate a visual skin evaluation score and a trained model (skin patch type discriminant model) that inputs a skin patch image to estimate a visual skin evaluation score. For each of the 10 visual evaluation items, Fig. 15 shows the correlation coefficient between the estimated visual evaluation score output by the skin patch type discriminant model or the all-face type discriminant model and the correct answer information (visual evaluation score) of the verification data.
According to Figure 15, the skin patch type discrimination model shows an estimation accuracy with a correlation coefficient exceeding 0.6 for all 10 evaluation items, whereas the all-face type discrimination model as a comparative example shows a significant drop in estimation accuracy for the two evaluation items of sideways glance, namely skin color and yellowish skin.
From the above, the effectiveness of using skin patch images was demonstrated even when using a discriminant model to estimate visual skin assessment scores.

本実施例では、更に、上述の顔画像とは異なる被写体を異なる撮影環境で撮影することで得られた新たな顔画像群を用いて、上述の肌パッチ型判別モデルの推定精度が検証された。
新たな顔画像群は、２０代から７０代の日本人女性２６９名を被写体とし、同一被写体につき素肌の全顔及びファンデーション塗布後の全顔の二つの顔画像をそれぞれ含む。
一方で、肌パッチ型判別モデルは、上述の学習用教師データを用いて学習されているため、４０歳以上の被写体の顔画像及びファンデーション塗布後の顔画像については学習していない状態である。 In this example, the estimation accuracy of the above-mentioned skin patch type discrimination model was further verified using a new group of face images obtained by photographing a subject different from the above-mentioned face images in a different shooting environment.
The new face image group includes 269 Japanese women in their 20s to 70s as subjects, and includes two face images of each subject: one with bare skin and one with foundation applied.
On the other hand, since the skin patch type discrimination model is trained using the training data described above, it has not yet been trained on facial images of subjects aged 40 or over or facial images after applying foundation.

このような新たな顔画像群から上述と同様に肌パッチ画像群が取得され、各肌パッチ画像が肌パッチ型判別モデルに入力されて、各肌パッチ画像に関する１０項目の目視評価スコアの推定値がそれぞれ取得された。
そして、当該肌パッチ画像群が素肌とファンデーション塗布後の肌との二つの肌グループに分類され、肌グループごとに更に、被写体の年齢によって６つの年代グループ（２０代から７０代）に分類され、グループごとに肌パッチ型判別モデルにより得られた目視評価スコアの推定値の平均値が算出された。 A group of skin patch images was obtained from this new group of face images in the same manner as described above, and each skin patch image was input into a skin patch type discrimination model to obtain estimated visual evaluation scores for the 10 items for each skin patch image.
The group of skin patch images was then classified into two skin groups: bare skin and skin after foundation application. Each skin group was further classified into six age groups (20s to 70s) based on the age of the subject, and the average estimated visual evaluation score obtained by the skin patch type discrimination model was calculated for each group.

図１６は、実施例８における肌パッチ型判別モデルから得られる目視評価スコアの推定値と年齢との相関、及び化粧料塗布に伴う目視評価スコアの推定値の変化を示すレーダーチャートである。図１６（ａ）は、素肌グループにおける６つの年代ごとの肌パッチ型判別モデルにより得られた目視評価スコアの推定値の平均値を示すレーダーチャートであり、図１６（ｂ）は、ファンデーション塗布後の肌グループにおける６つの年代ごとの肌パッチ型判別モデルにより得られた目視評価スコアの推定値の平均値を示すレーダーチャートである。図１６では、１０個の各評価項目の目視評価スコアの推定値の点を結んで囲まれた領域（以降、チャート領域と表記）が広い程、各評価項目の目視評価スコアの推定値が高いことを示す。
図１６によれば、素肌グループ及びファンデーション塗布後の肌グループのいずれにおいても、２０代グループのチャート領域が最も広く、年代が増すにつれて、チャート領域が狭くなっていることが分かる。これは、当該肌パッチ型判別モデルが肌の目視評価スコアの加齢に伴う低下を的確に捉えているということができる。
また、同一年代グループ間で比較すると、いずれの年代グループに関しても、図１６（ｂ）で示されるチャート領域のほうが図１６（ａ）で示されるチャート領域よりも広くなっていることが分かる。これは、当該肌パッチ型判別モデルが肌の目視評価スコアのファンデーション塗布に伴う改善を的確に捉えているということができる。
以上より、実施例８の当該肌パッチ型判別モデルによれば、未学習の年代の顔画像や未学習のファンデーション塗布後の顔画像を含む新たな顔画像群に関しても高い精度で目視評価スコアの推定値を取得できることが実証された。 Fig. 16 is a radar chart showing the correlation between age and the estimated visual evaluation score obtained from the skin patch type discriminant model in Example 8, and the change in the estimated visual evaluation score following the application of a cosmetic product. Fig. 16(a) is a radar chart showing the average value of the estimated visual evaluation score obtained from the skin patch type discriminant model for each of the six age groups in the bare skin group, and Fig. 16(b) is a radar chart showing the average value of the estimated visual evaluation score obtained from the skin patch type discriminant model for each of the six age groups in the skin group after the application of foundation. In Fig. 16, the wider the area surrounded by connecting the points of the estimated values of the visual evaluation scores for each of the 10 evaluation items (hereinafter referred to as the chart area), the higher the estimated value of the visual evaluation score for each evaluation item.
16, it can be seen that for both the bare skin group and the foundation-applied skin group, the chart area for the 20s group is the widest, and the chart area becomes narrower as the age group increases. This means that the skin patch type discrimination model accurately captures the decline in visual skin evaluation scores with age.
Furthermore, when comparing between the same age groups, it can be seen that the chart area shown in Fig. 16(b) is wider than the chart area shown in Fig. 16(a) for all age groups. This means that the skin patch type discrimination model accurately captures the improvement in the skin visual evaluation score that accompanies the application of foundation.
From the above, it was demonstrated that the skin patch type discrimination model of Example 8 can obtain highly accurate estimates of visual evaluation scores even for new groups of face images including face images from untrained ages and face images after untrained foundation application.

１０情報処理装置
１１ＣＰＵ
１２メモリ
１３入出力Ｉ／Ｆ
１４通信ユニット
１５表示装置
１６入力装置 10 Information processing device 11 CPU
12 Memory 13 Input/Output I/F
14 Communication unit 15 Display device 16 Input device

Claims

one or more processors that can use a trained discrimination model that has been machine-learned based on a plurality of training data including a plurality of combinations of correct answer information indicating a common state of subject skin in a training skin image and a group of training skin patch images of a predetermined image size extracted from the training skin image;
A step of acquiring an evaluation skin image showing the skin of the person to be evaluated;
acquiring a group of skin patch images of a predetermined image size from the acquired skin images for evaluation;
normalizing pixel values of each of the acquired skin patch images;
an index value acquisition step of inputting each normalized skin patch image into the discriminant model to acquire a skin condition index value for each skin patch image;
Run
the correct answer information indicating a common skin state of the subject skin of the teacher skin image included in the teacher data indicates either bare skin or made-up skin,
In the index value acquisition step, an index value of a bare skin look or a makeup look is acquired for each of the skin patch images as an index value of a skin condition.
Skin image analysis method.

one or more processors that can use a trained discrimination model that has been machine-learned based on a plurality of training data including a plurality of combinations of correct answer information indicating a common state of subject skin in a training skin image and a group of training skin patch images of a predetermined image size extracted from the training skin image;
A step of acquiring an evaluation skin image showing the skin of the person to be evaluated;
acquiring a group of skin patch images of a predetermined image size from the acquired skin images for evaluation;
normalizing pixel values of each of the acquired skin patch images;
an index value acquisition step of inputting each normalized skin patch image into the discriminant model to acquire a skin condition index value for each skin patch image;
Run
the correct answer information indicating a common state of the subject skin of the teacher skin image included in the teacher data indicates either a state immediately after makeup application or a state after a predetermined time has elapsed since makeup application,
In the index value acquisition step, a makeup smear degree is acquired for each of the skin patch images as an index value of a skin condition.
Skin image analysis method.

one or more processors that can use a trained discrimination model that has been machine-learned based on a plurality of training data including a plurality of combinations of correct answer information indicating a common state of subject skin in a training skin image and a group of training skin patch images of a predetermined image size extracted from the training skin image;
A step of acquiring an evaluation skin image showing the skin of the person to be evaluated;
acquiring a group of skin patch images of a predetermined image size from the acquired skin images for evaluation;
normalizing pixel values of each of the acquired skin patch images;
an index value acquisition step of inputting each normalized skin patch image into the discriminant model to acquire a skin condition index value for each skin patch image;
Run
the correct answer information indicating a common state of the subject skin of the teacher skin image included in the teacher data indicates either a state before or after application of a skin care agent;
In the index value acquisition step, an index value of skin moisturizing feeling is acquired as an index value of skin condition for each of the skin patch images.
Skin image analysis method.

one or more processors that can use a trained discrimination model that has been machine-learned based on a plurality of training data including a plurality of combinations of correct answer information indicating common attributes or states of subject skin in a training skin image and a group of training skin patch images of a predetermined image size extracted from the training skin image;
A step of acquiring an evaluation skin image showing the skin of the person to be evaluated;
acquiring a group of skin patch images of a predetermined image size from the acquired skin images for evaluation;
normalizing pixel values of each of the acquired skin patch images;
inputting each normalized skin patch image into the discriminant model to obtain a skin attribute or skin condition index value for each skin patch image;
Run
The skin area shown in the skin patch image is a skin area of the subject that is 1 cm ^{or more} and 20 cm or ^less .
Skin image analysis method.

the one or more processors:
The skin image analysis method according to claim 1 , further comprising: generating index value distribution information indicating a distribution of the acquired index values according to the position of the subject skin region of each skin patch image.

generating a color map image as the index value distribution information by mapping each of the acquired index values onto the acquired skin image for evaluation according to the position of the subject skin region of each skin patch image;
The skin image analysis method according to claim 5 .

the one or more processors:
calculating a representative evaluation value of the skin condition of the subject based on the index value for each of the acquired skin patch images;
The skin image analysis method according to claim 1 , further comprising:

the one or more processors:
A step of calculating a representative evaluation value of the skin attributes or skin condition of the subject based on the index value for each of the acquired skin patch images;
The method of claim 4 further comprising:

the one or more processors:
acquiring area designation information that designates a desired image area in the acquired skin image for evaluation;
calculating a representative evaluation value of the skin condition of the image region based on the index values acquired for a group of skin patch images corresponding to the image region indicated by the acquired region designation information;
The skin image analysis method according to claim 1 , further comprising:

the one or more processors:
acquiring area designation information that designates a desired image area in the acquired skin image for evaluation;
calculating a representative evaluation value of a skin attribute or a skin condition of an image region based on the index values acquired for a group of skin patch images corresponding to the image region indicated by the acquired region designation information;
The method of claim 4 further comprising:

The correct answer information indicating a common attribute of subject skin of the teacher skin image included in the teacher data indicates either male or female,
An index value of male skin-likeness or female skin-likeness is acquired for each of the skin patch images as an index value of a skin attribute.
The method for analyzing skin images according to claim 4, 8 or 10 .

The correct answer information indicating a common attribute of the subject's skin of the teacher skin image included in the teacher data indicates the actual age or actual age range of the subject of the teacher skin image,
For each of the skin patch images, an index value for each skin appearance age range or an appearance age is acquired as a skin attribute index value.
The method for analyzing skin images according to claim 4, 8 or 10 .

The discrimination model is composed of a convolutional neural network including a deep neural network.
The method for analyzing skin images according to any one of claims 1 to 12 .

A method for learning a discriminant model for discriminating a skin condition of a skin patch image in response to an input of the skin patch image having a predetermined image size, comprising:
obtaining a plurality of combinations of teacher skin images and correct answer information indicating a common state of subject skin in the teacher skin images;
a step of acquiring a group of teacher skin patch images of a predetermined image size, each containing a skin region at a predetermined ratio or more from the acquired teacher skin images;
normalizing pixel values of each of the acquired teacher skin patch images;
generating a group of teacher data in which the correct answer information corresponding to the teacher skin image from which each teacher skin patch image was obtained is associated with each of the normalized teacher skin patch images;
a step of training the discriminant model using the training data group;
Including,
The correct answer information indicating a common state of subject skin in the teacher skin image indicates either bare skin or made-up skin.
How to train a discriminant model.

A method for learning a discriminant model for discriminating a skin condition of a skin patch image in response to an input of the skin patch image having a predetermined image size, comprising:
obtaining a plurality of combinations of teacher skin images and correct answer information indicating a common state of subject skin in the teacher skin images;
a step of acquiring a group of teacher skin patch images of a predetermined image size, each containing a skin region at a predetermined ratio or more from the acquired teacher skin images;
normalizing pixel values of each of the acquired teacher skin patch images;
generating a group of teacher data in which the correct answer information corresponding to the teacher skin image from which each teacher skin patch image was obtained is associated with each of the normalized teacher skin patch images;
a step of training the discriminant model using the training data group;
Including,
The correct answer information indicating a common state of the subject skin of the teacher skin image indicates either a state immediately after applying makeup or a state after a predetermined time has elapsed since applying makeup.
How to train a discriminant model.

A method for learning a discriminant model for discriminating a skin condition of a skin patch image in response to an input of the skin patch image having a predetermined image size, comprising:
obtaining a plurality of combinations of teacher skin images and correct answer information indicating a common state of subject skin in the teacher skin images;
a step of acquiring a group of teacher skin patch images of a predetermined image size, each containing a skin region at a predetermined ratio or more, from the acquired teacher skin images;
normalizing pixel values of each of the acquired teacher skin patch images;
generating a group of teacher data in which the correct answer information corresponding to the teacher skin image from which each teacher skin patch image was obtained is associated with each of the normalized teacher skin patch images;
a step of training the discriminant model using the training data group;
Including,
the correct answer information indicating a common state of the subject skin of the teacher skin image indicates either a state before application of a skin care agent or a state after application of a skin care agent;
How to train a discriminant model.

A method for learning a discriminant model that discriminates skin attributes or skin conditions of a skin patch image in response to an input of the skin patch image having a predetermined image size, comprising:
obtaining a plurality of combinations of teacher skin images and correct answer information indicating attributes or states common to the subject skin of the teacher skin images;
a step of acquiring a group of teacher skin patch images of a predetermined image size, each containing a skin region at a predetermined ratio or more from the acquired teacher skin images;
normalizing pixel values of each of the acquired teacher skin patch images;
generating a group of teacher data in which the correct answer information corresponding to the teacher skin image from which each teacher skin patch image was obtained is associated with each of the normalized teacher skin patch images;
a step of training the discriminant model using the training data group;
Including,
a skin area of the subject appearing in each of the teacher skin patch images included in the teacher skin patch image group is equal to or greater than 1 ^cm2 and equal to or less than 20 ^cm2 ;
How to train a discriminant model.

A skin image analysis device comprising at least the one or more processors and a memory,
A skin image analysis device capable of executing the skin image analysis method according to any one of claims 1 to 13 .

A discriminant model learning device capable of executing the discriminant model learning method according to any one of claims 14 to 17 .