JP7573239B2

JP7573239B2 - PROGRAM, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING APPARATUS

Info

Publication number: JP7573239B2
Application number: JP2021027695A
Authority: JP
Inventors: 英樹坂元
Original assignee: 株式会社プラグ
Priority date: 2021-02-24
Filing date: 2021-02-24
Publication date: 2024-10-25
Anticipated expiration: 2041-02-24
Also published as: JP2022129123A

Description

特許法第３０条第２項適用令和２年５月２２日に、Ｃｏｎｆｉｔの２０２０年度人工知能学会全国大会（第３４回）のウェブサイトにて公開（ｈｔｔｐｓ：／／ｃｏｎｆｉｔ．ａｔｌａｓ．ｊｐ／ｇｕｉｄｅ／ｅｖｅｎｔ／ｊｓａｉ２０２０／ｓｕｂｊｅｃｔ／１Ｍ３－ＧＳ－１３－０２／ａｄｖａｎｃｅｄ）令和２年６月９日に、２０２０年度人工知能学会全国大会（第３４回）にて公開（ｈｔｔｐｓ：／／ｗｗｗ．ａｉ－ｇａｋｋａｉ．ｏｒ．ｊｐ／ｊｓａｉ２０２０／）令和２年６月２０日に、ＰＬＵＧＡＩのウェブサイトで公開（ｈｔｔｐｓ：／／ｈｐ．ｐａｃｋａｇｅ－ａｉ．ｊｐ／）Application of Article 30, Paragraph 2 of the Patent Act Published on May 22, 2020 on the website of Confit's 2020 National Conference of the Japanese Society for Artificial Intelligence (34th) (https://confit.atlas.jp/guide/event/jsai2020/subject/1M3-GS-13-02/advanced) Published on June 9, 2020 at the 2020 National Conference of the Japanese Society for Artificial Intelligence (34th) (https://www.ai-gakkai.or.jp/jsai2020/) Published on June 20, 2020 on the PLUG AI website (https://hp.package-ai.jp/)

本発明は、プログラム、情報処理方法、及び情報処理装置に関する。 The present invention relates to a program, an information processing method, and an information processing device.

従来、デザインを評価する際に、評価対象となる第１デザインの第１特徴量を抽出し、抽出された第１特徴量に基づいて、第１デザインと、既存の複数の第２デザインの各々との類似度を算出し、算出された類似度と、予め取得された第２デザインの各々に対する顧客の感想を示す情報に基づいて、第１デザインに対する顧客の感想を予想する評価装置が開示されている（例えば、特許文献１参照）。 Conventionally, when evaluating designs, an evaluation device has been disclosed that extracts a first characteristic amount of a first design to be evaluated, calculates the similarity between the first design and each of a plurality of existing second designs based on the extracted first characteristic amount, and predicts customer impressions of the first design based on the calculated similarity amount and previously acquired information indicating customer impressions of each of the second designs (see, for example, Patent Document 1).

特開２０１８－１９５０７８号公報JP 2018-195078 A

しかしながら、特許文献１に係る発明は、適切にパッケージ画像を提案することができないおそれがある。 However, the invention described in Patent Document 1 may not be able to appropriately suggest package images.

一つの側面では、適切にパッケージ画像を提案することが可能なプログラム等を提供することにある。 One aspect is to provide a program etc. that can appropriately suggest package images.

一つの側面に係るプログラムは、候補となるパッケージ画像に含まれるデザイン要素数、及び、デザイン数を受け付け、受け付けたデザイン要素数のデザイン要素を選択するためのデザイン要素選択欄を出力し、各デザイン要素選択欄に対応付けて、受け付けたデザイン数のデザインを選択するためのデザイン選択欄を出力し、前記候補となるパッケージ画像を複数生成し、パッケージ画像を入力した場合に、前記パッケージ画像の好意度に関する情報を出力するよう学習済みの第１学習モデルに、生成した複数のパッケージ画像を前記第１学習モデルに入力して、各パッケージ画像の好意度に関する情報を出力し、出力された好意度に関する情報に基づいて、前記複数のパッケージ画像から提案すべきパッケージ画像を出力する処理をコンピュータに実行させる。 A program relating to one aspect causes a computer to execute a process of accepting the number of design elements and the number of designs contained in candidate package images, outputting a design element selection field for selecting a design element for the accepted number of design elements, outputting a design selection field for selecting a design for the accepted number of designs in correspondence with each design element selection field, generating a plurality of the candidate package images, inputting the generated package images into a first learning model that has been trained to output information regarding the favorability of the package image when a package image is input, outputting information regarding the favorability of each package image, and outputting a package image to be suggested from the plurality of package images based on the output information regarding the favorability.

一つの側面では、適切にパッケージ画像を提案することが可能となる。 On one hand, it makes it possible to suggest appropriate package images.

パッケージ画像提案システムの概要を示す説明図である。FIG. 1 is an explanatory diagram showing an overview of a package image suggestion system. サーバの構成例を示すブロック図である。FIG. 2 is a block diagram showing an example of the configuration of a server. 端末の構成例を示すブロック図である。FIG. 2 is a block diagram showing a configuration example of a terminal. 提案すべきパッケージ画像を出力する動作を説明する説明図である。FIG. 11 is an explanatory diagram for explaining an operation of outputting a package image to be proposed. 好意度予測モデルを説明する説明図である。FIG. 13 is an explanatory diagram for explaining a favorability prediction model. デザイン数及びデザイン要素数の受付画面の一例を示す説明図である。FIG. 13 is an explanatory diagram showing an example of a reception screen for the number of designs and the number of design elements. デザイン要素の登録画面の一例を示す説明図である。FIG. 13 is an explanatory diagram showing an example of a registration screen for a design element. 提案すべきパッケージ画像の表示画面の一例を示す説明図である。FIG. 13 is an explanatory diagram showing an example of a display screen of a package image to be proposed. 提案すべきパッケージ画像を出力する際の処理手順を示すフローチャートである。10 is a flowchart showing a processing procedure for outputting a package image to be proposed. 提案すべきパッケージ画像を出力する際の処理手順を示すフローチャートである。10 is a flowchart showing a processing procedure for outputting a package image to be proposed. スコアを用いて学習させる好意度予測モデルを説明する説明図である。FIG. 11 is an explanatory diagram illustrating a favorability prediction model that is trained using scores. 実施形態２のサーバの構成例を示すブロック図である。FIG. 11 is a block diagram showing an example of the configuration of a server according to a second embodiment. ＳｔｙｌｅＧＡＮの構成を概念的に示す構成図であるFIG. 1 is a diagram conceptually illustrating the configuration of StyleGAN. 基本パッケージ画像の受付画面の一例を示す説明図である。FIG. 13 is an explanatory diagram showing an example of a reception screen for a basic package image. スタイル画像の受付画面の一例を示す説明図である。FIG. 11 is an explanatory diagram showing an example of a style image reception screen. パッケージ画像生成の確定画面の一例を示す説明図である。FIG. 13 is an explanatory diagram showing an example of a confirmation screen for generating a package image. パッケージ画像の表示画面の一例を示す説明図である。FIG. 11 is an explanatory diagram showing an example of a display screen of a package image. スタイル画像を用いてパッケージ画像を生成する際の処理手順を示すフローチャートである。13 is a flowchart showing a processing procedure for generating a package image using a style image. 実施形態３のサーバの構成例を示すブロック図である。FIG. 13 is a block diagram showing an example of the configuration of a server according to a third embodiment. 出現割合予測モデルを説明する説明図である。FIG. 13 is an explanatory diagram for explaining an appearance ratio prediction model. 実施形態４のサーバの構成例を示すブロック図である。FIG. 13 is a block diagram showing an example of the configuration of a server according to a fourth embodiment. 標準偏差予測モデルを説明する説明図である。FIG. 13 is an explanatory diagram illustrating a standard deviation prediction model.

以下、本発明をその実施形態を示す図面に基づいて詳述する。 The present invention will be described in detail below with reference to drawings showing an embodiment of the invention.

（実施形態１）
実施形態１は、候補となるパッケージ画像を複数生成し、パッケージ画像の好意度に関する情報に基づいて、生成した複数のパッケージ画像から提案すべきパッケージ画像を出力する形態に関する。図１は、パッケージ画像提案システムの概要を示す説明図である。本実施形態のシステムは、情報処理装置１及び情報処理端末２を含み、各装置はインターネット等のネットワークＮを介して情報の送受信を行う。 (Embodiment 1)
The first embodiment relates to a form in which a plurality of candidate package images are generated, and a package image to be proposed is output from the plurality of generated package images based on information regarding the favorability of the package images. Fig. 1 is an explanatory diagram showing an overview of a package image proposal system. The system of this embodiment includes an information processing device 1 and an information processing terminal 2, and each device transmits and receives information via a network N such as the Internet.

情報処理装置１は、種々の情報に対する処理、記憶及び送受信を行う情報処理装置である。情報処理装置１は、例えばサーバ装置、パーソナルコンピュータまたは汎用のタブレットＰＣ（パソコン）等である。本実施形態において、情報処理装置１はサーバ装置であるものとし、以下では簡潔のためサーバ１と読み替える。 The information processing device 1 is an information processing device that processes, stores, and transmits/receives various types of information. The information processing device 1 is, for example, a server device, a personal computer, or a general-purpose tablet PC (personal computer). In this embodiment, the information processing device 1 is a server device, and for the sake of simplicity, will be referred to as server 1 below.

情報処理端末２は、提案すべきパッケージ画像の受信及び表示、並びに、該パッケージ画像の記憶（保存）等を行う端末装置である。情報処理端末２は、例えばスマートフォン、携帯電話、タブレット、パーソナルコンピュータ端末等の情報処理機器である。以下では簡潔のため、情報処理端末２を端末２と読み替える。 The information processing terminal 2 is a terminal device that receives and displays package images to be proposed, and stores (saves) the package images. The information processing terminal 2 is, for example, an information processing device such as a smartphone, a mobile phone, a tablet, or a personal computer terminal. For the sake of simplicity, the information processing terminal 2 will be read as terminal 2 below.

本実施形態に係るサーバ１は、複数のデザイン要素に対するデザインを組み替えることにより、候補となるパッケージ画像を複数生成する。サーバ１は、生成した複数のパッケージ画像を好意度予測モデルに入力して、各パッケージ画像の好意度に関する情報を出力する。サーバ１は、出力した各パッケージ画像の好意度に関する情報に基づいて、複数の候補となるパッケージ画像から、提案すべきパッケージ画像を抽出する。サーバ１は、抽出した提案すべきパッケージ画像を端末２に送信する。 The server 1 according to this embodiment generates multiple candidate package images by rearranging the designs for multiple design elements. The server 1 inputs the multiple generated package images into a favorability prediction model and outputs information regarding the favorability of each package image. The server 1 extracts a package image to be proposed from the multiple candidate package images based on the output information regarding the favorability of each package image. The server 1 transmits the extracted package image to be proposed to the terminal 2.

図２は、サーバ１の構成例を示すブロック図である。サーバ１は、制御部１１、記憶部１２、通信部１３、入力部１４、表示部１５、読取部１６及び大容量記憶部１７を含む。各構成はバスＢで接続されている。 Figure 2 is a block diagram showing an example of the configuration of the server 1. The server 1 includes a control unit 11, a storage unit 12, a communication unit 13, an input unit 14, a display unit 15, a reading unit 16, and a large-capacity storage unit 17. Each component is connected by a bus B.

制御部１１はＣＰＵ（Central Processing Unit）、ＭＰＵ（Micro-Processing Unit）、ＧＰＵ（Graphics Processing Unit）等の演算処理装置を含み、記憶部１２に記憶された制御プログラム１Ｐを読み出して実行することにより、サーバ１に係る種々の情報処理、制御処理等を行う。なお、制御プログラム１Ｐは、単一のコンピュータ上で、または１つのサイトにおいて配置されるか、もしくは複数のサイトにわたって分散され、通信ネットワークによって相互接続された複数のコンピュータ上で実行されるように展開することができる。なお、図２では制御部１１を単一のプロセッサであるものとして説明するが、マルチプロセッサであっても良い。 The control unit 11 includes a processor such as a CPU (Central Processing Unit), MPU (Micro-Processing Unit), and GPU (Graphics Processing Unit), and performs various information processing, control processing, and the like related to the server 1 by reading and executing the control program 1P stored in the storage unit 12. The control program 1P can be deployed so that it is executed on a single computer or at one site, or distributed across multiple sites and on multiple computers interconnected by a communications network. Although the control unit 11 is described in FIG. 2 as being a single processor, it may also be a multi-processor.

記憶部１２はＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）等のメモリ素子を含み、制御部１１が処理を実行するために必要な制御プログラム１Ｐ又はデータ等を記憶している。また、記憶部１２は、制御部１１が演算処理を実行するために必要なデータ等を一時的に記憶する。通信部１３は通信に関する処理を行うための通信モジュールであり、ネットワークＮを介して、端末２との間で情報の送受信を行う。 The storage unit 12 includes memory elements such as RAM (Random Access Memory) and ROM (Read Only Memory), and stores the control program 1P or data required for the control unit 11 to execute processing. The storage unit 12 also temporarily stores data required for the control unit 11 to execute arithmetic processing. The communication unit 13 is a communication module for performing processing related to communication, and transmits and receives information to and from the terminal 2 via the network N.

入力部１４は、マウス、キーボード、タッチパネル、ボタン等の入力デバイスであり、受け付けた操作情報を制御部１１へ出力する。表示部１５は、液晶ディスプレイ又は有機ＥＬ（electroluminescence）ディスプレイ等であり、制御部１１の指示に従い各種情報を表示する。 The input unit 14 is an input device such as a mouse, keyboard, touch panel, or button, and outputs received operation information to the control unit 11. The display unit 15 is a liquid crystal display or an organic EL (electroluminescence) display, etc., and displays various information according to instructions from the control unit 11.

読取部１６は、ＣＤ（Compact Disc）－ＲＯＭ又はＤＶＤ（Digital Versatile Disc）－ＲＯＭを含む可搬型記憶媒体１ａを読み取る。制御部１１が読取部１６を介して、制御プログラム１Ｐを可搬型記憶媒体１ａより読み取り、大容量記憶部１７に記憶しても良い。また、ネットワークＮ等を介して他のコンピュータから制御部１１が制御プログラム１Ｐをダウンロードし、大容量記憶部１７に記憶しても良い。さらにまた、半導体メモリ１ｂから、制御部１１が制御プログラム１Ｐを読み込んでも良い。 The reading unit 16 reads the portable storage medium 1a, which may be a CD (Compact Disc)-ROM or a DVD (Digital Versatile Disc)-ROM. The control unit 11 may read the control program 1P from the portable storage medium 1a via the reading unit 16 and store it in the mass storage unit 17. The control unit 11 may also download the control program 1P from another computer via a network N or the like and store it in the mass storage unit 17. The control unit 11 may also read the control program 1P from the semiconductor memory 1b.

大容量記憶部１７は、例えばＨＤＤ（Hard disk drive:ハードディスク）、ＳＳＤ(Solid State Drive:ソリッドステートドライブ)等の記録媒体を備える。大容量記憶部１７は、好意度予測モデル１７１を含む。好意度予測モデル１７１は、パッケージ画像に基づいて該パッケージ画像の好意度に関する情報を予測する予測器であり、機械学習により生成された学習済みモデルである。 The large-capacity storage unit 17 includes a recording medium such as a hard disk drive (HDD) or a solid state drive (SSD). The large-capacity storage unit 17 includes a favorability prediction model 171. The favorability prediction model 171 is a predictor that predicts information related to the favorability of a package image based on the package image, and is a trained model generated by machine learning.

なお、本実施形態において記憶部１２及び大容量記憶部１７は一体の記憶装置として構成されていても良い。また、大容量記憶部１７は複数の記憶装置により構成されていても良い。更にまた、大容量記憶部１７はサーバ１に接続された外部記憶装置であっても良い。 In this embodiment, the memory unit 12 and the large-capacity memory unit 17 may be configured as an integrated memory device. The large-capacity memory unit 17 may also be configured from multiple memory devices. Furthermore, the large-capacity memory unit 17 may be an external memory device connected to the server 1.

サーバ１はコンピュータ単体で実行しても良いし、複数のコンピュータで分散して実行しても良いし、仮想マシンで分散して実行しても良い。 Server 1 may be executed on a single computer, or may be executed in a distributed manner on multiple computers, or may be executed in a distributed manner on virtual machines.

図３は、端末２の構成例を示すブロック図である。端末２は、制御部２１、記憶部２２、通信部２３、入力部２４、表示部２５及び撮影部２６を含む。各構成はバスＢで接続されている。 Figure 3 is a block diagram showing an example of the configuration of the terminal 2. The terminal 2 includes a control unit 21, a storage unit 22, a communication unit 23, an input unit 24, a display unit 25, and an image capture unit 26. Each component is connected by a bus B.

制御部２１はＣＰＵ、ＭＰＵ等の演算処理装置を含み、記憶部２２に記憶された制御プログラム２Ｐを読み出して実行することにより、端末２に係る種々の情報処理、制御処理等を行う。なお、図３では制御部２１を単一のプロセッサであるものとして説明するが、マルチプロセッサであっても良い。記憶部２２はＲＡＭ、ＲＯＭ等のメモリ素子を含み、制御部２１が処理を実行するために必要な制御プログラム２Ｐ又はデータ等を記憶している。また、記憶部２２は、制御部２１が演算処理を実行するために必要なデータ等を一時的に記憶する。 The control unit 21 includes an arithmetic processing device such as a CPU or MPU, and performs various information processing, control processing, etc. related to the terminal 2 by reading and executing the control program 2P stored in the memory unit 22. Note that while the control unit 21 is described as a single processor in FIG. 3, it may be a multiprocessor. The memory unit 22 includes memory elements such as RAM and ROM, and stores the control program 2P or data, etc. required for the control unit 21 to execute processing. The memory unit 22 also temporarily stores data, etc. required for the control unit 21 to execute arithmetic processing.

通信部２３は通信に関する処理を行うための通信モジュールであり、ネットワークＮを介して、サーバ１と情報の送受信を行う。入力部２４は、キーボード、マウスまたは表示部２５と一体化したタッチパネルでも良い。表示部２５は、液晶ディスプレイ又は有機ＥＬディスプレイ等であり、制御部２１の指示に従い各種情報を表示する。 The communication unit 23 is a communication module for performing communication-related processing, and transmits and receives information to and from the server 1 via the network N. The input unit 24 may be a keyboard, a mouse, or a touch panel integrated with the display unit 25. The display unit 25 is a liquid crystal display or an organic EL display, etc., and displays various information according to instructions from the control unit 21.

撮影部２６は、例えばＣＣＤ(Charge Coupled Device)カメラ、ＣＭＯＳ（Complementary Metal Oxide Semiconductor）カメラ等の撮影装置である。なお、撮影部２６は、複数の撮影装置により構成されても良い。なお、撮影部２６は端末２の中に内蔵せず、外部で直接に端末２と接続し、撮影可能な構成としても良い。 The photographing unit 26 is, for example, a photographing device such as a CCD (Charge Coupled Device) camera or a CMOS (Complementary Metal Oxide Semiconductor) camera. The photographing unit 26 may be composed of multiple photographing devices. The photographing unit 26 may not be built into the terminal 2, but may be externally connected directly to the terminal 2 and configured to be capable of photographing.

なお、端末２はサーバ１の機能を有しても良い。 In addition, terminal 2 may also have the functions of server 1.

図４は、提案すべきパッケージ画像を出力する動作を説明する説明図である。先に、候補となるパッケージ画像を生成する処理を説明する。本実施形態では、サーバ１が遺伝的アルゴリズムを用いて、パッケージ画像に含まれる複数のデザイン要素に対して交叉処理及び突然変異処理を行い、候補となるパッケージ画像を複数生成する。遺伝的アルゴリズムは、進化論的な考え方に基づいてデータを操作し、最適解探索、学習または推論を行う進化型計算手法である。 Figure 4 is an explanatory diagram explaining the operation of outputting a package image to be proposed. First, the process of generating candidate package images will be explained. In this embodiment, the server 1 uses a genetic algorithm to perform crossover and mutation processes on multiple design elements included in the package image to generate multiple candidate package images. A genetic algorithm is an evolutionary computing method that manipulates data based on evolutionary thinking and performs optimal solution search, learning, or inference.

サーバ１は、パッケージデザイン毎に複数のデザイン要素の要素名及び要素画像を端末２から取得する。パッケージデザインは、食品・飲料・日用品をはじめとした包装・容器のデザインである。デザイン要素は、パッケージデザインを構成する各種デザインの要素である。例えば、飲料容器の場合、背景、キャップ、容量表示のフォントまたはロゴ等がデザイン要素となる。 The server 1 acquires from the terminal 2 the element names and element images of multiple design elements for each package design. Package designs are the designs of packaging and containers for foods, beverages, daily necessities, and other products. Design elements are the various design elements that make up a package design. For example, in the case of a beverage container, design elements include the background, cap, font or logo for indicating volume, etc.

サーバ１は、取得した複数のデザイン要素を連結して二進数表現する。そして、連結したものを１つの染色体とし、その染色体が１つの個体を表すものとする。こうして表した個体に対して遺伝的アルゴリズム（genetic algorithm）を適用する。交叉処理については、サーバ１は、受信した複数のデザイン要素に対し、例えばランダムに二つのデザイン要素を交叉することを複数回行い、結果的に複数個の交叉後の候補デザイン要素を生成する。 The server 1 concatenates the multiple design elements it has acquired and expresses them in binary. The concatenated set is then treated as one chromosome, and this chromosome represents one individual. A genetic algorithm is applied to the individual thus represented. For the crossover process, the server 1 randomly crosses two design elements together multiple times for the multiple design elements it has received, thereby generating multiple candidate design elements after crossover.

突然変異処理については、受信した複数のデザイン要素と共に、上述した交叉処理にて生成された複数の候補デザイン要素の中から、例えばランダムに単一または複数のデザイン要素を選択する。例えばサーバ１は、３つのデザイン要素をランダムに選択し、選択した３つのデザイン要素におけるそれぞれの遺伝子配列の１点を変化させることを実行し、３つの突然変異後の候補デザイン要素を生成する。 For the mutation process, a single or multiple design elements are selected, for example, randomly from the multiple candidate design elements generated in the crossover process described above along with the multiple design elements received. For example, server 1 randomly selects three design elements and changes one point in the gene sequence of each of the three selected design elements to generate three mutated candidate design elements.

このように、サーバ１は、複数のデザイン要素を交叉処理及び突然変異処理を行い、候補となるパッケージ画像を複数生成する。なお、本実施形態では、遺伝的アルゴリズムを用いる例を説明するが、これに限るものではない。例えば、ランダムにデザイン要素を変更する等、他の手法を用いても良い。 In this way, the server 1 performs crossover and mutation processes on multiple design elements to generate multiple candidate package images. Note that, although an example using a genetic algorithm is described in this embodiment, this is not limiting. For example, other methods such as randomly changing design elements may also be used.

次に、サーバ１は、生成した各候補となるパッケージ画像の好意度に関する情報を利用して、各パッケージ画像（個体）を評価する。好意度に関する情報は、好意度が最大５（好き）から最小１（嫌い）までに分類されるスコア、または好意度のスコアを最大５から最小１までの加重平均値を含む。具体的には、サーバ１は、生成したそれぞれのパッケージ画像を学習済みの好意度予測モデル１７１に入力して、それぞれのパッケージ画像の好意度に関する情報を出力する。なお、好意度予測モデル１７１を用いる好意度に関する情報の出力処理に関しては後述する。 Next, the server 1 evaluates each package image (individual) using information related to the favorability of each candidate package image that has been generated. The information related to the favorability includes a score that classifies the favorability from a maximum of 5 (like) to a minimum of 1 (dislike), or a weighted average of the favorability score from a maximum of 5 to a minimum of 1. Specifically, the server 1 inputs each of the generated package images into the trained favorability prediction model 171, and outputs information related to the favorability of each package image. The output process of the information related to the favorability using the favorability prediction model 171 will be described later.

サーバ１は、出力した好意度に関する情報に基づいて、各パッケージ画像（個体）を評価する。例えば、好意度の加重平均値の降順に上位からＮ番目（例えば、２０）までのパッケージ画像を最も好意度の高い個体として評価しても良く、または、好意度の加重平均値が所定閾値以上であるパッケージ画像を最も好意度の高い個体として評価しても良い。サーバ１は、好意度が低いと判定したパッケージ画像を破棄する。サーバ１は、好意度が高いと判定したパッケージ画像（生き残った個体）を、提案すべきパッケージ画像として抽出する。 The server 1 evaluates each package image (individual) based on the output information on the favorability. For example, the top N package images (e.g., 20) may be evaluated in descending order of the weighted average favorability as the most favorable individuals, or the package image whose weighted average favorability is equal to or greater than a predetermined threshold may be evaluated as the most favorable individual. The server 1 discards package images that are determined to have a low favorability. The server 1 extracts package images that are determined to have a high favorability (surviving individuals) as package images to be proposed.

サーバ１は、抽出した提案すべきパッケージ画像を第１世代のパッケージ画像として端末２に送信する。端末２は、サーバ１から送信された第１世代のパッケージ画像を受信して表示する。端末２は、第１世代のパッケージ画像に基づく第２世代のパッケージ画像の生成要求を受け付けた場合、第２世代のパッケージ画像の生成要求をサーバ１に送信する。サーバ１は、端末２から送信された第２世代のパッケージ画像の生成要求に応じて、第１世帯のパッケージ画像に基づいて、各パッケージ画像に含まれる複数のデザイン要素に対して交叉処理及び突然変異処理を再度行い、候補となるパッケージ画像を生成する。 The server 1 transmits the extracted package image to be proposed to the terminal 2 as a first generation package image. The terminal 2 receives and displays the first generation package image transmitted from the server 1. When the terminal 2 receives a request to generate a second generation package image based on the first generation package image, it transmits the request to generate the second generation package image to the server 1. In response to the request to generate a second generation package image transmitted from the terminal 2, the server 1 again performs crossover and mutation processes on multiple design elements contained in each package image based on the package image of the first generation, to generate candidate package images.

そして、サーバ１は、パッケージ画像の好意度に基づいて、生成された各候補となるパッケージ画像を再度評価する。サーバ１は、好意度が高いと判定した候補となるパッケージ画像を、提案すべきパッケージ画像として抽出する。サーバ１は、抽出した提案すべきパッケージ画像を第２世代のパッケージ画像として端末２に送信する。 Then, the server 1 re-evaluates each of the generated candidate package images based on the favorability of the package image. The server 1 extracts the candidate package image determined to have a high favorability as a package image to be proposed. The server 1 transmits the extracted package image to be proposed to the terminal 2 as a second-generation package image.

このように、サーバ１は、遺伝的アルゴリズムに基づく交叉処理及び突然変異処理を行うことにより、ユーザが満足する最終的な世代（例えば、第５世代）のパッケージ画像が決定されるまで繰り返す。このようにパッケージ画像処理を行うことにより、パッケージ画像の表示様態をユーザの好みに近づけた、即ちユーザにとって好意度が高いパッケージ画像を生成することが可能となる。 In this way, the server 1 repeats the crossover and mutation processes based on the genetic algorithm until a package image of the final generation (e.g., the fifth generation) that satisfies the user is determined. By performing package image processing in this way, it is possible to generate a package image whose display format is closer to the user's preferences, i.e., a package image that is highly favorable to the user.

続いて、好意度予測モデル１７１を用いる好意度に関する情報の出力処理を説明する。図５は、好意度予測モデル１７１を説明する説明図である。好意度予測モデル１７１は、人工知能ソフトウェアの一部であるプログラムモジュールとして利用される。好意度予測モデル１７１は、パッケージ画像を入力とし、該パッケージ画像の好意度に関する情報を出力とするニューラルネットワークを構築済みの好意度に関する情報の予測器である。 Next, the output process of information relating to favorability using the favorability prediction model 171 will be described. FIG. 5 is an explanatory diagram of the favorability prediction model 171. The favorability prediction model 171 is used as a program module that is part of artificial intelligence software. The favorability prediction model 171 is a predictor of information relating to favorability that has constructed a neural network that takes a package image as input and outputs information relating to the favorability of the package image.

サーバ１は、好意度予測モデル１７１として、パッケージ画像の好意度に影響を及ぼす箇所の特徴量を学習するディープラーニングを行うことで好意度予測モデル１７１を構築（生成）する。例えば、好意度予測モデル１７１はＣＮＮ（Convolution Neural Network）であり、パッケージ画像の入力を受け付ける入力層と、該パッケージ画像の好意度に関する情報を出力する出力層と、バックプロパゲーションにより学習済の中間層とを有する。 The server 1 constructs (generates) the favorability prediction model 171 by performing deep learning to learn the features of the parts of the package image that affect the favorability of the package image. For example, the favorability prediction model 171 is a CNN (Convolution Neural Network) that has an input layer that accepts input of the package image, an output layer that outputs information related to the favorability of the package image, and an intermediate layer that has been trained by backpropagation.

入力層は、パッケージ画像に含まれる各画素の画素値の入力を受け付ける複数のニューロンを有し、入力された画素値を中間層に受け渡す。中間層は、パッケージ画像の特徴量を抽出する複数のニューロンを有し、抽出した画像特徴量を出力層に受け渡す。具体的には、中間層は、入力層から入力された各画素の画素値を畳み込むコンボリューション層と、コンボリューション層で畳み込んだ画素値をマッピングするプーリング層とが交互に連結された構成により、パッケージ画像の画素情報を圧縮しながら最終的にパッケージ画像の特徴量を抽出する。その後中間層は、バックプロパゲーションによりパラメータが学習された全結合層により、パッケージ画像の好意度に関する情報を予測する。予測結果は、ニューロンを有する出力層に出力される。 The input layer has a number of neurons that accept input of pixel values of each pixel included in the package image, and passes the input pixel values to the intermediate layer. The intermediate layer has a number of neurons that extract features of the package image, and passes the extracted image features to the output layer. Specifically, the intermediate layer is configured by alternating convolution layers that convolve the pixel values of each pixel input from the input layer with pooling layers that map the pixel values convolved in the convolution layer, compressing the pixel information of the package image while ultimately extracting the features of the package image. The intermediate layer then predicts information related to the favorability of the package image using a fully connected layer whose parameters have been learned by backpropagation. The prediction results are output to the output layer, which has neurons.

なお、パッケージ画像は、交互に連結されたコンボリューション層とプーリング層とを通過して特徴量が抽出された後に、入力層に入力されても良い。 In addition, the package image may be input to the input layer after passing through an alternating convolution layer and pooling layer to extract features.

本実施形態では、好意度予測モデル１７１として、ＲｅｓＮｅｔ５０（Residual Network50）のネットワーク構造を適用しても良い。ＲｅｓＮｅｔ５０は、５０層を持ったＣＮＮのモデルである。なお、上述したＲｅｓＮｅｔ５０の代わりに、例えばＶＧＧ１６、ＩｎｃｅｐｔｉｏｎＶ３、ＤｅｎｓｅＮｅｔ、ＩｎｃｅｐｔｉｏｎＲｅｓＮｅｔＶ２等が用いられても良い。 In this embodiment, the network structure of ResNet50 (Residual Network50) may be applied as the favorability prediction model 171. ResNet50 is a CNN model with 50 layers. Note that, instead of the above-mentioned ResNet50, for example, VGG16, InceptionV3, DenseNet, InceptionResNetV2, etc. may be used.

なお、本実施の形態では好意度予測モデル１７１がＣＮＮであるものとして説明するが、好意度予測モデル１７１はＣＮＮに限定されず、ＣＮＮ以外のニューラルネットワーク、ＳＶＭ（Support Vector Machine）、ベイジアンネットワーク、または回帰木等の任意の学習アルゴリズムで構築された学習済みモデルであって良い。 In this embodiment, the favorability prediction model 171 is described as a CNN, but the favorability prediction model 171 is not limited to a CNN and may be a trained model constructed with any learning algorithm, such as a neural network other than a CNN, an SVM (Support Vector Machine), a Bayesian network, or a regression tree.

サーバ１は、パッケージ画像と、該パッケージ画像の好意度とが対応付けられた訓練データの組み合わせを複数取得する。訓練データは、パッケージ画像に対し、該パッケージ画像の好意度の加重平均値がラベル付けされたデータである。なお、本実施形態では、好意度が最大５（好き）から最小１（嫌い）までの５段階に分類された例を説明したが、これに限るものではない。好意度が実際のニーズに合わせて設けられても良い。 The server 1 acquires multiple combinations of training data in which package images are associated with the favorability levels of the package images. The training data is data in which package images are labeled with the weighted average of the favorability levels of the package images. Note that, in this embodiment, an example has been described in which the favorability levels are classified into five levels, from a maximum of 5 (like) to a minimum of 1 (dislike), but this is not limiting. The favorability levels may be set according to actual needs.

なお、本実施形態では、予め記憶された訓練データを取得した例を説明したが、これに限るものではない。例えばサーバ１は、複数のパッケージ画像において、それぞれのパッケージ画像の好意度の加重平均値をラベリングすることで、好意度を学習するための訓練データを生成しても良い。具体的には、サーバ１は、１つのパッケージ画像に対して複数人による回答の好意度を、最大５から最小１までの５段階で取得する。サーバ１は、取得した複数人による回答の好意度の加重平均値を算出する。サーバ１は、パッケージ画像に対し、複数人の該パッケージ画像の好意度の加重平均値をラベル付けすることにより訓練データを作成する。 In the present embodiment, an example has been described in which pre-stored training data is acquired, but the present invention is not limited to this. For example, the server 1 may generate training data for learning favorability by labeling a weighted average of the favorability of each of a plurality of package images. Specifically, the server 1 acquires the favorability of responses from a plurality of people for one package image on a five-point scale ranging from a maximum of 5 to a minimum of 1. The server 1 calculates the weighted average of the favorability of the acquired responses from a plurality of people. The server 1 creates training data by labeling a package image with the weighted average of the favorability of the package image from a plurality of people.

例えば、パッケージ画像Ａに対して、１０人による好意度の回答が取得された場合、３人が好意度５を回答し、５人が好意度４を回答し、２人が好意度３を回答する。サーバ１は、パッケージ画像Ａに対する好意度の回答に基づいて、好意度の加重平均値を算出する。算出された好意度の加重平均値は「４．１」（(5×3 + 4×5 + 3×2 + 2×0 + 1×0）/10）である。サーバ１は、パッケージ画像Ａに対し、「４．１」となった好意度の加重平均値をラベル付けする。 For example, if 10 people provide responses regarding the level of favorability for package image A, 3 people provide a level of favorability of 5, 5 people provide a level of favorability of 4, and 2 people provide a level of favorability of 3. Server 1 calculates a weighted average level of favorability based on the responses regarding the level of favorability for package image A. The calculated weighted average level of favorability is "4.1" ((5 x 3 + 4 x 5 + 3 x 2 + 2 x 0 + 1 x 0)/10). Server 1 labels package image A with the weighted average level of favorability of "4.1".

サーバ１は、取得した訓練データを用いて学習を行う。具体的には、サーバ１は、訓練データであるパッケージ画像を入力層に入力し、中間層での演算処理を経て、出力層から該パッケージ画像の好意度に関する情報を取得する。好意度に関する情報は、好意度の確からしさを示す確率（尤度）値、または好意度の加重平均値を含む。図５では、好意度の加重平均値の例を説明する。図示のように、出力層から出力される予測結果は、好意度（好意度５から好意度１まで）の加重平均値である。 The server 1 performs learning using the acquired training data. Specifically, the server 1 inputs a package image, which is training data, into the input layer, and acquires information about the favorability of the package image from the output layer after arithmetic processing in the intermediate layer. The information about the favorability includes a probability (likelihood) value indicating the likelihood of the favorability, or a weighted average value of the favorability. Figure 5 explains an example of the weighted average value of the favorability. As shown in the figure, the prediction result output from the output layer is a weighted average value of the favorability (from favorability level 5 to favorability level 1).

サーバ１は、出力層から出力された予測結果を、訓練データにおいてパッケージ画像に対しラベル付けされた情報、すなわち正解値と比較し、出力層からの予測値が正解値に近づくように、中間層での演算処理に用いるパラメータを最適化する。当該パラメータは、例えばニューロン間の重み（結合係数）、各ニューロンで用いられる活性化関数の係数等である。パラメータの最適化の方法は特に限定されないが、例えば、サーバ１は誤差逆伝播法を用いて各種パラメータの最適化を行う。 The server 1 compares the prediction results output from the output layer with the information labeled for the package images in the training data, i.e., the correct answer value, and optimizes the parameters used in the calculation process in the intermediate layer so that the predicted value from the output layer approaches the correct answer value. The parameters in question are, for example, the weights (coupling coefficients) between neurons and the coefficients of the activation functions used in each neuron. There are no particular limitations on the method of optimizing the parameters, but for example, the server 1 optimizes various parameters using the backpropagation method.

サーバ１は、訓練データに含まれる各パッケージ画像について上記の処理を行い、好意度予測モデル１７１を生成する。これにより、例えばサーバ１は当該訓練データを用いて好意度予測モデル１７１の学習を行うことで、パッケージ画像の好意度に関する情報を出力可能なモデルを構築することができる。 The server 1 performs the above process for each package image included in the training data to generate a favorability prediction model 171. As a result, for example, the server 1 can use the training data to learn the favorability prediction model 171, thereby constructing a model that can output information related to the favorability of a package image.

サーバ１は、遺伝的アルゴリズムに基づいて生成した候補となるパッケージ画像を取得した場合、取得したパッケージ画像を好意度予測モデル１７１に入力する。サーバ１は、好意度予測モデル１７１の中間層にてパッケージ画像の特徴量を抽出する演算処理を行う。サーバ１は、抽出した特徴量を好意度予測モデル１７１の出力層に入力して、パッケージ画像の好意度に関する情報を出力として取得する。図示のように、好意度の加重平均値は出力層から出力される。好意度の加重平均値は「３．８４」である。なお、算出された好意度の加重平均値に対して四捨五入、切り上げ、又は切り捨て等により補正されても良い。 When the server 1 acquires a candidate package image generated based on a genetic algorithm, it inputs the acquired package image to the favorability prediction model 171. The server 1 performs a calculation process to extract features of the package image in the middle layer of the favorability prediction model 171. The server 1 inputs the extracted features to the output layer of the favorability prediction model 171 and acquires information related to the favorability of the package image as an output. As shown in the figure, the weighted average value of the favorability is output from the output layer. The weighted average value of the favorability is "3.84". The calculated weighted average value of the favorability may be corrected by rounding up, rounding down, or the like.

なお、上述した例では、好意度の加重平均値を予測結果として出力したが、これに限るものではない。例えば、各々の好意度を示す確率値そのものを予測結果として出力しても良い。例えば、パッケージ画像に対し、「好意度５」、「好意度４」、「好意度３」、「好意度２」、「好意度１」それぞれの確率値が、「０．０５」、「０．８５」、「０．０３」、「０．０３」、「０．０４」である予測結果が出力層から出力される。また、所定閾値を利用して予測結果を出力しても良い。例えばサーバ１は、「好意度４」の確率値（０．８５）が所定閾値（例えば、０．８０）以上であると判定した場合、パッケージ画像の好意度が「好意度４」である予測結果を出力する。なお、上述した閾値を利用せず、好意度予測モデル１７１が予測した各々の好意度の確率値から、最も高い確率値に対応する好意度を予測結果として出力しても良い。 In the above example, the weighted average value of the favorability is output as the prediction result, but this is not limited to this. For example, the probability value itself indicating each favorability may be output as the prediction result. For example, the output layer outputs prediction results in which the probability values of "favorability 5", "favorability 4", "favorability 3", "favorability 2", and "favorability 1" for the package image are "0.05", "0.85", "0.03", "0.03", and "0.04", respectively. In addition, the prediction result may be output using a predetermined threshold. For example, when the server 1 determines that the probability value of "favorability 4" (0.85) is equal to or greater than a predetermined threshold (e.g., 0.80), it outputs a prediction result in which the favorability of the package image is "favorability 4". In addition, the above-mentioned threshold may not be used, and the favorability corresponding to the highest probability value from among the probability values of each favorability predicted by the favorability prediction model 171 may be output as the prediction result.

なお、好意度に関する情報の予測処理は、上述した機械学習により予測処理に限るものではない。例えば、Ａ－ＫＡＺＥ（Accelerated KAZE）、ＳＩＦＴ（Scale Invariant Feature Transform）、ＳＵＲＦ（Speeded-Up Robust Features）、ＯＲＢ（Oriented FAST and Rotated BRIEF）、ＨＯＧ(Histograms of Oriented Gradients)等の局所特徴量抽出方法を用いて好意度に関する情報を予測しても良い。 Note that the prediction process for information related to favorability is not limited to the prediction process using machine learning described above. For example, information related to favorability may be predicted using local feature extraction methods such as A-KAZE (Accelerated KAZE), SIFT (Scale Invariant Feature Transform), SURF (Speed-Up Robust Features), ORB (Oriented FAST and Rotated BRIEF), and HOG (Histograms of Oriented Gradients).

また、性別または年代ごとに好意度予測モデル１７１を生成することができる。具体的には、サーバ１は、性別または年代の設定を受け付け、受け付けた性別または年代に応じて、好意度予測モデル１７１の生成に用いる訓練データを複数取得する。サーバ１は、取得した訓練データを用いて、受け付けた性別または年代に対応する好意度予測モデル１７１を生成する。 Favourability prediction models 171 can also be generated for each gender or age group. Specifically, server 1 accepts a gender or age group setting, and acquires multiple pieces of training data to be used in generating favourability prediction models 171 according to the accepted gender or age group. Server 1 uses the acquired training data to generate a favourability prediction model 171 corresponding to the accepted gender or age group.

サーバ１は、性別または年代ごとに生成した好意度予測モデル１７１を大容量記憶部１７に記憶する。サーバ１は、複数の性別または年代に対して用意された複数の好意度予測モデル１７１を用いて、性別または年代ごとにパッケージ画像の好意度に関する情報を出力することができる。 The server 1 stores the favorability prediction model 171 generated for each gender or age group in the mass storage unit 17. The server 1 can output information related to the favorability of the package image for each gender or age group using multiple favorability prediction models 171 prepared for multiple genders or age groups.

また、各性別または年代に対応して用意された複数の好意度予測モデル１７１から出力された好意度に関する情報に基づき、全体（全性別及び全年代）の好意度に関する情報を出力することができる。全体は、例えば２０代男性、２０代女性、３０代男性、３０代女性、４０代男性、４０代女性、５０代男性及び５０代女性を含む。 In addition, information regarding the overall (all genders and all ages) favorability can be output based on the favorability information output from multiple favorability prediction models 171 prepared for each gender or age. The overall includes, for example, men in their 20s, women in their 20s, men in their 30s, women in their 30s, men in their 40s, women in their 40s, men in their 50s, and women in their 50s.

具体的には、サーバ１は、各性別及び年代に対応して用意された複数の好意度予測モデル１７１それぞれを大容量記憶部１７から取得する。サーバ１は、取得したそれぞれの好意度予測モデル１７１にパッケージ画像を入力し、各性別及び年代に対応するパッケージ画像の好意度の加重平均値を出力する。サーバ１は、出力した各性別及び年代に対応するパッケージ画像の好意度の加重平均値に基づき、好意度の加重平均値の平均値を算出する。サーバ１は、算出した加重平均値の平均値を全体の好意度として出力する。 Specifically, the server 1 acquires from the mass storage unit 17 each of a plurality of favorability prediction models 171 prepared corresponding to each gender and age group. The server 1 inputs a package image into each acquired favorability prediction model 171, and outputs a weighted average value of the favorability of the package image corresponding to each gender and age group. The server 1 calculates an average value of the weighted average values of the favorability based on the output weighted average values of the favorability of the package images corresponding to each gender and age group. The server 1 outputs the calculated average value of the weighted average values as the overall favorability.

図６は、デザイン数及びデザイン要素数の受付画面の一例を示す説明図である。該画面は、デザイン数入力欄１１ａ、デザイン要素数入力欄１１ｂ及び進むボタン１１ｃを含む。デザイン数入力欄１１ａは、デザイン（デザイン案）の数を入力するテキストフィールドである。デザイン要素数入力欄１１ｂは、デザイン要素の数を入力するテキストフィールドである。なお、デザイン数及びデザイン要素数それぞれが最大１０である例示しているが、これに限るものではない。なお、デザイン数及びデザイン要素数に関しては、ユーザの入力に限定せず、予め決められても良い。 Figure 6 is an explanatory diagram showing an example of a reception screen for the number of designs and the number of design elements. The screen includes a number of designs input field 11a, a number of design elements input field 11b, and a forward button 11c. The number of designs input field 11a is a text field for inputting the number of designs (design proposals). The number of design elements input field 11b is a text field for inputting the number of design elements. Note that, although an example is shown in which the number of designs and the number of design elements are each a maximum of 10, this is not limited to this. Note that the number of designs and the number of design elements are not limited to being input by the user, and may be determined in advance.

進むボタン１１ｃは、後述するデザイン要素の登録画面（図７）に遷移するボタンである。端末２は、進むボタン１１ｃのタッチ（クリック）操作を受け付けた場合、デザイン数入力欄１１ａにより入力されたデザイン数、及びデザイン要素数入力欄１１ｂにより入力されたデザイン要素数をサーバ１に送信する。サーバ１は、端末２から送信されたデザイン数及びデザイン要素数に応じて、デザイン要素の登録画面を生成して端末２に送信する。端末２は、サーバ１から送信されたデザイン要素の登録画面を受信し、受信したデザイン要素の登録画面に遷移する。 The forward button 11c is a button that transitions to a design element registration screen (Figure 7) described below. When the terminal 2 receives a touch (click) operation of the forward button 11c, it transmits to the server 1 the number of designs input in the design number input field 11a and the number of design elements input in the design element number input field 11b. The server 1 generates a design element registration screen according to the number of designs and the number of design elements transmitted from the terminal 2 and transmits it to the terminal 2. The terminal 2 receives the design element registration screen transmitted from the server 1 and transitions to the received design element registration screen.

図７は、デザイン要素の登録画面の一例を示す説明図である。該画面は、デザイン要素選択欄１２ａ、デザイン選択欄１２ｂ及び登録ボタン１２ｃを含む。サーバ１は、端末２から送信されたデザイン要素数に応じて、デザイン要素数のデザイン要素を選択するためのデザイン要素選択欄１２ａを生成する。サーバ１は、生成したデザイン要素選択欄１２ａに対応付けて、端末２から送信されたデザイン数に応じて、デザイン数のデザインを選択するためのデザイン選択欄１２ｂを生成する。 Figure 7 is an explanatory diagram showing an example of a design element registration screen. The screen includes a design element selection field 12a, a design selection field 12b, and a registration button 12c. The server 1 generates a design element selection field 12a for selecting the number of design elements according to the number of design elements transmitted from the terminal 2. The server 1 generates a design selection field 12b for selecting the number of designs according to the number of designs transmitted from the terminal 2, in association with the generated design element selection field 12a.

デザイン要素選択欄１２ａは、要素名入力欄１２ａ１及び要素種類選択欄１２ａ２を含む。要素名入力欄１２ａ１は、デザイン要素の名称を入力するテキストフィールドである。なお、要素名入力欄１２ａ１に関しては、テキストフィールドに限定せず、予め設けられたデザイン要素の名称に基づいて設定された選択可能なコンボボックスであっても良い。要素種類選択欄１２ａ２は、デザイン要素の種類の選択を受け付けるコンボボックスである。デザイン要素の種類は、テキスト及び非テキスト（テキストではない）を含む。 The design element selection field 12a includes an element name input field 12a1 and an element type selection field 12a2. The element name input field 12a1 is a text field for inputting the name of the design element. Note that the element name input field 12a1 is not limited to a text field, but may be a selectable combo box set based on the names of pre-defined design elements. The element type selection field 12a2 is a combo box that accepts the selection of the type of design element. The types of design elements include text and non-text (not text).

デザイン選択欄１２ｂは、デザイン名入力欄１２ｂ１、要素画像表示欄１２ｂ２、要素画像選択ボタン１２ｂ３、要素画像削除ボタン１２ｂ４及び要素画像名入力欄１２ｂ５を含む。デザイン名入力欄１２ｂ１は、パッケージデザインの名称を入力するテキストフィールドである。要素画像表示欄１２ｂ２は、デザイン要素の要素画像を表示する表示欄である。要素画像選択ボタン１２ｂ３は、要素画像を選択するボタンである。要素画像削除ボタン１２ｂ４は、要素画像を削除するボタンである。要素画像名入力欄１２ｂ５は、要素画像の名称を入力するテキストフィールドである。 The design selection field 12b includes a design name input field 12b1, an element image display field 12b2, an element image selection button 12b3, an element image deletion button 12b4, and an element image name input field 12b5. The design name input field 12b1 is a text field for inputting the name of the package design. The element image display field 12b2 is a display field for displaying the element images of the design elements. The element image selection button 12b3 is a button for selecting an element image. The element image deletion button 12b4 is a button for deleting an element image. The element image name input field 12b5 is a text field for inputting the name of the element image.

例えば、図６で受け付けられたデザイン要素数が４である場合、サーバ１は、デザイン要素選択欄１２ａに含まれる要素名入力欄１２ａ１及び要素種類選択欄１２ａ２それぞれを４つ生成する。図６で受け付けられたデザイン数が５である場合、サーバ１は、それぞれの要素名入力欄１２ａ１及び要素種類選択欄１２ａ２に対応付けて、デザイン選択欄１２ｂに含まれるデザイン名入力欄１２ｂ１、要素画像表示欄１２ｂ２、要素画像選択ボタン１２ｂ３、要素画像削除ボタン１２ｂ４及び要素画像名入力欄１２ｂ５それぞれを２０個（４×５）生成する。 For example, if the number of design elements accepted in FIG. 6 is four, the server 1 generates four element name input fields 12a1 and four element type selection fields 12a2 included in the design element selection field 12a. If the number of designs accepted in FIG. 6 is five, the server 1 generates 20 (4×5) design name input fields 12b1, element image display fields 12b2, element image selection buttons 12b3, element image deletion buttons 12b4, and element image name input fields 12b5 included in the design selection field 12b, corresponding to the respective element name input fields 12a1 and element type selection fields 12a2.

登録ボタン１２ｃは、デザイン要素選択欄１２ａ及びデザイン選択欄１２ｂにより設定された複数のデザイン要素に対するデザインをサーバ１に登録するボタンである。 The registration button 12c is a button that registers designs for multiple design elements set in the design element selection field 12a and the design selection field 12b on the server 1.

端末２は、要素名入力欄１２ａ１の入力操作を受け付けた場合、デザイン要素の名称の入力を受け付ける。端末２は、要素種類選択欄１２ａ２の選択操作を受け付けた場合、要素名入力欄１２ａ１により入力されたデザイン要素の名称に対応するデザイン要素の種類の選択を受け付ける。 When terminal 2 receives an input operation in element name input field 12a1, it receives the input of the name of the design element. When terminal 2 receives a selection operation in element type selection field 12a2, it receives the selection of the type of design element that corresponds to the name of the design element input in element name input field 12a1.

なお、本実施形態では、遺伝的アルゴリズムによるパッケージ画像の生成処理にて、複数のデザイン要素を組み替える際に、種類が「テキスト」であるデザイン要素が重複しないようにルール化される。例えば、「５００ｍｌ」デザイン要素が「テキスト」種類であり、且つ、デザインＡ及びデザインＢの両者どちらともに「５００ｍｌ」が含まれた場合、デザインＡの「５００ｍｌ」デザイン要素とデザインＢの「５００ｍｌ」デザイン要素とを重ねていると、デザインとして成立していないため、該当するケース（状況）は対象外となる。 In this embodiment, when multiple design elements are rearranged in the process of generating a package image using a genetic algorithm, a rule is established so that design elements of the type "text" do not overlap. For example, if the "500ml" design element is of the "text" type and both design A and design B contain "500ml," overlapping the "500ml" design element of design A and the "500ml" design element of design B does not constitute a valid design, and therefore the corresponding case (situation) is excluded.

端末２は、デザイン名入力欄１２ｂ１の入力操作を受け付けた場合、パッケージデザインの名称の入力を受け付ける。端末２は、要素画像選択ボタン１２ｂ３のタッチ操作を受け付けた場合、記憶部２２に記憶されている要素画像の選択を受け付ける。端末２は、受け付けた要素画像を要素画像表示欄１２ｂ２に表示する。なお、端末２は、撮影部２６を介して撮影した要素画像を取得しても良い。 When the terminal 2 receives an input operation in the design name input field 12b1, it receives the input of the name of the package design. When the terminal 2 receives a touch operation on the element image selection button 12b3, it receives the selection of an element image stored in the memory unit 22. The terminal 2 displays the received element image in the element image display field 12b2. The terminal 2 may also obtain an element image photographed via the photographing unit 26.

端末２は、要素画像名入力欄１２ｂ５の入力操作を受け付けた場合、要素画像の名称の入力を受け付ける。端末２は、要素画像削除ボタン１２ｂ４のタッチ操作を受け付けた場合、要素画像表示欄１２ｂ２に表示されている要素画像を削除し、要素画像名入力欄１２ｂ５に入力された要素画像の名称を削除する。 When the terminal 2 receives an input operation in the element image name input field 12b5, it receives the input of the name of the element image. When the terminal 2 receives a touch operation on the element image deletion button 12b4, it deletes the element image displayed in the element image display field 12b2 and deletes the name of the element image input in the element image name input field 12b5.

端末２は、登録ボタン１２ｃのタッチ操作を受け付けた場合、要素名入力欄１２ａ１により入力されたデザイン要素の名称、要素種類選択欄１２ａ２により選択されたデザイン要素の種類、デザイン名入力欄１２ｂ１により入力されたパッケージデザインの名称、要素画像選択ボタン１２ｂ３により選択された要素画像、及び要素画像名入力欄１２ｂ５により入力された要素画像の名称をサーバ１に送信する。 When the terminal 2 receives a touch operation of the registration button 12c, it transmits to the server 1 the name of the design element input in the element name input field 12a1, the type of design element selected in the element type selection field 12a2, the name of the package design input in the design name input field 12b1, the element image selected in the element image selection button 12b3, and the name of the element image input in the element image name input field 12b5.

図８は、提案すべきパッケージ画像の表示画面の一例を示す説明図である。なお、図８では、遺伝的アルゴリズムにより生成された第１世代のパッケージ画像を表示する例を説明するが、ほかのパッケージ画像にも同様に適用することができる。該画面は、画像ＩＤ表示欄１３ａ、画像表示欄１３ｂ、好意度表示欄１３ｃ、画像保存ボタン１３ｄ、次世代画像生成ボタン１３ｅ及び切り替えボタン１３ｆを含む。 Figure 8 is an explanatory diagram showing an example of a display screen for a package image to be proposed. Note that while Figure 8 shows an example of displaying a first generation package image generated by a genetic algorithm, this can be similarly applied to other package images. The screen includes an image ID display field 13a, an image display field 13b, a favorability display field 13c, an image save button 13d, a next generation image generation button 13e, and a switch button 13f.

画像ＩＤ表示欄１３ａは、パッケージ画像のＩＤを表示する表示欄である。画像表示欄１３ｂは、パッケージ画像を表示する表示欄である。好意度表示欄１３ｃは、パッケージ画像の好意度に関する情報を表示する表示欄である。即ち、それぞれのパッケージ画像に対し、パッケージ画像のＩＤ、該パッケージ画像及び該パッケージ画像の好意度が同時表示される。 The image ID display field 13a is a display field that displays the ID of the package image. The image display field 13b is a display field that displays the package image. The favorability display field 13c is a display field that displays information related to the favorability of the package image. That is, for each package image, the package image ID, the package image, and the favorability of the package image are simultaneously displayed.

画像保存ボタン１３ｄは、パッケージ画像を保存するボタンである。次世代画像生成ボタン１３ｅは、次世代のパッケージ画像を生成するボタンである。なお、図８においては、第２世代のパッケージ画像を生成するボタンを例示しているが、例えば第２世代のパッケージ画像に基づいて第３世代のパッケージ画像を生成した場合、次世代画像生成ボタン１３ｅが第３世代のパッケージ画像を生成するボタンとなる。切り替えボタン１３ｆは、過去に生成された各世代のパッケージ画像を切り替えて表示するボタンである。 The image save button 13d is a button for saving a package image. The next generation image generation button 13e is a button for generating a next generation package image. Note that in FIG. 8, a button for generating a second generation package image is shown as an example, but if a third generation package image is generated based on the second generation package image, for example, the next generation image generation button 13e becomes a button for generating the third generation package image. The switch button 13f is a button for switching between displaying package images of each generation generated in the past.

サーバ１は、図７で端末２を経由して受け付けた複数のデザイン要素を、遺伝的アルゴリズムに基づく交叉処理及び突然変異処理を行い、候補となるパッケージ画像を複数生成する。例えば、受け付けられたデザイン要素数が４であり、且つ、デザイン数が５である場合、サーバ１は、デザイン要素数とデザイン数との組み合わせに基づいて、５^{（４×２）}枚のパッケージ画像を生成する。サーバ１は、生成した複数のパッケージ画像からランダムに所定数（例えば、５０）のパッケージ画像を抽出する。サーバ１は、抽出したパッケージ画像を候補となるパッケージ画像として出力する。サーバ１は、出力した複数の候補となるパッケージ画像を好意度予測モデル１７１に入力して、各パッケージ画像の好意度に関する情報を出力する。 The server 1 performs crossover and mutation processing based on a genetic algorithm on the multiple design elements received via the terminal 2 in Fig. 7 to generate multiple candidate package images. For example, if the number of received design elements is 4 and the number of designs is 5, the server 1 generates 5 ^{(4 x 2)} package images based on a combination of the number of design elements and the number of designs. The server 1 randomly extracts a predetermined number (e.g., 50) of package images from the multiple package images generated. The server 1 outputs the extracted package images as candidate package images. The server 1 inputs the multiple candidate package images output to the favorability prediction model 171 and outputs information regarding the favorability of each package image.

サーバ１は、出力した好意度に関する情報に基づいて、各パッケージ画像を評価する。例えば、好意度の加重平均値が所定閾値以上であるパッケージ画像を最も好意度の高い個体として評価しても良い。サーバ１は、好意度が高いと判定したパッケージ画像を、第１世代のパッケージ画像として抽出する。サーバ１は、抽出した複数のパッケージ画像、並びに、各パッケージ画像のＩＤ及び好意度に関する情報を端末２に送信する。 The server 1 evaluates each package image based on the output information on the favorability. For example, a package image whose weighted average favorability is equal to or greater than a predetermined threshold may be evaluated as the most favorable individual. The server 1 extracts package images that are determined to have a high favorability as first-generation package images. The server 1 transmits the extracted package images, as well as information on the ID and favorability of each package image, to the terminal 2.

端末２は、サーバ１から送信された複数のパッケージ画像、各パッケージ画像のＩＤ及び好意度に関する情報を受信して画面に表示する。具体的には、端末２は、各パッケージ画像のＩＤを該当する画像ＩＤ表示欄１３ａに表示し、各パッケージ画像を該当する画像表示欄１３ｂに表示し、各パッケージ画像の好意度を該当する好意度表示欄１３ｃに表示する。 The terminal 2 receives the multiple package images, the ID of each package image, and information regarding the favorability level sent from the server 1, and displays them on the screen. Specifically, the terminal 2 displays the ID of each package image in the corresponding image ID display field 13a, displays each package image in the corresponding image display field 13b, and displays the favorability level of each package image in the corresponding favorability display field 13c.

端末２は、画像保存ボタン１３ｄのタッチ操作を受け付けた場合、複数のパッケージ画像を、指定された画像フォーマットで記憶部２２または外部記憶装置に記憶する。画像フォーマットは、ｂｍｐ、ｔｉｆｆ、ｊｐｇ、ｊｐｅｇ、ｇｉｆ、またはｐｎｇフォーマットのような、任意の適切な画像フォーマットであって良い。 When the terminal 2 receives a touch operation of the image save button 13d, the terminal 2 stores the multiple package images in the storage unit 22 or an external storage device in the specified image format. The image format may be any appropriate image format, such as bmp, tiff, jpg, jpeg, gif, or png format.

端末２は、次世代画像生成ボタン１３ｅのタッチ操作を受け付けた場合、第１世代のパッケージ画像に基づく第２世代のパッケージ画像の生成要求をサーバ１に送信する。端末２は、切り替えボタン１３ｆのタッチ操作を受け付けた場合、生成された各世代のパッケージ画像を切り替えて表示する。例えば端末２は、逐次的な循環順序（第１世代のパッケージ画像、第２世代のパッケージ画像、・・・・・・、第Ｎ世代のパッケージ画像、第１世代のパッケージ画像）で、各世代のパッケージ画像を切り替えて表示しても良い。 When terminal 2 receives a touch operation of next generation image generation button 13e, it sends to server 1 a request to generate a second generation package image based on the first generation package image. When terminal 2 receives a touch operation of switch button 13f, it switches between and displays the generated package images of each generation. For example, terminal 2 may switch between and display the package images of each generation in a sequential cyclical order (first generation package image, second generation package image, ..., Nth generation package image, first generation package image).

図９及び図１０は、提案すべきパッケージ画像を出力する際の処理手順を示すフローチャートである。端末２の制御部２１は、ユーザによるデザイン数及びデザイン要素数の入力を入力部２４により受け付ける（ステップＳ２０１）。制御部２１は、受け付けたデザイン数及びデザイン要素数を通信部２３によりサーバ１に送信する（ステップＳ２０２）。 Figures 9 and 10 are flowcharts showing the processing steps when outputting package images to be proposed. The control unit 21 of the terminal 2 accepts input of the number of designs and the number of design elements by the user through the input unit 24 (step S201). The control unit 21 transmits the accepted number of designs and the number of design elements to the server 1 through the communication unit 23 (step S202).

サーバ１の制御部１１は、端末２から送信されたデザイン数及びデザイン要素数を通信部１３により受信する（ステップＳ１０１）。制御部１１は、受信したザイン数及びデザイン要素数に応じて、デザイン要素数のデザイン要素を選択するためのデザイン要素選択欄、及び、各デザイン要素選択欄に対応付けてデザイン数のデザインを選択するためのデザイン選択欄を生成する（ステップＳ１０２）。制御部１１は、生成したデザイン要素選択欄及びデザイン選択欄を通信部１３により端末２に送信する（ステップＳ１０３）。 The control unit 11 of the server 1 receives the number of designs and the number of design elements transmitted from the terminal 2 via the communication unit 13 (step S101). The control unit 11 generates design element selection fields for selecting design elements of the number of design elements and design selection fields for selecting designs of the number of designs in correspondence with each design element selection field according to the received number of designs and number of design elements (step S102). The control unit 11 transmits the generated design element selection fields and design selection fields to the terminal 2 via the communication unit 13 (step S103).

端末２の制御部２１は、サーバ１から送信されたデザイン要素選択欄及びデザイン選択欄を通信部２３により受信する（ステップＳ２０３）。制御部２１は、受信したデザイン要素選択欄及びデザイン選択欄を表示部２５により表示する（ステップＳ２０４）。制御部２１は、入力部２４を介して、デザイン要素数のデザイン要素（要素名称及び要素種類等）と、各デザイン要素に対応付けてデザイン数のデザイン（デザイン名称及び要素画像等）との入力を受け付ける（ステップＳ２０５）。 The control unit 21 of the terminal 2 receives the design element selection field and the design selection field sent from the server 1 via the communication unit 23 (step S203). The control unit 21 displays the received design element selection field and the design selection field on the display unit 25 (step S204). The control unit 21 accepts input of the number of design elements (element names and element types, etc.) and the number of designs (design names and element images, etc.) associated with each design element via the input unit 24 (step S205).

制御部２１は、入力部２４を介して、パッケージ画像を評価するための評価条件の設定を受け付ける（ステップＳ２０６）。評価条件は、全体（全性別及び全年代）による評価、性別による評価、年代による評価、性別及び年代（例えば、女性２０代）による評価などの評価条件を含む。なお、全体、性別、年代、性別及び年代のいずれか１つまたは２つ以上を組み合わせたものが評価条件として設定されても良い。 The control unit 21 accepts the setting of evaluation conditions for evaluating the package image via the input unit 24 (step S206). The evaluation conditions include evaluation by overall (all genders and all ages), evaluation by gender, evaluation by age, evaluation by gender and age (for example, women in their 20s), etc. Note that the evaluation conditions may be set to be one or a combination of two or more of overall, gender, age, or gender and age.

制御部２１は、受け付けたデザイン要素に対するデザイン、及び評価条件を通信部２３によりサーバ１に送信する（ステップＳ２０７）。サーバ１の制御部１１は、端末２から送信されたデザイン要素に対するデザイン、及び評価条件を通信部１３により受信する（ステップＳ１０４）。制御部１１は、受信した複数のデザイン要素を、遺伝的アルゴリズムに基づく交叉処理及び突然変異処理を行い、次世代の候補となるパッケージ画像を複数生成する（ステップＳ１０５）。 The control unit 21 transmits the design and evaluation conditions for the accepted design elements to the server 1 via the communication unit 23 (step S207). The control unit 11 of the server 1 receives the design and evaluation conditions for the design elements transmitted from the terminal 2 via the communication unit 13 (step S104). The control unit 11 performs crossover and mutation processes based on a genetic algorithm on the received multiple design elements to generate multiple package images that are candidates for the next generation (step S105).

制御部１１は、受信した評価条件に応じて学習済みの好意度予測モデル１７１を特定する（ステップＳ１０６）。制御部１１は、生成した複数のパッケージ画像を、特定した学習済みの好意度予測モデル１７１に入力して、設定された評価条件に応じて、各パッケージ画像の好意度に関する情報を出力（取得）する（ステップＳ１０７）。 The control unit 11 identifies a trained favorability prediction model 171 according to the received evaluation conditions (step S106). The control unit 11 inputs the generated package images into the identified trained favorability prediction model 171, and outputs (obtains) information regarding the favorability of each package image according to the set evaluation conditions (step S107).

制御部１１は、取得した好意度に関する情報に基づいて、複数のパッケージ画像から提案すべきパッケージ画像を抽出する（ステップＳ１０８）。具体的には、制御部１１は、取得した好意度に関する情報に基づいて、各パッケージ画像を評価する。例えば、好意度の加重平均値の降順に上位からＮ番目（例えば、２０）までのパッケージ画像を最も好意度の高い個体として評価しても良い。制御部１１は、好意度が高いと判定したパッケージ画像を、提案すべきパッケージ画像として抽出する。なお、初回抽出された提案すべきパッケージ画像は、第１世代のパッケージ画像とされる。 The control unit 11 extracts a package image to be proposed from the multiple package images based on the obtained information on the favorability (step S108). Specifically, the control unit 11 evaluates each package image based on the obtained information on the favorability. For example, the top N package images (e.g., 20) may be evaluated in descending order of the weighted average favorability as the most favorable individuals. The control unit 11 extracts a package image that is determined to have a high favorability as a package image to be proposed. Note that the package image to be proposed that is initially extracted is considered to be a first-generation package image.

制御部１１は、抽出した提案すべきパッケージ画像、該パッケージ画像のＩＤ及び好意度に関する情報を通信部１３により端末２に送信する（ステップＳ１０９）。端末２の制御部２１は、サーバ１から送信された提案すべきパッケージ画像、該パッケージ画像のＩＤ及び好意度に関する情報を通信部２３により受信する（ステップＳ２０８）。制御部２１は、受信した提案すべきパッケージ画像、該パッケージ画像のＩＤ及び好意度に関する情報を表示部２５により表示する（ステップＳ２０９）。 The control unit 11 transmits the extracted package image to be proposed, the package image's ID, and information regarding the favorability level to the terminal 2 via the communication unit 13 (step S109). The control unit 21 of the terminal 2 receives the package image to be proposed, the package image's ID, and information regarding the favorability level transmitted from the server 1 via the communication unit 23 (step S208). The control unit 21 displays the received package image to be proposed, the package image's ID, and information regarding the favorability level on the display unit 25 (step S209).

制御部２１は、今回生成された世代のパッケージ画像に基づく次世代のパッケージ画像の生成要求を入力部２４により受け付けた場合（ステップＳ２１０）、今回生成された世代のパッケージ画像から複数のデザイン要素を抽出する（ステップＳ２１１）。そして、制御部２１はステップＳ２０７の処理に戻り、抽出した複数のデザイン要素に対するデザイン、及び評価条件を通信部２３によりサーバ１に送信する。 When the control unit 21 receives a request to generate a next-generation package image based on the package image of the currently generated generation through the input unit 24 (step S210), the control unit 21 extracts multiple design elements from the package image of the currently generated generation (step S211). Then, the control unit 21 returns to the process of step S207, and transmits the designs and evaluation conditions for the extracted multiple design elements to the server 1 through the communication unit 23.

本実施形態によると、遺伝的アルゴリズムを用いたデザイン要素の組合せ最適化により、ユーザが満足する最終的な次世代のパッケージ画像を生成することが可能となる。 According to this embodiment, by optimizing the combination of design elements using a genetic algorithm, it is possible to generate the final next-generation package image that satisfies users.

本実施形態によると、遺伝的アルゴリズムを用いて生成された候補となるパッケージ画像を好意度予測モデル１７１に入力して、該パッケージ画像の好意度に関する情報を出力することが可能となる。 According to this embodiment, candidate package images generated using a genetic algorithm can be input to the favorability prediction model 171, and information regarding the favorability of the package image can be output.

本実施形態によると、遺伝的アルゴリズムと好意度予測モデル１７１との両方を併用することにより、遺伝的アルゴリズムに基づいて生成された複数の候補となるパッケージ画像から、好意度が高いパッケージ画像を抽出して提案すべきパッケージ画像として出力することが可能となる。 According to this embodiment, by using both a genetic algorithm and a favorability prediction model 171 in combination, it is possible to extract a package image with a high favorability rating from multiple candidate package images generated based on the genetic algorithm and output it as the package image to be proposed.

＜変形例１＞
商品のカテゴリ毎に設定されたスコアを用いて学習させた好意度予測モデル１７１に、遺伝的アルゴリズムに基づいて生成された複数の候補となるパッケージ画像を入力して、各パッケージ画像の好意度に関する情報を出力する処理を説明する。 <Modification 1>
We will explain the process of inputting multiple candidate package images generated based on a genetic algorithm into a favorability prediction model 171 that has been trained using scores set for each product category, and outputting information regarding the favorability of each package image.

商品のカテゴリは、例えばビール、チューハイ・カクテル、炭酸飲料、お茶・紅茶・コーヒー、栄養ドリンク、調味料、カップ麺・袋麺、スープ類、デザート類、常温食品または冷凍食品等を含む。その他、洗剤、仕上げ剤、水筒または化粧品等が商品のカテゴリに含まれても良い。 Product categories include, for example, beer, chuhai/cocktails, carbonated drinks, tea/black tea/coffee, energy drinks, seasonings, instant noodles/bag noodles, soups, desserts, room temperature foods or frozen foods, etc. Other product categories may also include detergents, finishing agents, water bottles, cosmetics, etc.

まず、スコアの作成処理を説明する。サーバ１は、商品のカテゴリ毎に、商品を検索するためのブランド名及び企業名の入力を受け付ける。ブランド名及び企業名は、例えば「ブランドＡ株式会社ＡＡＡ」等である。なお、「ブランド＋企業名」に限定せず、例えば「商品名＋企業名」または「ブランド名＋カテゴリ名」等であっても良い。サーバ１は検索エンジン経由で、受け付けたブランド名及び企業名に対する検索ヒット数を取得する。検索エンジンは、例えばＧＯＯＧＬＥ（登録商標）、またはＹＡＨＯＯ（登録商標）等であっても良い。 First, the process of creating the score will be explained. The server 1 accepts input of the brand name and company name for searching for products for each product category. The brand name and company name are, for example, "Brand A AAA Co., Ltd." Note that the brand name and company name are not limited to "brand + company name" and may be, for example, "product name + company name" or "brand name + category name." The server 1 obtains the number of search hits for the accepted brand name and company name via a search engine. The search engine may be, for example, GOOGLE (registered trademark) or YAHOO (registered trademark), etc.

サーバ１は、取得したブランド名及び企業名に対する検索ヒット数の対数を算出し、算出した検索ヒット数の対数をスコアとして出力する。サーバ１は、商品のカテゴリ内の複数のブランドにおけるスコアの平均値を算出する。サーバ１は、算出したスコアの平均値を商品のカテゴリに対応付けて記憶部１２または大容量記憶部１７に記憶する。 The server 1 calculates the logarithm of the number of search hits for the acquired brand name and company name, and outputs the calculated logarithm of the number of search hits as a score. The server 1 calculates the average score for multiple brands within the product category. The server 1 stores the calculated average score in the memory unit 12 or the mass memory unit 17 in association with the product category.

次に、スコアを用いて好意度予測モデル１７１を学習させる処理を説明する。
図１１は、スコアを用いて学習させる好意度予測モデル１７１を説明する説明図である。なお、図５と重複する内容については説明を省略する。変形例１の好意度予測モデル１７１は、ＲｅｓＮｅｔ５０のネットワーク構造が適用される。サーバ１は、パッケージ画像及び該パッケージ画像に対応するスコアと、該パッケージ画像の好意度とが対応付けられた訓練データの組み合わせを複数取得する。 Next, a process of training the favorability prediction model 171 using the scores will be described.
Fig. 11 is an explanatory diagram for explaining a favorability prediction model 171 that is trained using scores. Note that explanations of contents that overlap with Fig. 5 will be omitted. The favorability prediction model 171 of Modification 1 applies the network structure of ResNet 50. The server 1 acquires a plurality of combinations of training data in which package images, scores corresponding to the package images, and favorability of the package images are associated with each other.

サーバ１は、取得した訓練データ用いて学習を行う。具体的には、サーバ１は、訓練データであるパッケージ画像をＲｅｓＮｅｔ５０の全結合層（ＦＣ）１４ｃを除いたネットワーク１４ａに入力する。ネットワーク１４ａは、パッケージ画像の特徴量を抽出し、抽出したパッケージ画像の特徴量を、バックプロパゲーションによりパラメータが学習された全結合層１４ｃに受け渡す。サーバ１は、該パッケージ画像に対応するスコアを全結合層１４ｂに入力する。全結合層１４ｃは、パッケージ画像の特徴量と、全結合層１４ｂに入力されたスコアとを結合し、結合した情報に基づいてパッケージ画像の好意度に関する情報を予測する。予測結果は、ニューロンを有する出力層に出力される。 The server 1 performs learning using the acquired training data. Specifically, the server 1 inputs the package image, which is the training data, to the network 14a of the ResNet 50, excluding the fully connected layer (FC) 14c. The network 14a extracts the features of the package image, and passes the extracted features of the package image to the fully connected layer 14c, whose parameters have been learned by backpropagation. The server 1 inputs the score corresponding to the package image to the fully connected layer 14b. The fully connected layer 14c combines the features of the package image with the score input to the fully connected layer 14b, and predicts information related to the favorability of the package image based on the combined information. The prediction result is output to an output layer having neurons.

サーバ１は、出力層から出力された予測結果を、訓練データにおいてパッケージ画像及びスコアに対しラベル付けされた情報、すなわち正解値と比較し、出力層からの予測値が正解値に近づくように、中間層での演算処理に用いるパラメータを最適化する。これにより、サーバ１は当該訓練データを用いて好意度予測モデル１７１の学習を行うことで、学習させた好意度予測モデル１７１を大容量記憶部１７に更新する。 The server 1 compares the prediction results output from the output layer with the information labeled for the package image and score in the training data, i.e., the correct answer value, and optimizes the parameters used in the calculation process in the intermediate layer so that the predicted value from the output layer approaches the correct answer value. In this way, the server 1 uses the training data to train the favorability prediction model 171, and updates the trained favorability prediction model 171 in the large-capacity storage unit 17.

サーバ１は、パッケージ画像及び該パッケージ画像に対応するスコアを取得した場合、取得したパッケージ画像及びスコアを好意度予測モデル１７１に入力して、該パッケージ画像の好意度に関する情報を出力する。 When the server 1 acquires a package image and a score corresponding to the package image, the server 1 inputs the acquired package image and score into the favorability prediction model 171 and outputs information regarding the favorability of the package image.

なお、上述した処理に限るものではない。商品のカテゴリと、商品のカテゴリ毎に設定されたスコアとを用いて、好意度予測モデル１７１を学習させても良い。この場合、サーバ１は、パッケージ画像をネットワーク１４ａに入力する。ネットワーク１４ａは、パッケージ画像の特徴量を抽出し、抽出したパッケージ画像の特徴量を全結合層１４ｃに受け渡す。サーバ１は、商品のカテゴリを示すラベルと、カテゴリ毎に設定されたスコアとを結合して１つのベクトルにしたベクトル情報を全結合層１４ｂに入力する。カテゴリを示すラベルは、例えばビール、チューハイ・カクテル、その他アルコール、炭酸飲料、お茶・紅茶・コーヒー、インスタント飲料、栄養ドリンク、その他飲料、調味料、カップ麺・袋麺、スープ類、デザート類、常温食品、冷凍食品、冷蔵食品、チョコ・飴・グミ、アイス、スナック菓子、その他菓子、化粧品、シャンプー、医薬品、リップクリーム、衛生用品、ペットフード及び鍋つゆを有する２６種類のラベルであっても良い。なお、上述した２６種類のラベルは一例であり、これに限るものではない。 Note that the process is not limited to the above. The favorability prediction model 171 may be trained using the product category and the score set for each product category. In this case, the server 1 inputs the package image to the network 14a. The network 14a extracts the feature amount of the package image and passes the extracted feature amount of the package image to the fully connected layer 14c. The server 1 inputs vector information obtained by combining the label indicating the product category and the score set for each category into one vector to the fully connected layer 14b. The label indicating the category may be, for example, 26 types of labels including beer, chuhai/cocktail, other alcohol, carbonated drinks, tea/black tea/coffee, instant drinks, nutritional drinks, other drinks, seasonings, cup noodles/bag noodles, soups, desserts, room temperature foods, frozen foods, refrigerated foods, chocolate/candy/gummy, ice cream, snacks, other sweets, cosmetics, shampoo, medicine, lip balm, sanitary products, pet food, and hot pot soup. Note that the 26 types of labels described above are only an example and are not limited to this.

全結合層１４ｃは、パッケージ画像の特徴量と、全結合層１４ｂに入力されたベクトル情報とに基づいてパッケージ画像の好意度に関する情報を予測する。予測結果は、ニューロンを有する出力層に出力される。 The fully connected layer 14c predicts information about the favorability of the package image based on the features of the package image and the vector information input to the fully connected layer 14b. The prediction result is output to an output layer having neurons.

そして、スコアを用いて学習させた好意度予測モデル１７１を利用して提案すべきパッケージ画像を出力する処理を説明する。サーバ１は、商品のカテゴリ毎に設定されたスコア、及び、遺伝的アルゴリズムに基づいて生成した複数の候補となるパッケージ画像を取得する。サーバ１は、取得したスコアと複数のパッケージ画像とを好意度予測モデル１７１に入力して、各パッケージ画像の好意度に関する情報を出力する。 Then, a process of outputting package images to be proposed using the favorability prediction model 171 trained using the scores will be described. The server 1 acquires the scores set for each product category and multiple candidate package images generated based on a genetic algorithm. The server 1 inputs the acquired scores and multiple package images into the favorability prediction model 171, and outputs information related to the favorability of each package image.

サーバ１は、出力した好意度に関する情報に基づいて、複数の候補となるパッケージ画像から提案すべきパッケージ画像を抽出する。サーバ１は、抽出した提案すべきパッケージ画像を端末２に送信する。端末２は、サーバ１から送信された提案すべきパッケージ画像を受信して表示する。 The server 1 extracts a package image to be proposed from a plurality of candidate package images based on the output information on the degree of favorability. The server 1 transmits the extracted package image to be proposed to the terminal 2. The terminal 2 receives and displays the package image to be proposed transmitted from the server 1.

本変形例１によると、商品のカテゴリ毎に設定されたスコアを用いて学習させた好意度予測モデル１７１を用いて、遺伝的アルゴリズムに基づいて生成された候補となるパッケージ画像の好意度に関する情報を出力することが可能となる。 According to this variant example 1, it is possible to output information regarding the favorability of candidate package images generated based on a genetic algorithm using a favorability prediction model 171 trained using scores set for each product category.

本変形例１によると、スコアを用いて学習させた好意度予測モデル１７１を利用することにより、好意度に関する情報の予測精度を向上するため、適当なパッケージ画像を提案することが可能となる。 According to this variant example 1, by utilizing the favorability prediction model 171 trained using the scores, it is possible to improve the prediction accuracy of information related to favorability, and thus to suggest appropriate package images.

（実施形態２）
実施形態２は、複数のスタイル画像を用いて候補となるパッケージ画像を複数生成する形態に関する。なお、実施形態１と重複する内容については説明を省略する。スタイル画像は、元画像となる基本パッケージ画像と合わせて、特定のスタイル（主題またはテーマ等）に関するパッケージデザインを実現するために使用される画像である。スタイルは、例えば風景、国、文化、季節、海、山、島または街等を含む。 (Embodiment 2)
The second embodiment relates to a form in which a plurality of candidate package images are generated using a plurality of style images. Note that a description of the contents overlapping with the first embodiment will be omitted. A style image is an image used to realize a package design related to a specific style (subject or theme, etc.) in combination with a basic package image that is an original image. Styles include, for example, landscapes, countries, cultures, seasons, oceans, mountains, islands, towns, etc.

図１２は、実施形態２のサーバ１の構成例を示すブロック図である。なお、図２と重複する内容については説明を省略する。大容量記憶部１７には、生成モデル１７２が記憶されている。生成モデル１７２は、ＳｔｙｌｅＧＡＮ（A Style-Based Generator Architecture for Generative Adversarial Networks）の手法を用いて基本パッケージ画像及び複数のスタイル画像を学習することにより、候補となるパッケージ画像を複数生成する生成器であり、機械学習により生成された学習済みモデルである。 Fig. 12 is a block diagram showing an example of the configuration of the server 1 according to the second embodiment. Note that a description of the contents overlapping with Fig. 2 will be omitted. A generative model 172 is stored in the mass storage unit 17. The generative model 172 is a generator that generates multiple candidate package images by learning a basic package image and multiple style images using a StyleGAN (A Style-Based Generator Architecture for Generative Adversarial Networks) method, and is a trained model generated by machine learning.

図１３は、ＳｔｙｌｅＧＡＮの構成を概念的に示す構成図である。ＳｔｙｌｅＧＡＮは、敵対的生成ネットワークＧＡＮ（Generative Adversarial Network）の一種である。ＳｔｙｌｅＧＡＮにより構築された生成器は、マッピングネットワーク（Mapping network）ｆ（以下、「ネットワークｆ」と称する）と合成ネットワーク（Synthesis network）ｇ（以下、「ネットワークｇ」と称する）との２つで構成される。ネットワークｆは、８層の全連結層（ＦＣ）ｆ１からなるニューラルネットから成り、潜在変数（ｚ∈Ｚ）を潜在空間（ｗ∈Ｗ）に非線形変換する（マッピング）。 Figure 13 is a diagram conceptually illustrating the structure of StyleGAN. StyleGAN is a type of generative adversarial network (GAN). The generator constructed by StyleGAN is composed of two networks: a mapping network f (hereinafter referred to as "network f") and a synthesis network g (hereinafter referred to as "network g"). Network f is composed of a neural network consisting of an 8-layer fully connected layer (FC) f1, and performs nonlinear conversion (mapping) of latent variables (z∈Z) to a latent space (w∈W).

ネットワークｇは、例えば４×４サイズの定数画像から開始し、アップサンプリング処理を繰り返して、例えば最終的な出力となる１０２４×１０２４サイズの画像を生成するネットワークである。ネットワークｇは、１８層から成り、ネットワークｆから出力されたベクトルｗ∈Ｗがネットワークｇの中で１８回参照される。 Network g is a network that starts with a constant image of, for example, 4x4 size, and repeats the upsampling process to generate an image of, for example, 1024x1024 size as the final output. Network g consists of 18 layers, and the vector w∈W output from network f is referenced 18 times in network g.

ネットワークｇは、合成ネットワーク部ｇ１が複数連結される。１番目のネットワーク部ｇ１は、定数テンソル（constant tensor）層２０ａ、２つのＡｄａＩＮ層２０ｂ及び畳み込み層（コンボリューション層）２０ｃを含む。２層目からのネットワーク部ｇ１は、アップサンプリング層２０ｄ、２つのＡｄａＩＮ層２０ｂ及び２つの畳み込み層２０ｃを含む。定数テンソル層２０ａは、予め用意された最小解像度のパラメータ（定数テンソル）をネットワークｇに入力する層である。ＡｄａＩＮ層２０ｂは、ネットワークｆで非線形変換から得られたベクトルｗ∈Ｗをスタイルに関する情報として空間データに適用する層である。畳み込み層２０ｃは、例えばサイズ３×３のフィルタを持つ畳み込み層である。アップサンプリング層２０ｄは、画像のアップサンプリング処理を行う層である。 Network g is a combination of multiple synthesis network parts g1. The first network part g1 includes a constant tensor layer 20a, two AdaIN layers 20b, and a convolution layer 20c. The second and subsequent network parts g1 include an upsampling layer 20d, two AdaIN layers 20b, and two convolution layers 20c. The constant tensor layer 20a is a layer that inputs a minimum resolution parameter (constant tensor) prepared in advance to the network g. The AdaIN layer 20b is a layer that applies the vector w∈W obtained from the nonlinear transformation in the network f to spatial data as information about the style. The convolution layer 20c is a convolution layer with a filter of size 3×3, for example. The upsampling layer 20d is a layer that performs upsampling processing of the image.

サーバ１は、基本パッケージ画像と複数のスタイル画像とを生成モデル１７２に入力した場合、ネットワークｆを通じて、パッケージ画像の生成に用いる潜在変数（ｚ∈Ｚ）を潜在空間（ｗ∈Ｗ）に変換する。サーバ１は、変換した潜在空間（ｗ∈Ｗ）をスタイルに関する情報としてネットワークｇに入力する。サーバ１は、ＡｄａＩＮ層２０ｂを通じて、潜在空間（ｗ∈Ｗ）にアフィン変換をしてＡｄａＩＮ処理を行う。 When the server 1 inputs a basic package image and multiple style images to the generative model 172, it converts the latent variables (z∈Z) used to generate the package image into a latent space (w∈W) through the network f. The server 1 inputs the converted latent space (w∈W) into the network g as information related to the style. The server 1 performs an affine transformation on the latent space (w∈W) through the AdaIN layer 20b to perform AdaIN processing.

サーバ１は、ＡｄａＩｎ処理により潜在空間（ｗ∈Ｗ）に適用されたスタイルに関する情報に基づいて、畳み込み層２０ｃを通じて、各画素の画素値を畳み込んでスタイル画像の特徴量を抽出する。その後に、サーバ１は、確率的な変動を行うために、ノイズ（noise）を潜在空間（ｗ∈Ｗ）に入力する。上述した処理によって、スタイル画像のスタイルに関する情報を基本パッケージ画像に適応することができる。 Based on the style information applied to the latent space (w∈W) by the Adaln process, the server 1 extracts features of the style image by convolving the pixel values of each pixel through the convolution layer 20c. After that, the server 1 inputs noise into the latent space (w∈W) to perform probabilistic variation. Through the above-mentioned process, the style information of the style image can be applied to the basic package image.

ネットワークｇは、アップサンプリング層２０ｄを通じてアップサンプリング処理を行うことにより、徐々に生成画像（パッケージ画像）の解像度を上げる。図示のように、解像度は、例えば４×４から順次アップサンプリングし、例えば最終的に１０２４×１０２４になる。これに従い、アップサンプリングを行うことによって、各解像度において異なるスタイルに関する情報を潜在空間（ｗ∈Ｗ）に適用させることで、生成画像（パッケージ画像）のスタイルを制御することができる。 The network g gradually increases the resolution of the generated image (package image) by performing an upsampling process through the upsampling layer 20d. As shown in the figure, the resolution is successively upsampled from, for example, 4x4, eventually becoming, for example, 1024x1024. Accordingly, by performing upsampling, information about different styles at each resolution can be applied to the latent space (w∈W), thereby controlling the style of the generated image (package image).

これによって、例えば基本パッケージ画像と、１つのスタイル画像とが生成モデル１７２に入力された場合、複数の候補となるパッケージ画像を出力することができる。具体的には、サーバ１は、基本パッケージ画像とスタイル画像とを生成モデル１７２に入力する。サーバ１は、ネットワークｆを通じて、パッケージ画像の生成に用いる潜在変数（ｚ∈Ｚ）を潜在空間（ｗ∈Ｗ）に変換する。サーバ１は、変換した潜在空間（ｗ∈Ｗ）をスタイルに関する情報としてネットワークｇに入力する。 As a result, for example, when a basic package image and one style image are input to the generative model 172, multiple candidate package images can be output. Specifically, the server 1 inputs the basic package image and the style image to the generative model 172. The server 1 converts the latent variables (z∈Z) used to generate the package image into a latent space (w∈W) via the network f. The server 1 inputs the converted latent space (w∈W) to the network g as information related to the style.

サーバ１は、ネットワークｇの複数のネットワーク部ｇ１の選択を受け付ける。ネットワークｇの各ネットワーク部ｇ１は、解像度に基づいて、例えば、４×４から８×８までの低解像度スタイル（Coarse styles）、１６×１６から３２×３２までの中解像度スタイル（Middle styles）、及び６４×６４から１０２４×１０２４までの高解像度（Fine Style）に分類される。例えばサーバ１は、低解像度スタイル、中解像度スタイルまたは高解像度スタイルからいずれかの１つの選択を受け付けても良い。その他、複数のネットワーク部ｇ１の選択を受け付け、異なる複数のパッケージ画像を生成しても良い。 The server 1 accepts a selection of multiple network parts g1 of the network g. Each network part g1 of the network g is classified based on the resolution, for example, into low resolution styles (Coarse styles) from 4x4 to 8x8, medium resolution styles (Middle styles) from 16x16 to 32x32, and high resolution styles (Fine styles) from 64x64 to 1024x1024. For example, the server 1 may accept a selection of any one of the low resolution styles, medium resolution styles, or high resolution styles. Alternatively, the server 1 may accept a selection of multiple network parts g1 and generate multiple different package images.

サーバ１は、選択された各ネットワーク部ｇ１に、スタイルに関する情報を入力する。スタイルに関する情報が入力された各ネットワーク部ｇ１は、該当する解像度スタイルに応じて得られたスタイルに関する情報を基本パッケージ画像に適応する。 The server 1 inputs style-related information to each selected network unit g1. Each network unit g1 to which the style-related information has been input applies the style-related information obtained according to the corresponding resolution style to the basic package image.

具体的には、低解像度スタイルに対応するネットワーク部ｇ１は、低解像度スタイル（４×４～８×８）に応じて得られたスタイルに関する情報を基本パッケージ画像に適応してパッケージ画像を生成する。中解像度スタイルに対応するネットワーク部ｇ１は、中解像度スタイル（１６×１６～３２×３２）に応じて得られたスタイルに関する情報を基本パッケージ画像に適応してパッケージ画像を生成する。高解像度スタイルに対応するネットワーク部ｇ１は、高解像度スタイル（６４×６４～１０２４×１０２４）に応じて得られたスタイルに関する情報を基本パッケージ画像に適応してパッケージ画像を生成する。 Specifically, the network section g1 corresponding to the low resolution style generates a package image by applying style-related information obtained according to the low resolution style (4x4 to 8x8) to the basic package image. The network section g1 corresponding to the medium resolution style generates a package image by applying style-related information obtained according to the medium resolution style (16x16 to 32x32) to the basic package image. The network section g1 corresponding to the high resolution style generates a package image by applying style-related information obtained according to the high resolution style (64x64 to 1024x1024) to the basic package image.

なお、生成モデル１７２は、ＳｔｙｌｅＧａｎの手法により構築されることに限定せず、例えばＰＧＧＡＮ（Progressive Growing GAN）、ＤＣＧＡＮ（Deep Convolutional GAN）またはＣｙｃｌｅＧＡＮ等の手法により構築されても良い。なお、その他ニューラルスタイル変換を用いても良い。 The generative model 172 is not limited to being constructed using the StyleGan method, but may be constructed using methods such as PGGAN (Progressive Growing GAN), DCGAN (Deep Convolutional GAN), or CycleGAN. Other neural style transformations may also be used.

なお、本実施の形態では生成モデル１７２の生成（学習）手法としてＧＡＮを用いるが、生成モデル１７２はＧＡＮに係る学習済みモデルに限定されず、その他ＶＡＥ（Variational Auto Encoder）、Ｕ－ＮＥＴ（Ｕ字型のニューラルネットワーク）等の深層学習、決定木等の学習手法による学習済みモデルであっても良い。 In this embodiment, GAN is used as the generation (learning) method for the generative model 172, but the generative model 172 is not limited to a trained model related to GAN, and may be a trained model using other deep learning methods such as VAE (Variational Auto Encoder) and U-NET (U-shaped neural network), decision trees, etc.

サーバ１は、上述した生成モデル１７２を用いて候補となるパッケージ画像を複数生成した場合、パッケージ画像の好意度に基づいて、複数のパッケージ画像から提案すべきパッケージ画像を抽出する。サーバ１は、抽出した提案すべきパッケージ画像を端末２に出力する。なお、提案すべきパッケージ画像の出力処理に関しては、実施形態１と同様であるため、説明を省略する。 When the server 1 generates multiple candidate package images using the above-mentioned generative model 172, it extracts a package image to be proposed from the multiple package images based on the package image's favorability. The server 1 outputs the extracted package image to be proposed to the terminal 2. Note that the output process of the package image to be proposed is the same as in embodiment 1, and therefore will not be described here.

図１４は、基本パッケージ画像の受付画面の一例を示す説明図である。該画面は、画像表示欄１５ａ、画像選択ボタン１５ｂ、削除ボタン１５ｃ、画像名入力欄１５ｄ及び進むボタン１５ｅを含む。 Figure 14 is an explanatory diagram showing an example of a basic package image reception screen. The screen includes an image display field 15a, an image selection button 15b, a delete button 15c, an image name input field 15d, and a proceed button 15e.

画像表示欄１５ａは、元画像となる基本パッケージ画像を表示する表示欄である。画像選択ボタン１５ｂは、基本パッケージ画像を選択するボタンである。削除ボタン１５ｃは、選択された基本パッケージ画像を削除するボタンである。画像名入力欄１５ｄは、基本パッケージ画像の名称を入力するテキストフィールドである。進むボタン１５ｅは、スタイル画像の受付画面（図１５）に遷移するボタンである。 The image display field 15a is a display field that displays the basic package image that is to be the original image. The image selection button 15b is a button for selecting a basic package image. The delete button 15c is a button for deleting the selected basic package image. The image name input field 15d is a text field for inputting the name of the basic package image. The forward button 15e is a button for transitioning to a style image reception screen (Figure 15).

端末２は、画像選択ボタン１５ｂのタッチ操作を受け付けた場合、記憶部２２に記憶されている基本パッケージ画像の選択を受け付ける。端末２は、受け付けた基本パッケージ画像を画像表示欄１５ａに表示する。端末２は、削除ボタン１５ｃのタッチ操作を受け付けた場合、画像表示欄１５ａに表示されている基本パッケージ画像を削除する。 When the terminal 2 receives a touch operation of the image selection button 15b, it receives the selection of a basic package image stored in the memory unit 22. The terminal 2 displays the received basic package image in the image display field 15a. When the terminal 2 receives a touch operation of the delete button 15c, it deletes the basic package image displayed in the image display field 15a.

端末２は、画像名入力欄１５ｄの入力操作を受け付けた場合、基本パッケージ画像の名称の入力を受け付ける。端末２は、進むボタン１５ｅのタッチ操作を受け付けた場合、画像選択ボタン１５ｂにより選択された基本パッケージ画像、及び画像名入力欄１５ｄにより入力された基本パッケージ画像の名称をスタイル画像の受付画面（図１５）に受け渡し、スタイル画像の受付画面に遷移する。 When the terminal 2 receives an input operation in the image name input field 15d, it receives the input of the name of the basic package image. When the terminal 2 receives a touch operation of the forward button 15e, it passes the basic package image selected by the image selection button 15b and the name of the basic package image input in the image name input field 15d to the style image reception screen (Figure 15), and transitions to the style image reception screen.

図１５は、スタイル画像の受付画面の一例を示す説明図である。該画面は、スタイル選択コンボボックス１６ａ、画像表示欄１６ｂ、進むボタン１６ｃ及び戻るボタン１６ｄを含む。 Figure 15 is an explanatory diagram showing an example of a style image reception screen. The screen includes a style selection combo box 16a, an image display field 16b, a forward button 16c, and a back button 16d.

スタイル選択コンボボックス１６ａは、スタイル（風景、国、文化、季節、海、山、島または街等）を選択するコンボボックスである。画像表示欄１６ｂは、スタイル画像を表示する表示欄である。なお、スタイル毎に、予め選択可能なスタイル画像の枚数（例えば、５枚）は決められても良い。進むボタン１６ｃは、パッケージ画像生成の確定画面（図１６）に遷移するボタンである。戻るボタン１６ｄは、基本パッケージ画像の受付画面（図１４）に遷移するボタンである。 The style selection combo box 16a is a combo box for selecting a style (landscape, country, culture, season, sea, mountain, island, city, etc.). The image display field 16b is a display field for displaying style images. Note that the number of selectable style images (e.g., five) may be determined in advance for each style. The forward button 16c is a button for transitioning to a confirmation screen for package image generation (Figure 16). The back button 16d is a button for transitioning to a basic package image reception screen (Figure 14).

端末２は、スタイル選択コンボボックス１６ａの選択操作を受け付けた場合、選択されたスタイルに応じて、該スタイルに対応する複数のスタイル画像を記憶部２２または外部装置から取得する。端末２は、取得したそれぞれのスタイル画像を該当する画像表示欄１６ｂに表示する。図示のように、「風景」スタイルが選択された場合、サーバ１は、「風景」スタイルに対応する５枚のスタイル画像を取得し、取得したそれぞれのスタイル画像を該当する画像表示欄１６ｂに表示する。端末２は、選択対象となるスタイル画像に対応する画像表示欄１６ｂのタッチ操作を受け付けた場合、該当する画像表示欄１６ｂに選択済みとなる状態を示す枠を重畳して表示する。 When the terminal 2 receives a selection operation on the style selection combo box 16a, it acquires multiple style images corresponding to the selected style from the storage unit 22 or an external device. The terminal 2 displays each acquired style image in the corresponding image display field 16b. As shown in the figure, when the "Landscape" style is selected, the server 1 acquires five style images corresponding to the "Landscape" style and displays each acquired style image in the corresponding image display field 16b. When the terminal 2 receives a touch operation on the image display field 16b corresponding to the style image to be selected, it displays a frame indicating a selected state superimposed on the corresponding image display field 16b.

端末２は、進むボタン１６ｃのタッチ操作を受け付けた場合、基本パッケージ画像の受付画面（図１４）から受け渡された基本パッケージ画像及び基本パッケージ画像の名称、並びに、スタイル選択コンボボックス１６ａにより選択されたスタイルの名称、及び画像表示欄１６ｂにより選択された単一または複数のスタイル画像をパッケージ画像生成の確定画面（図１６）に受け渡し、パッケージ画像生成の確定画面に遷移する。端末２は、戻るボタン１６ｄのタッチ操作を受け付けた場合、基本パッケージ画像の受付画面（図１４）に遷移する。 When terminal 2 receives a touch operation of forward button 16c, it transfers the basic package image and the name of the basic package image transferred from the basic package image reception screen (Fig. 14), as well as the name of the style selected in style selection combo box 16a, and the single or multiple style images selected in image display field 16b, to the package image generation confirmation screen (Fig. 16), and transitions to the package image generation confirmation screen. When terminal 2 receives a touch operation of back button 16d, it transitions to the basic package image reception screen (Fig. 14).

図１６は、パッケージ画像生成の確定画面の一例を示す説明図である。該画面は、基本パッケージ画像表示欄１７ａ、スタイル画像表示欄１７ｂ、スタイル表示欄１７ｃ、スタートボタン１７ｄ及び戻るボタン１７ｅを含む。 Figure 16 is an explanatory diagram showing an example of a confirmation screen for generating a package image. The screen includes a basic package image display field 17a, a style image display field 17b, a style display field 17c, a start button 17d, and a back button 17e.

基本パッケージ画像表示欄１７ａは、基本パッケージ画像を表示する表示欄である。スタイル画像表示欄１７ｂは、スタイル画像を表示する表示欄である。スタイル表示欄１７ｃは、スタイルの名称を表示する表示欄である。スタートボタン１７ｄは、候補となるパッケージ画像を生成するボタンである。戻るボタン１７ｅは、スタイル画像の受付画面（図１５）に遷移するボタンである。 The basic package image display field 17a is a display field that displays a basic package image. The style image display field 17b is a display field that displays a style image. The style display field 17c is a display field that displays the name of the style. The start button 17d is a button that generates a candidate package image. The back button 17e is a button that transitions to a style image reception screen (Figure 15).

端末２は、スタイル画像の受付画面（図１５）から受け渡された基本パッケージ画像、スタイルの名称及び該スタイルに応じて選択されたスタイル画像を受け取る。端末２は、受け取った基本パッケージ画像を基本パッケージ画像表示欄１７ａに表示し、スタイルの名称をスタイル表示欄１７ｃに表示し、選択されたスタイル画像をスタイル画像表示欄１７ｂに表示する。 The terminal 2 receives the basic package image, the style name, and the style image selected according to the style, which are passed from the style image reception screen (Figure 15). The terminal 2 displays the received basic package image in the basic package image display field 17a, displays the style name in the style display field 17c, and displays the selected style image in the style image display field 17b.

端末２は、スタートボタン１７ｄのタッチ操作を受け付けた場合、選択された基本パッケージ画像及びスタイル画像をサーバ１に送信する。サーバ１は、端末２から送信された基本パッケージ画像及びスタイル画像に基づいて、生成モデル１７２を用いて候補となるパッケージを複数生成する。サーバ１は、生成した複数の候補となるパッケージ画像を好意度予測モデル１７１に入力して、各パッケージ画像の好意度に関する情報を出力する。 When the terminal 2 receives a touch operation of the start button 17d, it transmits the selected basic package image and style image to the server 1. The server 1 generates multiple candidate packages using the generation model 172 based on the basic package image and style image transmitted from the terminal 2. The server 1 inputs the multiple generated candidate package images to the favorability prediction model 171 and outputs information related to the favorability of each package image.

サーバ１は、候補となる各パッケージ画像、各パッケージ画像に対応するＩＤ及び好意度を端末２に送信する。端末２は、サーバ１から送信された各パッケージ画像、各パッケージ画像に対応するＩＤ及び好意度を受信してパッケージ画像の表示画面（図１７）に受け渡し、パッケージ画像の表示画面に遷移する。 The server 1 transmits each of the candidate package images, the ID corresponding to each package image, and the favorability level to the terminal 2. The terminal 2 receives each of the package images, the ID corresponding to each package image, and the favorability level transmitted from the server 1, passes them to the package image display screen (FIG. 17), and transitions to the package image display screen.

端末２は、戻るボタン１７ｅのタッチ操作を受け付けた場合、スタイル画像の受付画面（図１５）に遷移する。 When the device 2 receives a touch operation of the back button 17e, it transitions to a style image reception screen (Figure 15).

図１７は、パッケージ画像の表示画面の一例を示す説明図である。該画面は、画像ＩＤ表示欄１８ａ、画像表示欄１８ｂ、好意度表示欄１８ｃ及び画像保存ボタン１８ｄを含む。 Figure 17 is an explanatory diagram showing an example of a display screen for a package image. The screen includes an image ID display field 18a, an image display field 18b, a favorability display field 18c, and an image save button 18d.

画像ＩＤ表示欄１８ａは、パッケージ画像のＩＤを表示する表示欄である。画像表示欄１８ｂは、パッケージ画像を表示する表示欄である。好意度表示欄１８ｃは、パッケージ画像の好意度に関する情報を表示する表示欄である。即ち、それぞれのパッケージ画像に対し、パッケージ画像のＩＤ、該パッケージ画像及び該パッケージ画像の好意度が同時表示される。画像保存ボタン１８ｄは、パッケージ画像を保存するボタンである。 The image ID display field 18a is a display field that displays the ID of the package image. The image display field 18b is a display field that displays the package image. The favorability display field 18c is a display field that displays information related to the favorability of the package image. That is, for each package image, the package image ID, the package image, and the favorability of the package image are simultaneously displayed. The image save button 18d is a button that saves the package image.

端末２は、候補となる各パッケージ画像、各パッケージ画像のＩＤ及び好意度をサーバ１から取得して画面に表示する。具体的には、端末２は、各パッケージ画像のＩＤを該当する画像ＩＤ表示欄１８ａに表示し、各パッケージ画像を該当する画像表示欄１８ｂに表示し、各パッケージ画像の好意度を該当する好意度表示欄１８ｃに表示する。 The terminal 2 acquires each candidate package image, its ID, and its favorability from the server 1, and displays them on the screen. Specifically, the terminal 2 displays the ID of each package image in the corresponding image ID display field 18a, displays each package image in the corresponding image display field 18b, and displays the favorability of each package image in the corresponding favorability display field 18c.

端末２は、画像保存ボタン１８ｄのタッチ操作を受け付けた場合、画像表示欄１８ｂに表示されている複数のパッケージ画像を、指定された画像フォーマット（例えば、ｂｍｐ、ｔｉｆｆ、ｊｐｇ、ｊｐｅｇ、ｇｉｆ、またはｐｎｇ）で記憶部２２または外部記憶装置に記憶する。 When the terminal 2 receives a touch operation of the image save button 18d, it stores the multiple package images displayed in the image display field 18b in the memory unit 22 or an external storage device in the specified image format (e.g., bmp, tiff, jpg, jpeg, gif, or png).

図１８は、スタイル画像を用いてパッケージ画像を生成する際の処理手順を示すフローチャートである。端末２の制御部２１は、基本パッケージ画像の選択を入力部２４により受け付ける（ステップＳ２２１）。制御部２１は、スタイルの選択を入力部２４により受け付ける（ステップＳ２２２）。制御部２１は、受け付けたスタイルに応じて、該スタイルに対応する複数のスタイル画像を記憶部２２から取得する（ステップＳ２２３）。制御部２１は、取得した複数のスタイル画像を表示部２５により表示する（ステップＳ２２４）。 Figure 18 is a flowchart showing the processing steps when generating a package image using a style image. The control unit 21 of the terminal 2 accepts the selection of a basic package image through the input unit 24 (step S221). The control unit 21 accepts the selection of a style through the input unit 24 (step S222). Depending on the accepted style, the control unit 21 acquires multiple style images corresponding to the style from the memory unit 22 (step S223). The control unit 21 displays the acquired multiple style images on the display unit 25 (step S224).

制御部２１は、表示されている複数のスタイル画像の中から、変換対象となるスタイル画像の選択を入力部２４により受け付ける（ステップＳ２２５）。制御部２１は、受け付けた基本パッケージ画像、及び変換対象となるスタイル画像を通信部２３によりサーバ１に送信する（ステップＳ２２６）。サーバ１の制御部１１は、端末２から送信された基本パッケージ画像及びスタイル画像を通信部１３により受信する（ステップＳ１２１）。 The control unit 21 receives a selection of a style image to be converted from among the multiple style images displayed through the input unit 24 (step S225). The control unit 21 transmits the received basic package image and the style image to be converted to the server 1 through the communication unit 23 (step S226). The control unit 11 of the server 1 receives the basic package image and the style image transmitted from the terminal 2 through the communication unit 13 (step S121).

制御部１１は、受信した基本パッケージ画像及びスタイル画像に基づいて、生成モデル１７２を用いて候補となるパッケージ画像を複数生成する（ステップＳ１２２）。制御部１１は、生成した複数のパッケージ画像を好意度予測モデル１７１に入力して、各パッケージ画像の好意度に関する情報を出力（取得）する（ステップＳ１２３）。 Based on the received basic package image and style image, the control unit 11 generates multiple candidate package images using the generation model 172 (step S122). The control unit 11 inputs the multiple generated package images to the favorability prediction model 171 and outputs (obtains) information regarding the favorability of each package image (step S123).

サーバ１は、候補となる各パッケージ画像、各パッケージ画像に対応するＩＤ及び好意度を通信部１３により端末２に送信する（ステップＳ１２４）。端末２の制御部２１は、サーバ１から送信された各パッケージ画像、各パッケージ画像に対応するＩＤ及び好意度を通信部２３により受信する（ステップＳ２２７）。制御部２１は、受信した各パッケージ画像、各パッケージ画像に対応するＩＤ及び好意度を表示部２５により表示し（ステップＳ２２８）、処理を終了する。 The server 1 transmits each of the candidate package images, the ID corresponding to each package image, and the favorability level to the terminal 2 via the communication unit 13 (step S124). The control unit 21 of the terminal 2 receives each of the package images, the ID corresponding to each package image, and the favorability level transmitted from the server 1 via the communication unit 23 (step S227). The control unit 21 displays each of the received package images, the ID corresponding to each package image, and the favorability level on the display unit 25 (step S228), and ends the process.

本実施形態によると、スタイル画像を用いて生成モデル１７２を構築することが可能となる。 According to this embodiment, it is possible to construct a generative model 172 using a style image.

本実施形態によると、生成モデル１７２を用いて候補となるパッケージ画像を出力することにより、スタイルに基づき多様なパッケージデザインを実現することが可能となる。 In this embodiment, by using the generative model 172 to output candidate package images, it is possible to realize a variety of package designs based on style.

（実施形態３）
実施形態３は、候補となるパッケージ画像に基づき、人工知能を用いて該パッケージ画像のイメージワードの出現割合を出力する形態に関する。なお、実施形態１～２と重複する内容については説明を省略する。 (Embodiment 3)
The third embodiment relates to a form in which the appearance rate of image words of a package image is output using artificial intelligence based on the package image of a candidate. Note that a description of the contents that overlap with the first and second embodiments will be omitted.

図１９は、実施形態３のサーバ１の構成例を示すブロック図である。なお、図１２と重複する内容については説明を省略する。大容量記憶部１７には、出現割合予測モデル１７３（第２学習モデル）が記憶されている。出現割合予測モデル１７３は、パッケージ画像に基づいてパッケージ画像のイメージワードの出現割合を予測する予測器であり、機械学習により生成された学習済みモデルである。 Fig. 19 is a block diagram showing an example of the configuration of the server 1 of the third embodiment. Note that a description of the contents overlapping with Fig. 12 will be omitted. The mass storage unit 17 stores an appearance rate prediction model 173 (second learning model). The appearance rate prediction model 173 is a predictor that predicts the appearance rate of image words of a package image based on the package image, and is a learned model generated by machine learning.

イメージワードは、パッケージ画像の評価時に好意度の理由を表すための文字（テキスト）である。イメージワードは、例えば「おいしそう」（非飲食系はなし）、「かわいい」、「シンプル」、「デザイン要素がよい」、「なつかしい」、「やさしい」、「安心感・信頼感がある」、「季節感」、「健康感がある」、「効果・効能を感じる」、「高級感・上質感」、「色味がよい」、「新しい・ユニーク」、「洗練」、「爽やか・清涼感」、「特徴がわかりやすい」、「目立つ・印象に残る」、「綺麗・美しい」または「清潔」等を含む。なお、上述したイメージワードに限定せず、実際のニーズに応じて任意のイメージワードを設定すれば良い。 Image words are characters (text) used to express reasons for favorability when evaluating a package image. Image words include, for example, "delicious" (none for non-food and drink products), "cute", "simple", "good design elements", "nostalgic", "gentle", "sense of security and trust", "seasonal", "healthy", "feels effective", "luxurious and high-quality", "good color", "new and unique", "sophisticated", "refreshing and cool", "easy to understand characteristics", "noticeable and memorable", "pretty and pretty" or "clean". Note that image words are not limited to those mentioned above, and any image words can be set according to actual needs.

続いて、イメージワードの抽出処理を説明する。サーバ１は、パッケージ画像を評価した評価テキスト集合を取得する。評価テキスト集合は、例えばパッケージ画像に対し、任意のフォーマットで書き込まれたレビューまたは感想等の集合であっても良く、またはアンケートによる回答から集合された評価情報であっても良い。評価テキスト集合が記憶部１２または大容量記憶部１７に記憶された場合、サーバ１は、記憶部１２または大容量記憶部１７から評価テキスト集合を取得する。なお、サーバ１は評価テキスト集合を外部装置から取得しても良い。 Next, the image word extraction process will be described. The server 1 acquires a set of evaluation texts that evaluate the package image. The set of evaluation texts may be, for example, a set of reviews or impressions written in any format about the package image, or it may be evaluation information collected from responses to a questionnaire. If the set of evaluation texts is stored in the memory unit 12 or the mass storage unit 17, the server 1 acquires the set of evaluation texts from the memory unit 12 or the mass storage unit 17. The server 1 may also acquire the set of evaluation texts from an external device.

サーバ１は、取得した評価テキスト集合からキーワードのリストを抽出する。例えば、評価テキスト集合から、出現頻度が所定の出現頻度（例えば、１０００回）以上であるキーワードのリストが抽出されても良い。サーバ１は、抽出したキーワードのリストから、ネガティブワードまたは意味不明ワードを排除する。ネガティブワードは、例えば「悲しい」、「苦しい」、「不満だ」、「不味い」、「困る」または「困惑する」等を含む。例えば制御部１１は、抽出したキーワードのリストに対して形態素解析を行い、さらに、ｔｆ－ｉｄｆ（Term Frequency－Inverse Document Frequency）等、キーワードのリストに出現する各単語をスコアリングするアルゴリズムを利用し、ネガティブワードまたは意味不明ワードを抽出する。 The server 1 extracts a list of keywords from the acquired set of evaluation texts. For example, a list of keywords whose frequency of occurrence is equal to or greater than a predetermined frequency (e.g., 1000 times) may be extracted from the set of evaluation texts. The server 1 eliminates negative words or meaningless words from the extracted list of keywords. Negative words include, for example, "sad," "painful," "unsatisfactory," "tasteless," "troubled," or "confused." For example, the control unit 11 performs a morphological analysis on the extracted list of keywords, and further uses an algorithm for scoring each word that appears in the list of keywords, such as tf-idf (Term Frequency-Inverse Document Frequency), to extract negative words or meaningless words.

サーバ１は、抽出したキーワードのリストから、抽出したネガティブワードまたは意味不明ワードを排除する。サーバ１は、ネガティブワード及び意味不明ワードを排除したキーワードのリストに対し、例えばＷｏｒｄ２ｖｅｃ、Ｄｏｃ２Ｖｅｃ(Paragraph2vec)、ＬＤＡ（Latent Dirichlet Allocation）またはＮＴＳＧ（Neural Tensor Skip Gram）等のような単語をベクトル化する公知の手法を用いて、キーワードのリストに含まれる各々のキーワードをベクトル化する。 The server 1 removes the extracted negative words or meaningless words from the list of extracted keywords. The server 1 vectorizes each keyword included in the list of keywords from which the negative words and meaningless words have been removed, using a known method for vectorizing words, such as Word2vec, Doc2Vec (Paragraph2vec), LDA (Latent Dirichlet Allocation), or NTSG (Neural Tensor Skip Gram).

サーバ１は、ベクトル化した各々のキーワードの特徴情報に基づいて、各々のキーワードを分類する（クラスタリング）。クラスタリング（clustering）とは、自然言語処理の一種で、ベクトル化したキーワードを、その方向と大きさから、同一の概念に近い単語の集団（クラスタ）に分ける処理である。サーバ１は、分類した各々のキーワードからパッケージ画像のイメージワードを抽出する。例えば、クラスタ毎にイメージワードの手動入力が受け付けられても良い。抽出されたイメージワードは、パッケージ画像に対応付けて後述の訓練データの生成時に利用される。 The server 1 classifies each keyword based on the feature information of each vectorized keyword (clustering). Clustering is a type of natural language processing in which vectorized keywords are divided into groups (clusters) of words that are close to the same concept based on their direction and size. The server 1 extracts image words for package images from each classified keyword. For example, manual input of image words for each cluster may be accepted. The extracted image words are associated with package images and used when generating training data, which will be described later.

続いて、出現割合予測モデル１７３を用いて、パッケージ画像のイメージワードの出現割合を出力する処理を説明する。
図２０は、出現割合予測モデル１７３を説明する説明図である。出現割合予測モデル１７３は、人工知能ソフトウェアの一部であるプログラムモジュールとして利用される。出現割合予測モデル１７３は、パッケージ画像を入力とし、該パッケージ画像のイメージワードの出現割合を出力とするニューラルネットワークを構築済みのイメージワードの出現割合の予測器である。 Next, a process of outputting the appearance ratio of image words in package images using the appearance ratio prediction model 173 will be described.
20 is an explanatory diagram for explaining the appearance ratio prediction model 173. The appearance ratio prediction model 173 is used as a program module that is a part of artificial intelligence software. The appearance ratio prediction model 173 is a predictor of the appearance ratio of image words in which a neural network has been constructed, which receives a package image as input and outputs the appearance ratio of image words in the package image.

サーバ１は、出現割合予測モデル１７３として、パッケージ画像のイメージワードの出現割合に影響を及ぼした箇所の特徴量を学習するディープラーニングを行うことで出現割合予測モデル１７３を構築する。例えば、出現割合予測モデル１７３はＣＮＮであり、パッケージ画像の入力を受け付ける入力層と、活性化関数（本実施の形態ではソフトマックス関数）を用いて該パッケージ画像のイメージワードの出現割合を出力する出力層（ソフトマックス層：Softmax Layer）と、バックプロパゲーションにより学習済の中間層とを有する。 The server 1 constructs the appearance rate prediction model 173 by performing deep learning to learn the feature quantities of the parts that influenced the appearance rate of the image word of the package image. For example, the appearance rate prediction model 173 is a CNN, and has an input layer that accepts the input of the package image, an output layer (softmax layer) that outputs the appearance rate of the image word of the package image using an activation function (softmax function in this embodiment), and an intermediate layer that has been trained by backpropagation.

なお、本実施の形態では出現割合予測モデル１７３がＣＮＮであるものとして説明するが、出現割合予測モデル１７３はＣＮＮに限定されず、ＣＮＮ以外のニューラルネットワーク、ＳＶＭ、ベイジアンネットワーク、または回帰木等の任意の学習アルゴリズムで構築された学習済みモデルであって良い。 In this embodiment, the occurrence rate prediction model 173 is described as being a CNN, but the occurrence rate prediction model 173 is not limited to a CNN and may be a trained model constructed with any learning algorithm, such as a neural network other than a CNN, an SVM, a Bayesian network, or a regression tree.

サーバ１は、パッケージ画像と該パッケージ画像のイメージワードとが対応付けられた訓練データの組み合わせを複数取得する。訓練データは、パッケージ画像に対し、該パッケージ画像のイメージワードがラベル付けされたデータである。 The server 1 acquires a plurality of combinations of training data in which package images are associated with image words of the package images. The training data is data in which package images are labeled with image words of the package images.

サーバ１は、取得した訓練データ用いて学習を行う。具体的には、サーバ１は、訓練データであるパッケージ画像を入力層に入力し、中間層での演算処理を経て、出力層から該パッケージ画像のイメージワードの出現割合を取得する。サーバ１は、出力層から出力された予測結果（例えば、出現割合が最も高いイメージワード）を、訓練データにおいてパッケージ画像に対しラベル付けされた情報（イメージワード）、すなわち正解値と比較し、出力層からの予測値が正解値に近づくように、中間層での演算処理に用いるパラメータを最適化する。 The server 1 performs learning using the acquired training data. Specifically, the server 1 inputs the packaging images, which are the training data, into the input layer, and acquires the occurrence rate of the image word of the packaging image from the output layer after arithmetic processing in the intermediate layer. The server 1 compares the prediction result output from the output layer (e.g., the image word with the highest occurrence rate) with the information (image word) labeled with the packaging image in the training data, i.e., the correct answer value, and optimizes the parameters used in the arithmetic processing in the intermediate layer so that the predicted value from the output layer approaches the correct answer value.

サーバ１は、訓練データに含まれる各パッケージ画像について上記の処理を行い、出現割合予測モデル１７３を生成する。これにより、例えばサーバ１は当該訓練データを用いて出現割合予測モデル１７３の学習を行うことで、パッケージ画像のイメージワードの出現割合を出力可能なモデルを構築することができる。 The server 1 performs the above process for each package image included in the training data to generate an appearance rate prediction model 173. As a result, for example, the server 1 can use the training data to train the appearance rate prediction model 173, thereby constructing a model capable of outputting the appearance rate of image words for package images.

サーバ１は、候補となるパッケージ画像を取得した場合、取得したパッケージ画像を出現割合予測モデル１７３に入力する。サーバ１は、出現割合予測モデル１７３の中間層にてパッケージ画像の特徴量を抽出する演算処理を行う。サーバ１は、抽出した特徴量を出現割合予測モデル１７３の出力層に入力して、該パッケージ画像のイメージワードの出現割合を出力として取得する。 When the server 1 acquires a candidate package image, it inputs the acquired package image to the appearance rate prediction model 173. The server 1 performs a calculation process to extract features of the package image in the middle layer of the appearance rate prediction model 173. The server 1 inputs the extracted features to the output layer of the appearance rate prediction model 173, and acquires the appearance rate of the image word of the package image as an output.

図示のように、パッケージ画像に対し、「かわいい」、「シンプル」、「季節感」、「…」、「健康感がある」それぞれの出現割合が、「０．８２」、「０．５５」、「０．３５」、「…」、「０．２５」である予測結果が出力される。 As shown in the figure, the predicted results are output that show the respective occurrence rates of "cute," "simple," "seasonal," "...," and "healthy" for the package image as "0.82," "0.55," "0.35," "...," and "0.25."

また、性別または年代ごとに出現割合予測モデル１７３を生成することができる。具体的には、サーバ１は、性別または年代の設定を受け付け、受け付けた性別または年代に応じて、出現割合予測モデル１７３の生成に用いる訓練データを複数取得する。サーバ１は、取得した訓練データを用いて、受け付けた性別または年代に対応する出現割合予測モデル１７３を生成する。 In addition, an appearance rate prediction model 173 can be generated for each gender or age group. Specifically, the server 1 accepts a setting of the gender or age group, and acquires multiple pieces of training data to be used for generating the appearance rate prediction model 173 according to the accepted gender or age group. The server 1 uses the acquired training data to generate an appearance rate prediction model 173 corresponding to the accepted gender or age group.

サーバ１は、性別または年代ごとに生成した出現割合予測モデル１７３を大容量記憶部１７に記憶する。サーバ１は、複数の性別または年代に対して用意された複数の出現割合予測モデル１７３を用いて、性別または年代ごとにパッケージ画像から該パッケージ画像のイメージワードの出現割合を出力することができる。 The server 1 stores the occurrence rate prediction model 173 generated for each gender or age group in the mass storage unit 17. The server 1 can output the occurrence rate of image words of a package image from the package image for each gender or age group using multiple occurrence rate prediction models 173 prepared for multiple genders or age groups.

なお、上述した処理に限るものではない。商品のカテゴリ毎に設定されたスコアを用いて、出現割合予測モデル１７３を学習させても良い。この場合、サーバ１は、パッケージ画像と、カテゴリ毎に設定されたスコアとを出現割合予測モデル１７３に入力し、該パッケージ画像のイメージワードの出現割合を出力する。 Note that the process is not limited to the above. The occurrence rate prediction model 173 may be trained using a score set for each product category. In this case, the server 1 inputs the package image and the score set for each category into the occurrence rate prediction model 173, and outputs the occurrence rate of the image word for the package image.

また、実施形態１での遺伝的アルゴリズムにより生成されたパッケージ画像に対し、特定のイメージワード（例えば、おいしそう）の出現割合に基づいて次世代の候補となるパッケージ画像を生成することができる。具体的には、サーバ１は、特定されたイメージワードを取得する。サーバ１は、遺伝的アルゴリズムにより生成された各パッケージ画像を出現割合予測モデル１７３に入力して、各パッケージ画像のイメージワードの出現割合を出力する。 Furthermore, for the package images generated by the genetic algorithm in embodiment 1, it is possible to generate package images that are candidates for the next generation based on the appearance rate of a specific image word (e.g., delicious). Specifically, the server 1 acquires the identified image word. The server 1 inputs each package image generated by the genetic algorithm to the appearance rate prediction model 173, and outputs the appearance rate of the image word for each package image.

サーバ１は、出現割合予測モデル１７３から出力されたイメージワードの出現割合に基づいて、特定されたイメージワードの出現割合が高いパッケージ画像を複数取得する。サーバ１は、取得した各パッケージ画像に含まれる複数のデザイン要素に基づいて交叉処理及び突然変異処理を行い、次世代の候補となるパッケージ画像を複数生成する。 The server 1 acquires multiple package images with a high occurrence rate of the identified image word based on the occurrence rate of the image word output from the occurrence rate prediction model 173. The server 1 performs crossover and mutation processes based on multiple design elements contained in each acquired package image, and generates multiple package images that are candidates for the next generation.

更にまた、実施形態２での生成モデル１７２を用いて生成されたパッケージ画像に対し、特定のイメージワードの出現割合に基づいて候補となるパッケージ画像を出力することができる。具体的には、サーバ１は、特定されたイメージワードを取得する。サーバ１は、生成モデル１７２を用いて生成された各パッケージ画像を出現割合予測モデル１７３に入力して、各パッケージ画像のイメージワードの出現割合を出力する。サーバ１は、出現割合予測モデル１７３から出力されたイメージワードの出現割合に基づいて、特定されたイメージワードの出現割合が高いパッケージ画像を、候補となるパッケージ画像として出力する。 Furthermore, for package images generated using the generation model 172 in embodiment 2, candidate package images can be output based on the appearance rate of a specific image word. Specifically, the server 1 acquires the identified image words. The server 1 inputs each package image generated using the generation model 172 to the appearance rate prediction model 173 and outputs the appearance rate of the image word for each package image. The server 1 outputs package images with a high appearance rate of the identified image word based on the appearance rate of the image word output from the appearance rate prediction model 173 as candidate package images.

本実施形態によると、出現割合予測モデル１７３を用いて候補となるパッケージ画像のイメージワードの出現割合を出力することが可能となる。 According to this embodiment, it is possible to output the appearance rate of image words of candidate package images using the appearance rate prediction model 173.

本実施形態によると、イメージワードの出現割合が高いパッケージ画像に基づいて次世代のパッケージ画像を生成することにより、明確な好意度の理由に応じて適当なパッケージデザインを企画することが可能となる。 According to this embodiment, by generating next-generation package images based on package images with a high occurrence rate of image words, it becomes possible to plan appropriate package designs based on clear reasons for favorability.

（実施形態４）
実施形態４は、候補となるパッケージ画像に基づき、人工知能を用いて該パッケージ画像の好意度の標準偏差を出力する形態に関する。なお、実施形態１～３と重複する内容については説明を省略する。 (Embodiment 4)
The fourth embodiment relates to a form in which, based on a candidate package image, artificial intelligence is used to output the standard deviation of the favorability of the package image. Note that a description of the contents that overlap with the first to third embodiments will be omitted.

パッケージ画像の好き嫌い（バラつき）については、性別または年代によって、同一パッケージ画像に対しても異なる好意度を評価した場合がある。このようなパッケージ画像の好き嫌いの差は、好意度の標準偏差で解釈される。好意度の標準偏差が高いほど好き嫌いの差が大きくなり、逆に、好意度の標準偏差が低いほど好き嫌いの差が小さくなる。 Regarding likes and dislikes (variation) of package images, there are cases where the same package image is rated with different degrees of favorability depending on gender or age. Such differences in likes and dislikes of package images can be interpreted as the standard deviation of favorability. The higher the standard deviation of favorability, the greater the difference in likes and dislikes; conversely, the lower the standard deviation of favorability, the smaller the difference in likes and dislikes.

図２１は、実施形態４のサーバ１の構成例を示すブロック図である。なお、図１９と重複する内容については説明を省略する。大容量記憶部１７には、標準偏差予測モデル１７４（第３学習モデル）が記憶されている。標準偏差予測モデル１７４は、パッケージ画像に基づいて該パッケージ画像の好意度の標準偏差を予測する予測器であり、機械学習により生成された学習済みモデルである。 Fig. 21 is a block diagram showing an example of the configuration of the server 1 of the fourth embodiment. Note that a description of the contents overlapping with Fig. 19 will be omitted. The mass storage unit 17 stores a standard deviation prediction model 174 (third learning model). The standard deviation prediction model 174 is a predictor that predicts the standard deviation of the favorability of a package image based on the package image, and is a learned model generated by machine learning.

図２２は、標準偏差予測モデル１７４を説明する説明図である。標準偏差予測モデル１７４は、人工知能ソフトウェアの一部であるプログラムモジュールとして利用される。標準偏差予測モデル１７４は、パッケージ画像を入力とし、該パッケージ画像の好意度の標準偏差を出力とするニューラルネットワークを構築済みの好意度の標準偏差の予測器である。 Figure 22 is an explanatory diagram explaining the standard deviation prediction model 174. The standard deviation prediction model 174 is used as a program module that is part of artificial intelligence software. The standard deviation prediction model 174 is a predictor of the standard deviation of favorability that has already constructed a neural network that takes a package image as input and outputs the standard deviation of the favorability of the package image.

サーバ１は、パッケージ画像の特徴量を学習するディープラーニングを行うことで標準偏差予測モデル１７４を構築する。例えば、標準偏差予測モデル１７４はＣＮＮであり、パッケージ画像の入力を受け付ける入力層と、該パッケージ画像の好意度の標準偏差を出力する出力層と、バックプロパゲーションにより学習済の中間層とを有する。 The server 1 constructs a standard deviation prediction model 174 by performing deep learning to learn the feature quantities of the package image. For example, the standard deviation prediction model 174 is a CNN, and has an input layer that accepts input of a package image, an output layer that outputs the standard deviation of the favorability of the package image, and an intermediate layer that has been trained by backpropagation.

なお、本実施の形態では標準偏差予測モデル１７４がＣＮＮであるものとして説明するが、標準偏差予測モデル１７４はＣＮＮに限定されず、ＣＮＮ以外のニューラルネットワーク、ＳＶＭ、ベイジアンネットワーク、または回帰木等の任意の学習アルゴリズムで構築された学習済みモデルであって良い。 In this embodiment, the standard deviation prediction model 174 is described as a CNN, but the standard deviation prediction model 174 is not limited to a CNN and may be a trained model constructed with any learning algorithm, such as a neural network other than a CNN, an SVM, a Bayesian network, or a regression tree.

サーバ１は、パッケージ画像と、複数人のパッケージ画像の好意度に関する情報に基づいて算出された標準偏差とが対応付けられた訓練データの組み合わせを複数取得する。訓練データは、パッケージ画像に対し、該パッケージ画像の好意度の標準偏差がラベル付けされたデータである。 The server 1 acquires multiple combinations of training data in which package images are associated with standard deviations calculated based on information about the likeability of the package images by multiple people. The training data is data in which package images are labeled with the standard deviation of the likeability of the package images.

訓練データの作成に関しては、例えばサーバ１は、１つのパッケージ画像に対して１０００人による回答の好意度を、最大５（好き）から最小１（嫌い）までの５段階で取得する。サーバ１は、取得した１０００人による回答の好意度の平均値を算出する。サーバ１は、算出した好意度の平均値に基づき、１０００人による回答の好意度の標準偏差を算出する。サーバ１は、パッケージ画像に対し、１０００人の該パッケージ画像の好意度の標準偏差をラベル付けすることにより、訓練データを作成することができる。 To create training data, for example, the server 1 obtains the favorability ratings of responses from 1,000 people for one package image on a five-point scale ranging from a maximum of 5 (like) to a minimum of 1 (dislike). The server 1 calculates the average value of the favorability ratings of the responses from the 1,000 people. The server 1 calculates the standard deviation of the favorability ratings of the responses from the 1,000 people based on the calculated average value of the favorability ratings. The server 1 can create training data by labeling a package image with the standard deviation of the favorability ratings of the package image from the 1,000 people.

サーバ１は、取得した訓練データを用いて学習を行う。具体的には、サーバ１は、訓練データであるパッケージ画像を入力層に入力し、中間層での演算処理を経て、出力層から該パッケージ画像の好意度の標準偏差を取得する。サーバ１は、出力層から出力された予測結果を、訓練データにおいてパッケージ画像に対しラベル付けされた情報、すなわち正解値と比較し、出力層からの予測値が正解値に近づくように、中間層での演算処理に用いるパラメータを最適化する。 The server 1 performs learning using the acquired training data. Specifically, the server 1 inputs the package images, which are the training data, into the input layer, and obtains the standard deviation of the favorability of the package images from the output layer after arithmetic processing in the intermediate layer. The server 1 compares the prediction result output from the output layer with the information labeled for the package images in the training data, i.e., the correct answer value, and optimizes the parameters used in the arithmetic processing in the intermediate layer so that the predicted value from the output layer approaches the correct answer value.

サーバ１は、訓練データに含まれる各パッケージ画像について上記の処理を行い、標準偏差予測モデル１７４を生成する。これにより、例えばサーバ１は当該訓練データを用いて標準偏差予測モデル１７４の学習を行うことで、パッケージ画像の好意度の標準偏差を出力可能なモデルを構築することができる。 The server 1 performs the above process for each package image included in the training data to generate a standard deviation prediction model 174. As a result, for example, the server 1 can use the training data to train the standard deviation prediction model 174, thereby constructing a model that can output the standard deviation of the favorability of package images.

サーバ１は、候補となるパッケージ画像を取得した場合、取得したパッケージ画像を標準偏差予測モデル１７４に入力する。サーバ１は、標準偏差予測モデル１７４の中間層にてパッケージ画像の特徴量を抽出する演算処理を行う。サーバ１は、抽出した特徴量を標準偏差予測モデル１７４の出力層に入力して、該パッケージ画像の好意度の標準偏差を出力として取得する。図示のように、「１．０１０」である好意度の標準偏差が標準偏差予測モデル１７４から出力される。 When the server 1 acquires a candidate package image, it inputs the acquired package image to the standard deviation prediction model 174. The server 1 performs a calculation process to extract features of the package image in the middle layer of the standard deviation prediction model 174. The server 1 inputs the extracted features to the output layer of the standard deviation prediction model 174, and acquires the standard deviation of the favorability of the package image as an output. As shown in the figure, the standard deviation of the favorability, which is "1.010", is output from the standard deviation prediction model 174.

また、性別または年代ごとに標準偏差予測モデル１７４を生成することができる。具体的には、サーバ１は、性別または年代の設定を受け付け、受け付けた性別または年代に応じて、標準偏差予測モデル１７４の生成に用いる訓練データを複数取得する。サーバ１は訓練データを用いて、受け付けた性別または年代に対応する標準偏差予測モデル１７４を生成する。 In addition, a standard deviation prediction model 174 can be generated for each gender or age group. Specifically, the server 1 accepts a setting of gender or age group, and acquires multiple pieces of training data to be used for generating the standard deviation prediction model 174 according to the accepted gender or age group. The server 1 uses the training data to generate a standard deviation prediction model 174 corresponding to the accepted gender or age group.

サーバ１は、性別または年代ごとに生成した標準偏差予測モデル１７４を大容量記憶部１７に記憶する。サーバ１は、複数の性別または年代に対して用意された複数の標準偏差予測モデル１７４を用いて、性別または年代ごとにパッケージ画像から該パッケージ画像の好意度の標準偏差を出力することができる。 The server 1 stores the standard deviation prediction model 174 generated for each gender or age group in the mass storage unit 17. The server 1 can output the standard deviation of the favorability of a package image from the package image for each gender or age group using multiple standard deviation prediction models 174 prepared for multiple genders or age groups.

なお、上述した処理に限るものではない。商品のカテゴリ毎に設定されたスコアを用いて、標準偏差予測モデル１７４を学習させても良い。この場合、サーバ１は、パッケージ画像と、カテゴリ毎に設定されたスコアとを標準偏差予測モデル１７４に入力し、該パッケージ画像の好意度の標準偏差を出力する。 Note that the process is not limited to the above. The standard deviation prediction model 174 may be trained using scores set for each product category. In this case, the server 1 inputs the package image and the score set for each category into the standard deviation prediction model 174, and outputs the standard deviation of the favorability of the package image.

また、実施形態１での遺伝的アルゴリズムにより生成されたパッケージ画像に対し、好意度の標準偏差に基づいて次世代の候補となるパッケージ画像を生成することができる。具体的には、サーバ１は、標準偏差予測モデル１７４から出力された好意度の標準偏差が小さいパッケージ画像を複数取得する。サーバ１は、取得した各パッケージ画像に含まれる複数のデザイン要素に基づいて交叉処理及び突然変異処理を行い、次世代の候補となるパッケージ画像を複数生成する。 Furthermore, for the package images generated by the genetic algorithm in embodiment 1, it is possible to generate package images that are candidates for the next generation based on the standard deviation of the favorability. Specifically, the server 1 acquires multiple package images with a small standard deviation of the favorability output from the standard deviation prediction model 174. The server 1 performs crossover processing and mutation processing based on multiple design elements contained in each acquired package image, and generates multiple package images that are candidates for the next generation.

更にまた、遺伝的アルゴリズムにより生成されたパッケージ画像に対し、好意度に関する情報と、特定のイメージワード（例えば、おいしそう）の出現割合と、好意度の標準偏差との組み合わせに基づいて、次世代の候補となるパッケージ画像を生成することができる。 Furthermore, for the package images generated by the genetic algorithm, it is possible to generate candidate package images for the next generation based on a combination of information about the likeability level, the frequency of occurrence of specific image words (e.g., delicious), and the standard deviation of the likeability level.

具体的には、先ず、サーバ１は、好意度予測モデル１７１から出力された好意度の高いパッケージ画像を複数取得する。次に、サーバ１は、取得した複数の好意度の高いパッケージ画像を出現割合予測モデル１７３に入力して、イメージワードの出現割合を出力する。サーバ１は、出現割合予測モデル１７３から出力されたイメージワードの出現割合に基づいて、特定されたイメージワード（例えば、おいしそう）の出現割合が高いパッケージ画像を複数取得する。 Specifically, first, the server 1 acquires multiple package images with high favorability output from the favorability prediction model 171. Next, the server 1 inputs the multiple acquired package images with high favorability into the appearance rate prediction model 173 and outputs the appearance rate of the image word. Based on the appearance rate of the image word output from the appearance rate prediction model 173, the server 1 acquires multiple package images with a high appearance rate of a specified image word (e.g., delicious).

そして、サーバ１は、取得した複数のパッケージ画像を標準偏差予測モデル１７４に入力して、各パッケージ画像の好意度の標準偏差を出力する。サーバ１は、標準偏差予測モデル１７４から出力された好意度の標準偏差に基づいて、好意度の標準偏差が小さいパッケージ画像を複数取得する。サーバ１は、取得した複数のパッケージ画像を次世代の候補となるパッケージ画像として出力する。 Then, the server 1 inputs the multiple package images acquired into the standard deviation prediction model 174 and outputs the standard deviation of the favorability of each package image. The server 1 acquires multiple package images with a small standard deviation of the favorability based on the standard deviation of the favorability output from the standard deviation prediction model 174. The server 1 outputs the multiple acquired package images as package images that will be candidates for the next generation.

なお、上述した例は、好意度に関する情報、イメージワードの出現割合、好意度の標準偏差の順に、候補となるパッケージ画像の取得処理が行われたが、取得処理の順序は、あくまでも一例であって、任意の順序で行われても良い。なお、好意度に関する情報、イメージワードの出現割合及び好意度の標準偏差の取得処理は同時に実行されても良い。この場合、好意度に関する情報、イメージワードの出現割合及び好意度の標準偏差の総合的な結果に基づいて、候補となるパッケージ画像が取得される。 In the above example, the process of acquiring candidate package images was performed in the order of information on favorability, the occurrence rate of image words, and the standard deviation of favorability, but the order of the acquisition process is merely an example and may be performed in any order. The process of acquiring information on favorability, the occurrence rate of image words, and the standard deviation of favorability may be performed simultaneously. In this case, candidate package images are acquired based on the overall result of information on favorability, the occurrence rate of image words, and the standard deviation of favorability.

更にまた、好意度に関する情報、イメージワードの出現割合及び好意度の標準偏差のうちの任意の２つの組み合わせに基づいて、次世代の候補となるパッケージ画像を生成することができる。例えば、好意度に関する情報と好意度の標準偏差との組み合わせを利用した場合、先ず、サーバ１は、好意度予測モデル１７１から出力された好意度の高いパッケージ画像を複数取得する。 Furthermore, it is possible to generate package images that are candidates for the next generation based on a combination of any two of the information on favorability, the occurrence rate of image words, and the standard deviation of favorability. For example, when using a combination of information on favorability and the standard deviation of favorability, the server 1 first obtains multiple package images with high favorability output from the favorability prediction model 171.

次に、サーバ１は、取得した複数の好意度の高いパッケージ画像を標準偏差予測モデル１７４に入力して、各パッケージ画像の好意度の標準偏差を出力する。サーバ１は、標準偏差予測モデル１７４から出力された好意度の標準偏差に基づいて、好意度の標準偏差が小さいパッケージ画像を複数取得する。サーバ１は、取得した複数のパッケージ画像を次世代の候補となるパッケージ画像として出力する。 Next, the server 1 inputs the multiple package images with high favorability obtained into the standard deviation prediction model 174 and outputs the standard deviation of the favorability of each package image. The server 1 obtains multiple package images with a small standard deviation of the favorability based on the standard deviation of the favorability output from the standard deviation prediction model 174. The server 1 outputs the multiple package images obtained as package images that will be candidates for the next generation.

更にまた、実施形態２での生成モデル１７２を用いて生成されたパッケージ画像に対し、好意度の標準偏差に基づいて候補となるパッケージ画像を出力することができる。具体的には、サーバ１は、生成モデル１７２を用いて生成された各パッケージ画像を標準偏差予測モデル１７４に入力して、各パッケージ画像の好意度の標準偏差を出力する。サーバ１は、標準偏差予測モデル１７４から出力された好意度の標準偏差が小さいパッケージ画像を、候補となるパッケージ画像として出力する。 Furthermore, for package images generated using the generation model 172 in embodiment 2, candidate package images can be output based on the standard deviation of the favorability. Specifically, the server 1 inputs each package image generated using the generation model 172 to the standard deviation prediction model 174, and outputs the standard deviation of the favorability of each package image. The server 1 outputs package images with a small standard deviation of the favorability output from the standard deviation prediction model 174 as candidate package images.

本実施形態によると、標準偏差予測モデル１７４を用いて候補となるパッケージ画像の好意度の標準偏差を出力することが可能となる。 According to this embodiment, it is possible to output the standard deviation of the favorability of candidate package images using the standard deviation prediction model 174.

本実施形態によると、好意度の標準偏差が小さいパッケージ画像に基づいて次世代のパッケージ画像を生成することにより、パッケージ画像の好き嫌いの差を考慮して適当なパッケージデザインを企画することが可能となる。 According to this embodiment, by generating next-generation package images based on package images with a small standard deviation of favorability, it becomes possible to plan appropriate package designs that take into account differences in likes and dislikes of package images.

今回開示された実施形態はすべての点で例示であって、制限的なものではないと考えられるべきである。本発明の範囲は、上記した意味ではなく、特許請求の範囲によって示され、特許請求の範囲と均等の意味及び範囲内でのすべての変更が含まれることが意図される。 The embodiments disclosed herein are illustrative in all respects and should not be considered limiting. The scope of the present invention is indicated by the claims, not by the above meaning, and is intended to include all modifications within the meaning and scope of the claims.

１情報処理装置（サーバ）
１１制御部
１２記憶部
１３通信部
１４入力部
１５表示部
１６読取部
１７大容量記憶部
１７１好意度予測モデル
１７２生成モデル
１７３出現割合予測モデル
１７４標準偏差予測モデル
１ａ可搬型記憶媒体
１ｂ半導体メモリ
１Ｐ制御プログラム
２情報処理端末（端末）
２１制御部
２２記憶部
２３通信部
２４入力部
２５表示部
２６撮影部
２Ｐ制御プログラム 1. Information processing device (server)
REFERENCE SIGNS LIST 11 Control unit 12 Storage unit 13 Communication unit 14 Input unit 15 Display unit 16 Reading unit 17 Large-capacity storage unit 171 Favourability prediction model 172 Generation model 173 Appearance ratio prediction model 174 Standard deviation prediction model 1a Portable storage medium 1b Semiconductor memory 1P Control program 2 Information processing terminal (terminal)
21 Control unit 22 Memory unit 23 Communication unit 24 Input unit 25 Display unit 26 Photography unit 2P Control program

Claims

Accept the number of design elements and the number of designs included in the candidate package images;
outputting a design element selection field for selecting a design element from the number of accepted design elements;
outputting a design selection field for selecting a design from the number of designs received, in correspondence with each design element selection field;
generating a plurality of candidate package images;
A first learning model that has been trained to output information regarding the favorability of a package image when the package image is input, inputting the generated package images into the first learning model and outputting information regarding the favorability of each package image;
A program for causing a computer to execute a process of outputting a package image to be proposed from the plurality of package images based on the output information on the degree of favorability.

A plurality of designs are registered for a plurality of design elements included in the package image,
The program according to claim 1 , further comprising: generating a plurality of candidate package images by rearranging designs for each design element.

Obtain multiple highly favorable package images,
3. The program according to claim 1 or 2, which executes a process of rearranging the designs of a plurality of design elements contained in the plurality of acquired package images based on a genetic algorithm to generate a plurality of package images that are candidates for the next generation.

outputting information regarding each package image to be proposed, an ID of the package image, and a favorability level of the package image;
4. The program according to claim 1, further comprising a process for generating a plurality of package images as candidates for the next generation when a generation request is received for the output package image.

A basic package image serving as an original image and a plurality of style images for performing style conversion are obtained;
5. The program according to claim 1, further comprising: inputting the acquired basic package image and each of the style images into a generative model that has been trained to generate a package image based on a basic package image and a style image, and executing a process of generating a plurality of candidate package images.

Accepts style selection,
Output multiple style images corresponding to the accepted styles;
Accept the selection of multiple target style images from the multiple output style images,
The program according to claim 5 , which causes execution of a process of inputting the basic package image and a plurality of accepted style images to the generative model to generate a plurality of candidate package images.

7. The program according to claim 5 , further comprising: outputting information about each candidate package image, an ID of the package image, and a favorability level of the package image.

The generative model has a synthesis network in which a plurality of synthesis network units each having a convolution layer are connected,
8. The program according to claim 5 , which causes a selected one of the synthesis network units to input information about the style image to generate a package image.

9. The program according to claim 1, further comprising a process for outputting a package image having a high weighted average favorability level based on a weighted average favorability level of each package image output by the first learning model.

Obtain the score set for each product category,
inputting the obtained score and the plurality of candidate package images into the first learning model, and outputting information regarding the favorability of each package image;
10. The program according to claim 1, further comprising: outputting a package image to be proposed from the plurality of package image candidates based on the output information regarding the degree of favorability.

The program according to any one of claims 1 to 10, which causes a second learning model that has been trained to output the occurrence rate of image words of the package image when the package image is input to execute a process of inputting the plurality of candidate package images and outputting the occurrence rate of image words of each package image.

The program according to any one of claims 1 to 11, which causes a third learning model that has been trained to output the standard deviation of the favorability of the package image when the package image is input to execute a process of inputting the plurality of candidate package images and outputting the standard deviation of the favorability of each package image.

Accept the number of design elements and the number of designs included in the candidate package images;
outputting a design element selection field for selecting a design element from the number of accepted design elements;
outputting a design selection field for selecting a design from the number of designs received, in correspondence with each design element selection field;
generating a plurality of candidate package images;
A first learning model that has been trained to output information regarding the favorability of a package image when the package image is input, inputting the generated package images into the first learning model and outputting information regarding the favorability of each package image;
An information processing method in which a computer executes a process of outputting a package image to be proposed from the plurality of package images based on the output information on the degree of favorability.

An information processing device including a control unit,
The control unit is
Accept the number of design elements and the number of designs included in the candidate package images;
outputting a design element selection field for selecting a design element from the number of accepted design elements;
outputting a design selection field for selecting a design from the number of designs received, in correspondence with each design element selection field;
generating a plurality of candidate package images ;
A first learning model that has been trained to output information regarding the favorability of a package image when the package image is input, inputting the generated package images into the first learning model and outputting information regarding the favorability of each package image ;
A package image to be proposed is output from the plurality of package images based on the output information regarding the degree of favorability.
Information processing device.

Accept the selection of a set number of design elements;
Accepting a selection of a set number of designs corresponding to each of the accepted design elements;
Submit your design for the accepted design elements;
obtaining a plurality of package images extracted based on information regarding the favorability of the package images from a plurality of candidate package images generated by rearranging the designs of the design elements;
Displaying each of the acquired candidate package images, an ID of the package image, and information regarding the favorability of the package image;
A program that causes a computer to execute a process of, when a request for generating a package image that is a candidate for the next generation of the displayed package image is received, transmitting the received generation request.

Accept the selection of the basic package image that will be the original image,
Accepts style selection,
Obtain multiple style images corresponding to the accepted style;
Accepting selection of multiple target style images from the multiple acquired style images;
Send the accepted basic package image and multiple target style images,
obtaining a plurality of candidate package images generated based on the base package image and the target style image;
The program according to claim 15 , further comprising: displaying information about each of the acquired candidate package images, an ID of the package image, and a favorability level of the package image.

Obtain the score set for each product category,
Generate multiple candidate package images,
inputting the acquired score and the generated multiple package images into a first learning model that has been trained to output information regarding the favorability of the package image when a score and a package image are input, and outputting information regarding the favorability of each package image;
A package image to be proposed is output from the plurality of package images based on the output information regarding the degree of favorability.
A program that causes a computer to carry out processing.