JP7660845B2

JP7660845B2 - Attribute value extraction system, attribute value extraction method, and program

Info

Publication number: JP7660845B2
Application number: JP2021214566A
Authority: JP
Inventors: 圭司新里; ▲彦▼迪夏; 維▲徳▼ 陳; 直樹吉永
Original assignee: University of Tokyo NUC; Rakuten Group Inc
Current assignee: University of Tokyo NUC; Rakuten Group Inc
Priority date: 2021-12-28
Filing date: 2021-12-28
Publication date: 2025-04-14
Anticipated expiration: 2041-12-28
Also published as: JP2023098065A

Description

本開示は、属性値抽出システム、属性値抽出方法、及びプログラムに関する。 This disclosure relates to an attribute value extraction system, an attribute value extraction method, and a program.

従来、機械学習の手法を利用して、商品又はコンテンツ等のアイテムに属性値を付与する技術が知られている。非特許文献１及び非特許文献２には、第１アイテムに関する第１アイテムデータ及び第１クエリと、第１アイテムに関する属性値と、の関係が学習された属性値抽出モデルに相当するモデルに、第２アイテムに関する第２アイテムデータと、第２アイテムに関する属性である第２クエリと、を入力し、属性値抽出モデルに相当するモデルから出力された第２属性値などの固有表現に相当する情報を第２アイテムに付与する技術が記載されている。 Conventionally, there is known a technique for assigning attribute values to items such as products or content using a machine learning technique. Non-Patent Document 1 and Non-Patent Document 2 describe a technique in which second item data related to a second item and a second query, which is an attribute related to the second item, are input to a model equivalent to an attribute value extraction model that has learned the relationship between first item data and a first query related to a first item and an attribute value related to the first item, and information corresponding to a named entity such as a second attribute value output from the model equivalent to the attribute value extraction model is assigned to the second item.

Qifan Wang, Li Yang, Bhargav Kanagal, Sumit Sanghai, D.Sivakumar, Bin Shu, Zac Yu, and Jon Elsas. 2020. Learning to extract attribute value from product via question answering: A multi-task approach. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 47-55, Online. ACM.Qifan Wang, Li Yang, Bhargav Kanagal, Sumit Sanghai, D.Sivakumar, Bin Shu, Zac Yu, and Jon Elsas. 2020. Learning to extract attribute value from product via question answering: A multi-task approach. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 47-55, Online. ACM. Xiaoya Li, Jingrong Feng, Yuxian Meng, Qinghong Han, Fei Wu, and Jiwei Li. 2020. A unified MRC framework for named entity recognition. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 5849-5859, Online. Association for Computational Linguistics.Xiaoya Li, Jingrong Feng, Yuxian Meng, Qinghong Han, Fei Wu, and Jiwei Li. 2020. A unified MRC framework for named entity recognition. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 5849-5859, Online. Association for Computational Linguistics.

しかしながら、非特許文献１及び非特許文献２の技術では、例えば、属性値抽出モデルに相当するモデルから出力される第２属性値の精度が、クエリ次第で低下することがある。例えば、属性値抽出モデルに入力されるクエリは、比較的長い疑問文が元々想定されていたが、比較的シンプルな語がクエリとして入力されると、属性値抽出モデルに相当するモデルから出力される第２属性値の精度が低下する可能性がある。 However, in the techniques of Non-Patent Document 1 and Non-Patent Document 2, for example, the accuracy of the second attribute value output from a model equivalent to the attribute value extraction model may decrease depending on the query. For example, the query input to the attribute value extraction model was originally expected to be a relatively long interrogative sentence, but when a relatively simple word is input as the query, the accuracy of the second attribute value output from the model equivalent to the attribute value extraction model may decrease.

本開示の目的の１つは、属性値抽出モデルからの出力の精度を高めることである。 One of the objectives of this disclosure is to improve the accuracy of the output from the attribute value extraction model.

本開示に係る属性値抽出システムは、第１アイテムに関する第１アイテムデータ及び第１クエリと、前記第１アイテムに関する第１属性値と、の関係が学習された属性値抽出モデルを記憶する記憶部と、第２アイテムに関する第２属性に関連付けられた第２属性値を取得する第２属性値取得部と、前記第２アイテムに関する第２アイテムデータ、前記第２属性値に応じた第２クエリ、及び前記属性値抽出モデルに基づいて、前記第２アイテムデータから第３属性値を抽出する第３属性値抽出部と、を含む。 The attribute value extraction system according to the present disclosure includes a storage unit that stores an attribute value extraction model in which a relationship between first item data and a first query related to a first item and a first attribute value related to the first item is learned, a second attribute value acquisition unit that acquires a second attribute value associated with a second attribute related to a second item, and a third attribute value extraction unit that extracts a third attribute value from the second item data based on the second item data related to the second item, a second query corresponding to the second attribute value, and the attribute value extraction model.

本開示によれば、質問応答モデルからの出力の精度が高まる。 This disclosure improves the accuracy of output from question answering models.

属性値抽出システムの全体構成の一例を示す図である。FIG. 1 illustrates an example of an overall configuration of an attribute value extraction system. 商品データから取得される属性及び属性値の一例を示す図である。11A and 11B are diagrams illustrating an example of attributes and attribute values acquired from product data. 従来における質問応答モデルの一例を示す図である。FIG. 1 is a diagram illustrating an example of a conventional question answering model. 実施形態における質問応答モデルの一例を示す図である。FIG. 2 is a diagram illustrating an example of a question answering model in the embodiment. 属性値抽出システムで実現される機能の一例を示す機能ブロック図である。FIG. 2 is a functional block diagram showing an example of functions realized by the attribute value extraction system. 訓練データベースの一例を示す図である。FIG. 2 is a diagram illustrating an example of a training database. 属性データベースの一例を示す図である。FIG. 4 is a diagram illustrating an example of an attribute database. 属性値抽出システムで実行される処理の一例を示すフロー図である。FIG. 2 is a flow diagram illustrating an example of a process executed in the attribute value extraction system. 複数の第２属性値の各々に関する特徴量の平均を計算する処理の一例を示す図である。FIG. 11 is a diagram illustrating an example of a process for calculating an average of feature amounts relating to each of a plurality of second attribute values. 複数の第２属性値の各々に関する特徴量の平均を計算する処理の一例を示す図である。FIG. 11 is a diagram illustrating an example of a process for calculating an average of feature amounts relating to each of a plurality of second attribute values. 複数の第２属性値の各々に関する特徴量の平均を計算する処理の一例を示す図である。FIG. 11 is a diagram illustrating an example of a process for calculating an average of feature amounts relating to each of a plurality of second attribute values.

［１．属性値抽出システムの全体構成］
本開示に係る属性値抽出システムの実施形態の一例を説明する。図１は、属性値抽出システムの全体構成の一例を示す図である。ネットワークＮは、インターネット又はＬＡＮ等の任意のネットワークである。属性値抽出システムＳは、少なくとも１つのコンピュータを含めばよく、図１の例に限られない。 [1. Overall configuration of the attribute value extraction system]
An example of an embodiment of an attribute value extraction system according to the present disclosure will be described. Fig. 1 is a diagram showing an example of the overall configuration of the attribute value extraction system. A network N is any network such as the Internet or a LAN. The attribute value extraction system S is not limited to the example of Fig. 1 as long as it includes at least one computer.

サーバ１０は、サーバコンピュータである。制御部１１は、少なくとも１つのプロセッサを含む。記憶部１２は、ＲＡＭ等の揮発性メモリと、ハードディスク等の不揮発性メモリと、を含む。通信部１３は、有線通信用の通信インタフェースと、無線通信用の通信インタフェースと、の少なくとも一方を含む。 The server 10 is a server computer. The control unit 11 includes at least one processor. The storage unit 12 includes a volatile memory such as a RAM and a non-volatile memory such as a hard disk. The communication unit 13 includes at least one of a communication interface for wired communication and a communication interface for wireless communication.

ユーザ端末２０は、ユーザが操作するコンピュータである。例えば、ユーザ端末２０は、パーソナルコンピュータ、スマートフォン、タブレット端末、又はウェアラブル端末である。制御部２１、記憶部２２、及び通信部２３の物理的構成は、それぞれ制御部１１、記憶部１２、及び通信部１３と同様である。操作部２４は、タッチパネル又はマウス等の入力デバイスである。表示部２５は、液晶ディスプレイ又は有機ＥＬディスプレイである。 The user terminal 20 is a computer operated by a user. For example, the user terminal 20 is a personal computer, a smartphone, a tablet terminal, or a wearable terminal. The physical configurations of the control unit 21, the memory unit 22, and the communication unit 23 are similar to those of the control unit 11, the memory unit 12, and the communication unit 13, respectively. The operation unit 24 is an input device such as a touch panel or a mouse. The display unit 25 is a liquid crystal display or an organic EL display.

なお、記憶部１２，２２に記憶されるプログラム又はデータは、ネットワークＮを介して供給されてもよい。また、各コンピュータには、コンピュータ読み取り可能な情報記憶媒体を読み取る読取部（例えば、メモリカードスロット）と、外部機器とデータの入出力をするための入出力部（例えば、ＵＳＢポート）と、の少なくとも一方が含まれてもよい。例えば、情報記憶媒体に記憶されたプログラム又はデータが、読取部及び入出力部の少なくとも一方を介して供給されてもよい。 The programs or data stored in the storage units 12 and 22 may be supplied via the network N. Each computer may also include at least one of a reading unit (e.g., a memory card slot) that reads a computer-readable information storage medium and an input/output unit (e.g., a USB port) for inputting and outputting data to and from an external device. For example, the programs or data stored in the information storage medium may be supplied via at least one of the reading unit and the input/output unit.

［２．属性値抽出システムの概要］
本実施形態では、商品に関する商品データの中から、商品の属性に応じた属性値が取得される場合を例に挙げて、属性値抽出システムＳで実行される処理を説明する。商品は、商取引の対象となる物である。本実施形態では、オンラインショッピングモールにおける商品を例に挙げるが、商品自体は、任意の商品であってよく、例えば、インターネットオークション又はフリーマーケットアプリで売買される商品であってもよい。 2. Overview of the Attribute Value Extraction System
In this embodiment, the process executed by the attribute value extraction system S will be described by taking as an example a case where an attribute value corresponding to an attribute of a product is acquired from product data related to the product. A product is an object that is the subject of commercial transaction. In this embodiment, a product in an online shopping mall is taken as an example, but the product itself may be any product, for example, a product bought and sold in an internet auction or a flea market app.

商品データは、商品の詳細に関する情報である。例えば、商品データは、文字、数字、その他の記号、画像、音声、動画、又はこれらの組み合わせを含む。本実施形態では、オンラインショッピングモールにおける商品を例に挙げるので、店舗の担当者が入力した任意のテキスト（商品の説明文）が商品データに相当する。例えば、商品データは、オンラインショッピングモールのウェブサイト又はアプリケーションから閲覧可能である。本実施形態では、店舗の担当者がユーザに相当する場合を説明するが、ユーザは、属性値抽出システムＳを利用する者であればよく、店舗の担当者に限られない。例えば、ユーザは、オンラインショッピングモールの管理者であってもよい。 Product data is information about the details of a product. For example, product data includes letters, numbers, other symbols, images, audio, video, or a combination of these. In this embodiment, products in an online shopping mall are used as an example, so any text (product description) entered by a store staff member corresponds to the product data. For example, the product data can be viewed from the website or application of the online shopping mall. In this embodiment, a case is described in which a store staff member corresponds to the user, but the user may be anyone who uses the attribute value extraction system S, and is not limited to a store staff member. For example, the user may be an administrator of the online shopping mall.

属性は、商品を分類するための区分である。属性は、カテゴリ又はジャンルと呼ばれることもある。属性は、階層的に定義されてもよい。属性が階層的に定義される場合、属性が上位であるほど属性の意味が抽象的になり、属性が下位であるほど属性の意味が具体的になる。以降、単に属性と記載した箇所は、属性の名前を意味する。属性は、文字、数字、その他の記号、又はこれらの組み合わせによって表現される。属性は、所定の観点で商品を分類可能なものであればよく、任意の観点で属性を定義可能である。例えば、属性は、ブランド、色、素材、サイズ、機能、柄、又は産地といった情報である。 Attributes are categories for classifying products. Attributes are sometimes called categories or genres. Attributes may be defined hierarchically. When attributes are defined hierarchically, the higher the attribute, the more abstract the meaning of the attribute, and the lower the attribute, the more specific the meaning of the attribute. Hereinafter, when simply describing an attribute, it means the name of the attribute. Attributes are expressed by letters, numbers, other symbols, or combinations of these. Attributes can be defined from any perspective as long as they can classify products from a specific perspective. For example, attributes are information such as brand, color, material, size, function, pattern, or place of origin.

属性値は、具体的な分類を示す値である。属性には、複数の属性値が予め用意されている。商品には、予め用意された複数の属性値のうちの少なくとも１つが付与される。予め用意された複数の属性値は、商品に付与される属性値の候補である。属性値は、文字、数字、その他の記号、又はこれらの組み合わせによって表現される。例えば、ユーザは、ある商品の商品データをアップロードする時に、属性と、この属性に対して用意された複数の属性値のうちの少なくとも１つと、を指定する。この商品には、当該指定された属性及び属性値が付与される。 An attribute value is a value that indicates a specific classification. Multiple attribute values are prepared in advance for an attribute. At least one of the multiple prepared attribute values is assigned to a product. The multiple prepared attribute values are candidates for the attribute values to be assigned to a product. Attribute values are expressed by letters, numbers, other symbols, or combinations of these. For example, when uploading product data for a product, a user specifies an attribute and at least one of the multiple attribute values prepared for that attribute. The specified attribute and attribute value are assigned to the product.

例えば、属性「ブランド」には、属性値として、種々のブランド名が用意されている。あるブランドの商品には、属性「ブランド」に対して用意された複数の属性値のうち、この商品のブランドのブランド名を示す属性値が付与される。例えば、属性「色」には、属性値として、黒、白、黄色といった種々の色が予め用意されている。見た目が黒い商品には、属性「色」に対して用意された複数の属性値のうち、「黒」を示す属性値が付与される。他の属性についても同様であり、任意の属性値を予め用意し、任意の属性値を商品に付与できる。 For example, various brand names are prepared as attribute values for the attribute "brand." Products of a certain brand are assigned an attribute value indicating the brand name of the brand of this product from among the multiple attribute values prepared for the attribute "brand." For example, various colors such as black, white, and yellow are prepared in advance as attribute values for the attribute "color." Products that look black are assigned an attribute value indicating "black" from among the multiple attribute values prepared for the attribute "color." The same is true for other attributes; any attribute value can be prepared in advance and assigned to products.

図２は、商品データから取得される属性及び属性値の一例を示す図である。例えば、ユーザが操作部２４を操作してオンラインショッピングモールのウェブサイトにアクセスすると、商品データを含む商品ページＰが表示部２５に表示される。例えば、商品ページＰには、店舗の名前、商品のタイトル、商品の画像、及び商品の詳細な説明文といった商品データが表示される。ユーザが、商品データをアップロードする時に属性及び属性値を指定していれば、当該指定された属性及び属性値も商品データとして商品ページＰに表示される。商品の属性及び属性値は、検索時のインデックスとして利用される。 Figure 2 is a diagram showing an example of attributes and attribute values obtained from product data. For example, when a user operates the operation unit 24 to access a website of an online shopping mall, a product page P including product data is displayed on the display unit 25. For example, product data such as the store name, product title, product image, and detailed product description are displayed on the product page P. If the user specifies attributes and attribute values when uploading product data, the specified attributes and attribute values are also displayed as product data on the product page P. The product attributes and attribute values are used as indexes during searches.

商品データの中から、商品の属性に応じた適切な属性値を取得できれば、種々の効果を奏することができる。例えば、ユーザが属性値を指定する必要がなくなるので、ユーザの負担を軽減できる。例えば、ユーザが誤って不適切な属性値を指定したとしても、商品データの中から取得した適切な属性値に置き換えることもできる。例えば、現時点では属性値として用意されていなくても、商品データの中から新たな属性値を取得することもできる。例えば、ユーザの嗜好又はマーケットプレイスをより深く理解し、マーケティングに活用することもできる。 If appropriate attribute values according to the attributes of a product can be obtained from product data, various effects can be achieved. For example, the burden on the user can be reduced because the user does not need to specify attribute values. For example, even if the user mistakenly specifies an inappropriate attribute value, it can be replaced with an appropriate attribute value obtained from the product data. For example, new attribute values can be obtained from the product data even if they are not currently available as attribute values. For example, it can be possible to gain a deeper understanding of user preferences or the marketplace and use this for marketing.

図２の例であれば、商品データの中から、属性「ブランド」の属性値「ＢＢＢバッグ」、属性「色」の属性値「ブラック」、属性「素材」の属性値「レザー」、属性「サイズ」の属性値「１０’’Ｈ×１３’’Ｌ×６’’Ｄ」、及び属性「機能」の属性値「防水」を取得することができれば、先述した種々の効果を奏することができる。本実施形態では、これらの属性値を取得するために、属性値抽出モデルとして質問応答モデル（ＱＡモデル）が利用される。本実施形態では、属性値抽出モデルとして質問応答モデルが利用される実施形態を例示するが、商品データ等のエンティティとその属性を入力として属性値を抽出するモデルであれば、その種別に制限はない。以降、属性値抽出モデルの一例として質問応答モデルを説明するので、以降の説明で質問応答モデルと記載した箇所は、属性値抽出モデルと読み替えることができる。 In the example of FIG. 2, if the attribute value "BBB bag" of the attribute "brand", the attribute value "black" of the attribute "color", the attribute value "leather" of the attribute "material", the attribute value "10''H x 13''L x 6''D" of the attribute "size", and the attribute value "waterproof" of the attribute "function" can be obtained from the product data, the various effects described above can be achieved. In this embodiment, a question answering model (QA model) is used as the attribute value extraction model to obtain these attribute values. In this embodiment, an embodiment in which a question answering model is used as the attribute value extraction model is illustrated, but there is no restriction on the type as long as it is a model that extracts attribute values by inputting entities such as product data and their attributes. In the following, a question answering model will be described as an example of an attribute value extraction model, so that in the following description, the description of the question answering model can be replaced with the description of the attribute value extraction model.

質問応答モデルは、機械学習を利用したモデルである。機械学習自体は、種々の手法を利用可能である。例えば、質問応答モデルは、教師有り学習、半教師有り学習、又は教師無し学習の何れのモデルであってもよい。本実施形態では、ＢＥＲＴと呼ばれるモデルを利用した質問応答モデルを例に挙げるが、質問応答モデル自体は、任意のモデルであってよい。例えば、質問応答モデルは、ＢＥＲＴ以外のいわゆるＴｒａｎｓｆｏｒｍｅｒベースのモデルであってもよいし、Ｔｒａｎｓｆｏｒｍｅｒが登場する以前に利用されていたニューラルネットワークのモデルであってもよい。 The question answering model is a model that uses machine learning. Various methods can be used for the machine learning itself. For example, the question answering model may be any model of supervised learning, semi-supervised learning, or unsupervised learning. In this embodiment, a question answering model that uses a model called BERT is given as an example, but the question answering model itself may be any model. For example, the question answering model may be a so-called Transformer-based model other than BERT, or may be a neural network model that was used before the advent of the Transformer.

図３は、従来における質問応答モデルの一例を示す図である。以降、従来における質問応答モデルＭ１及び質問応答モデルＭ２の概要を説明するが、質問応答モデルＭ１及び質問応答モデルＭ２の詳細は、先行技術文献として挙げた非特許文献１、非特許文献２、又はこれらに記載された参考文献を参照されたい。 Figure 3 is a diagram showing an example of a conventional question answering model. Below, an overview of conventional question answering models M1 and M2 will be explained. For details of question answering models M1 and M2, please refer to Non-Patent Document 1 and Non-Patent Document 2 listed as prior art documents, or the references described therein.

質問応答モデルＭ１は、ニュース記事及びクエリが入力されると、クエリに応じた応答を出力するモデルである。質問応答モデルＭ１には、ニュース記事及びクエリと、クエリに応じた応答と、の関係を示す訓練データが学習済みである。質問応答モデルＭ１に入力されるクエリは、ニュース記事の内容に関する質問である。質問応答モデルＭ１から出力される応答は、クエリである質問に対する回答である。 The question-answering model M1 is a model that, when a news article and a query are input, outputs a response corresponding to the query. The question-answering model M1 has learned training data that indicates the relationship between the news article and the query, and the response corresponding to the query. The query input to the question-answering model M1 is a question about the content of the news article. The response output from the question-answering model M1 is an answer to the question, which is the query.

質問応答モデルＭ１は、ニュース記事の中から、応答として適切な部分を推定し、当該部分を応答として出力する。図３の例では、日本における緊急事態宣言の解除に関するニュース記事と、緊急事態宣言が解除される時期を問うクエリと、が質問応答モデルＭ１に入力される。質問応答モデルＭ１は、緊急事態宣言が解除される時期として、ニュース記事の「９月３０日」の部分が適切であることを推定し、この部分「９月３０日」を応答として出力する。このような質問応答モデルＭ１は、商品データから属性値を取得する目的でも利用できる。 The question-answering model M1 estimates which parts of a news article are appropriate as a response, and outputs the parts as a response. In the example of FIG. 3, a news article about the lifting of the state of emergency in Japan and a query asking when the state of emergency will be lifted are input to the question-answering model M1. The question-answering model M1 estimates that the part of the news article that reads "September 30th" is appropriate as the date when the state of emergency will be lifted, and outputs this part, "September 30th," as a response. Such a question-answering model M1 can also be used to obtain attribute values from product data.

質問応答モデルＭ２は、商品データ及びクエリが入力されると、クエリに応じた応答を出力するモデルである。質問応答モデルＭ２には、商品データ及びクエリと、クエリに応じた応答と、の関係を示す訓練データが学習済みである。質問応答モデルＭ２に入力されるクエリは、商品データが示す商品の属性である。クエリは、商品データのうち、属性に応じた属性値が含まれる部分を問う質問ということができる。質問応答モデルＭ２から出力される応答は、クエリである属性に応じた属性値である。 The question-answering model M2 is a model that, when product data and a query are input, outputs a response corresponding to the query. The question-answering model M2 has learned training data that indicates the relationship between the product data and the query, and the response corresponding to the query. The query input to the question-answering model M2 is an attribute of the product indicated by the product data. The query can be said to be a question that asks about a part of the product data that contains an attribute value corresponding to the attribute. The response output from the question-answering model M2 is an attribute value corresponding to the attribute that is the query.

質問応答モデルＭ２は、商品データの中から、応答として適切な部分を推定し、当該推定結果を応答として出力する。図３の例では、バッグの特徴に関する商品データと、クエリである属性「ブランド」と、が質問応答モデルＭ２に入力される。図３では、商品の画像も質問応答モデルＭ２に入力されるものとしているが、実際には、商品データのうちのテキスト部分のみが質問応答モデルＭ２に入力されるものとする。ここで、商品データのうちの画像部分は、ニューラルネットワーク等の機械学習モデルに基づく特徴抽出器等を介して特徴ベクトル等の数値の組み合わせに変換されることで、質問応答モデルＭ２に入力されてよい。また、ここでは、クエリとして用いる属性は、ユーザにより指定されるものとする。 The question-answering model M2 estimates an appropriate part of the product data as a response, and outputs the estimated result as a response. In the example of FIG. 3, product data related to the characteristics of the bag and the attribute "brand" which is a query are input to the question-answering model M2. In FIG. 3, an image of the product is also input to the question-answering model M2, but in reality, only the text part of the product data is input to the question-answering model M2. Here, the image part of the product data may be converted into a combination of numerical values such as a feature vector via a feature extractor based on a machine learning model such as a neural network, and then input to the question-answering model M2. Here, the attribute used as the query is specified by the user.

例えば、ユーザは、商品データをアップロードする前に、図３の商品データと、属性「ブランド」と、を質問応答モデルＭ２に入力する。質問応答モデルＭ２は、属性「ブランド」に応じた属性値（ブランド名）として、商品データの中の「ＢＢＢバッグ」の部分が適切であることを推定し、この部分「ＢＢＢバッグ」を応答として出力する。例えば、属性「色」、属性「素材」、属性「サイズ」、又は属性「機能」といった他の属性も、質問応答モデルＭ２へのクエリとして入力可能である。 For example, before uploading product data, a user inputs the product data in FIG. 3 and the attribute "brand" into question-answering model M2. Question-answering model M2 estimates that the portion "BBB bag" in the product data is appropriate as an attribute value (brand name) corresponding to the attribute "brand", and outputs this portion "BBB bag" as a response. For example, other attributes such as the attribute "color", the attribute "material", the attribute "size", or the attribute "function" can also be input as a query to question-answering model M2.

ただし、質問応答モデルＭ２に入力されるクエリは、質問応答モデルＭ１に入力されるクエリよりも短く具体性に欠けることが多いので、質問応答モデルＭ２がクエリの意味を認識できないことがある。質問応答モデルＭ２がクエリの意味を認識できなければ、質問応答モデルＭ２からの出力の精度も低下する。特に、オンラインショッピングモールの場合、商品データに基づいて訓練データを作成すると、人気のある商品の属性が多くなり、他の商品に関する訓練データがスパースになりがちである。訓練データのスパースネス性も、質問応答モデルＭ２からの出力の精度が低下する要因の１つである。 However, since the queries input to the question-answering model M2 are often shorter and less specific than those input to the question-answering model M1, the question-answering model M2 may not be able to recognize the meaning of the query. If the question-answering model M2 cannot recognize the meaning of the query, the accuracy of the output from the question-answering model M2 will also decrease. In particular, in the case of an online shopping mall, when training data is created based on product data, there tend to be many attributes of popular products, and the training data for other products tends to be sparse. The sparseness of the training data is also one of the factors that can reduce the accuracy of the output from the question-answering model M2.

属性「機能」を例に挙げると、エアコンのような商品であれば、「タイマー」及び「省エネ」といった機能が存在する。バッグのような商品であれば、属性「機能」として、「防水」、「防カビ」、又は「防汚」といった機能が存在する。このため、同じ属性「機能」だったとしても、種々の意味が存在する。質問応答モデルＭ２は、属性「機能」のような短くて抽象的なクエリの意味を認識できず、適切な応答を出力できないことがある。 Taking the attribute "function" as an example, a product such as an air conditioner may have functions such as "timer" and "energy saving." A product such as a bag may have attributes such as "waterproof," "mildew-proof," or "stain-resistant." For this reason, the same attribute "function" may have a variety of meanings. The question-answering model M2 may not be able to recognize the meaning of a short, abstract query such as the attribute "function," and may not be able to output an appropriate response.

他の属性も同様であり、短くて抽象的なクエリが質問応答モデルＭ２に入力された場合、質問応答モデルＭ２は、クエリの意味を十分に認識できず、質問応答モデルＭ２からの出力の精度が低下する可能性がある。即ち、クエリの質によって、質問応答モデルＭ２からの出力の精度が低下する可能性がある。そこで、本実施形態では、クエリとして、商品の属性だけではなく、属性に対して予め用意された属性値も入力することによって、クエリ拡張を実現してクエリの質を高めるようにしている。即ち、先述した訓練データのスパースネス性の問題も解決できる。 The same is true for other attributes. If a short and abstract query is input to the question-answering model M2, the question-answering model M2 may not be able to fully recognize the meaning of the query, and the accuracy of the output from the question-answering model M2 may decrease. That is, the quality of the query may decrease the accuracy of the output from the question-answering model M2. Therefore, in this embodiment, by inputting not only the product attributes but also pre-prepared attribute values for the attributes as the query, the query is expanded and the quality of the query is improved. That is, the problem of sparseness of the training data mentioned above can also be solved.

図４は、実施形態における質問応答モデルの一例を示す図である。質問応答モデルＭ３、商品データ及びクエリが入力されると、クエリに応じた応答を出力するモデルという点では、質問応答モデルＭ２と同様である。ただし、先述したように、クエリが拡張されており、クエリの意味を認識しやすいようになっている。本実施形態では、質問応答モデルＭ３の訓練データに含まれるクエリも、属性だけではなく属性値も含まれるようになっているが、質問応答モデルＭ３の訓練データは、質問応答モデルＭ２と同様であってもよい。 Figure 4 is a diagram showing an example of a question-answering model in an embodiment. Question-answering model M3 is similar to question-answering model M2 in that it is a model that outputs a response according to the query when product data and a query are input. However, as described above, the query is expanded so that the meaning of the query is easier to recognize. In this embodiment, the query included in the training data for question-answering model M3 also includes attribute values in addition to attributes, but the training data for question-answering model M3 may be similar to that of question-answering model M2.

例えば、商品データが示す商品がバッグであり、クエリとして入力される商品の属性が「機能」だったとする。先述した通り、属性「機能」は、短くて抽象的な属性であり、バッグ以外の商品にも関係する種々の意味を有する。このため、バッグ関連の属性の意味であることを質問応答モデルＭ３が認識できるように、属性「機能」と、バッグ関連の属性「機能」に予め用意された属性値「防カビ」と、を含むクエリが入力される。 For example, suppose the product indicated by the product data is a bag, and the product attribute input as the query is "function." As mentioned above, the attribute "function" is a short and abstract attribute, and has various meanings related to products other than bags. For this reason, a query is input that includes the attribute "function" and the attribute value "mold prevention" that is prepared in advance for the bag-related attribute "function," so that the question-answering model M3 can recognize that the meaning is a bag-related attribute.

図４の商品は、カビを防ぐ機能を有する商品ではないが、クエリに含まれる属性値「防カビ」によって、属性「機能」の意味を質問応答モデルＭ３が認識できるようになる。このため、質問応答モデルＭ３は、商品データの中から、クエリに含まれる属性値「防カビ」と意味的又は表記的に近い表現を有する機能名である「防水」といった属性値を、応答として出力できるようになる。他にも例えば、属性値「防汚」（汚れを防ぐ機能）といったように、抽出したい属性値「防水」と意味的又は表記的に近い他の表現がクエリとして与えられた場合も同様に、質問応答モデルＭ３は、商品データの中から属性値「防水」を、応答として出力できる。この点は、例えば、属性「機能」に限られず、属性「種類」といったように、短く抽象的な属性を含む他のクエリが質問応答モデルＭ３に入力される場合も同様である。 The product in FIG. 4 is not a product that has a function for preventing mold, but the attribute value "mold prevention" included in the query allows the question-answering model M3 to recognize the meaning of the attribute "function". As a result, the question-answering model M3 can output, as a response, an attribute value such as "waterproof", which is a function name that has an expression that is semantically or typographically similar to the attribute value "mold prevention" included in the query, from the product data. Similarly, when another expression that is semantically or typographically similar to the attribute value "waterproof" to be extracted, such as the attribute value "stain prevention" (function for preventing stains), is given as a query, the question-answering model M3 can output the attribute value "waterproof" from the product data as a response. This point is also the same when a query that includes a short and abstract attribute, such as the attribute "type", is input to the question-answering model M3, not limited to the attribute "function".

本実施形態では、予め用意された複数の属性値のうちの何れかがランダムに選択されるものとする。このため、図４の例では、ユーザが指定した属性「機能」と、ランダムに選択された属性値「防カビ」と、がクエリとして質問応答モデルＭ３に入力される。バッグの商品に対してエアコンの機能を示す属性値が選択されると、質問応答モデルＭ３が適切な応答を出力できない可能性があるので、バッグ関連の属性「機能」を意味することは、ユーザが指定する等して、何らかの形で特定可能であるものとする。ユーザが指定した他の属性及び属性値からバッグ関連の属性「機能」を意味することが特定されてもよいし、ユーザ自身がバッグ関連の属性「機能」の属性値「防カビ」を指定してもよい。 In this embodiment, one of a number of pre-prepared attribute values is selected at random. For this reason, in the example of FIG. 4, the attribute "function" specified by the user and the randomly selected attribute value "anti-mold" are input as a query to the question answering model M3. If an attribute value indicating the function of an air conditioner is selected for a bag product, there is a possibility that the question answering model M3 will not be able to output an appropriate response. Therefore, it is assumed that the meaning of the bag-related attribute "function" can be specified in some way, such as by the user. The meaning of the bag-related attribute "function" may be specified from other attributes and attribute values specified by the user, or the user himself may specify the attribute value "anti-mold" of the bag-related attribute "function".

例えば、属性「ブランド」、属性「色」、属性「素材」、属性「サイズ」、又は属性「機能」といった他の属性も同様に、これらの属性だけではなく、予め用意された属性値もクエリとして入力することによって、質問応答モデルＭ３は、クエリの意味をより正確に認識できるようになる。このため、質問応答モデルＭ３からの出力の精度を高めることができる。以降、属性値抽出システムＳの詳細を説明する。 For example, by inputting not only other attributes such as the attribute "brand", the attribute "color", the attribute "material", the attribute "size", or the attribute "function" as well as pre-prepared attribute values as queries, the question answering model M3 can more accurately recognize the meaning of the query. This makes it possible to improve the accuracy of the output from the question answering model M3. The attribute value extraction system S will be described in detail below.

［３．属性値抽出システムで実現される機能］
図５は、属性値抽出システムＳで実現される機能の一例を示す機能ブロック図である。本実施形態では、主な機能がサーバ１０により実現される場合を説明する。データ記憶部１００は、記憶部１２を主として実現される。学習部１０１、第２属性取得部１０２、第２属性値取得部１０３、第３属性値抽出部１０４、及び第３属性値付与部１０５は、制御部１１を主として実現される。 [3. Functions Realized by the Attribute Value Extraction System]
5 is a functional block diagram showing an example of functions realized by the attribute value extraction system S. In this embodiment, a case will be described in which the main functions are realized by the server 10. The data storage unit 100 is mainly realized by the storage unit 12. The learning unit 101, the second attribute acquisition unit 102, the second attribute value acquisition unit 103, the third attribute value extraction unit 104, and the third attribute value assignment unit 105 are mainly realized by the control unit 11.

以降、質問応答モデルＭ３の訓練データに含まれる商品データ、クエリ、及び属性値を、それぞれ第１商品データ、第１クエリ、及び第１属性値をいう。第１商品データが示す商品を、第１商品という。質問応答モデルＭ３に入力される商品データ及びクエリを、それぞれ第２商品データ及び第２クエリという。第２クエリに含まれる属性及び属性値を、それぞれ第２属性及び第２属性値という。第２商品データが示す商品を、第２商品という。学習済みの質問応答モデルＭ３から出力される属性値を、第３属性値という。 Hereinafter, the product data, query, and attribute value included in the training data of the question-answering model M3 are referred to as the first product data, first query, and first attribute value, respectively. The product indicated by the first product data is referred to as the first product. The product data and query input to the question-answering model M3 are referred to as the second product data and second query, respectively. The attribute and attribute value included in the second query are referred to as the second attribute and second attribute value, respectively. The product indicated by the second product data is referred to as the second product. The attribute value output from the trained question-answering model M3 is referred to as the third attribute value.

［３－１．データ記憶部］
データ記憶部１００は、第２商品に第３属性値を付与するために必要なデータを記憶する。例えば、データ記憶部１００は、第１商品に関する第１商品データ及び第１クエリと、第１商品に関する第１属性値と、の関係が学習された質問応答モデルＭ３を記憶する。第１商品は、第１アイテムの一例である。第１商品データは、第１アイテムデータの一例である。このため、第１商品と記載した箇所は、第１アイテムと読み替えることができる。第１商品データと記載した箇所は、第１アイテムデータと読み替えることができる。 [3-1. Data storage unit]
The data storage unit 100 stores data necessary for assigning a third attribute value to the second product. For example, the data storage unit 100 stores a question answering model M3 in which the relationship between the first product data and the first query regarding the first product and the first attribute value regarding the first product is learned. The first product is an example of a first item. The first product data is an example of first item data. Therefore, the first product can be read as the first item. The first product data can be read as the first item data.

第１アイテムは、質問応答モデルＭ３の訓練用のアイテムである。アイテムとは、属性及び属性値が付与される対象となる物である。アイテムは、商品のように有体物であってもよいし、サービス又はデータのような無体物であってもよい。アイテムは、任意の物であってよく、商品に限られない。例えば、アイテムは、宿泊施設に関するコンテンツ、レストランに関するコンテンツ、電子書籍、動画、楽曲、ウェブサイト、又はその他のコンテンツであってもよい。例えば、アイテムは、金融サービス又は通信サービスといったサービスの紹介文であってもよいし、ＳＮＳにおける投稿であってもよい。 The first item is an item for training the question answering model M3. An item is an object to which attributes and attribute values are assigned. An item may be a tangible object such as a product, or an intangible object such as a service or data. An item may be any object and is not limited to a product. For example, an item may be content related to accommodation, content related to a restaurant, an e-book, a video, a song, a website, or other content. For example, an item may be an introduction to a service such as a financial service or a communication service, or a post on a social networking site.

第１アイテムデータは、第１アイテムに関する何らかの内容を含むアイテムデータである。アイテムデータは、アイテムの詳細に関するデータである。アイテムデータは、アイテムに応じたデータであればよい。例えば、アイテムが宿泊施設であれば、宿泊施設又は部屋のタイトル又は紹介文がアイテムデータに相当してもよい。例えば、アイテムが電子書籍であれば、電子書籍の実データ部分がアイテムデータに相当してもよい。第１アイテムデータは、第１アイテムに関するアイテムデータである。 The first item data is item data that includes some content related to the first item. The item data is data related to details of the item. The item data may be data according to the item. For example, if the item is an accommodation facility, the title or description of the accommodation facility or room may correspond to the item data. For example, if the item is an e-book, the actual data portion of the e-book may correspond to the item data. The first item data is item data related to the first item.

例えば、データ記憶部１００は、学習済みの質問応答モデルＭ３を記憶する。質問応答モデルＭ３は、入力されたデータに対する処理を実行するためのプログラム部分と、学習によって調整されるパラメータ部分と、を含む。データ記憶部１００は、学習前の質問応答モデルＭ３を記憶し、学習前の質問応答モデルＭ３のパラメータが学習によって調整される。学習が完了すると、学習前の質問応答モデルＭ３は、学習後の質問応答モデルＭ３に更新される。例えば、データ記憶部１００は、質問応答モデルＭ３以外にも、訓練データベースＤＢ１及び属性データベースＤＢ２を記憶する。 For example, the data storage unit 100 stores a trained question-answering model M3. The question-answering model M3 includes a program portion for executing processing on input data and a parameter portion that is adjusted by learning. The data storage unit 100 stores the question-answering model M3 before learning, and the parameters of the question-answering model M3 before learning are adjusted by learning. When learning is completed, the question-answering model M3 before learning is updated to the question-answering model M3 after learning. For example, in addition to the question-answering model M3, the data storage unit 100 also stores a training database DB1 and an attribute database DB2.

図６は、訓練データベースＤＢ１の一例を示す図である。訓練データベースＤＢ１は、質問応答モデルＭ３の訓練データが格納されたデータベースである。訓練データは、第１商品データ及び第１クエリと、第１属性値と、の関係に関するデータである。訓練データは、学習時に質問応答モデルＭ３に入力される入力部分と、質問応答モデルＭ３から出力されるべき出力部分と、のペアを含む。 Figure 6 is a diagram showing an example of the training database DB1. The training database DB1 is a database in which training data for the question-answering model M3 is stored. The training data is data relating to the relationship between the first product data and the first query, and the first attribute value. The training data includes pairs of an input portion that is input to the question-answering model M3 during learning, and an output portion that is to be output from the question-answering model M3.

訓練データの入力部分は、学習済みの質問応答モデルＭ３に入力されるデータと同じ形式である。このため、第１商品データ及び第１クエリの形式と、第２商品データ及び第２クエリの形式と、は同じである。本実施形態では、第１商品データ及び第１クエリと、第２商品データ及び第２クエリと、が互いに文字形式である場合を説明するが、これらは、数字、その他の記号、又はこれらと文字の組み合わせといった任意の形式であってよい。例えば、質問応答モデルＭ３に文字が入力されるのではなく、文字の特徴量が入力されるのであれば、訓練データの入力部分として、第１商品データ及び第１クエリの特徴量が訓練データに含まれてもよい。 The input portion of the training data has the same format as the data input to the trained question-answering model M3. Therefore, the format of the first product data and the first query is the same as the format of the second product data and the second query. In this embodiment, the first product data and the first query, and the second product data and the second query are both in character format, but these may be in any format, such as numbers, other symbols, or a combination of these and characters. For example, if character features are input to the question-answering model M3 rather than characters, the features of the first product data and the first query may be included in the training data as the input portion of the training data.

訓練データの出力部分は、学習済みの質問応答モデルＭ３から出力されるデータと同じ形式である。このため、第１属性値の形式と、第３属性値の形式と、は同じである。本実施形態では、第１属性値及び第３属性値が互いに文字形式である場合を説明するが、これらは、数字、その他の記号、又はこれらと文字の組み合わせといった任意の形式であってよい。例えば、質問応答モデルＭ３から属性値を示す文字列が出力されるのではなく、属性値を識別可能なＩＤ又は番号が出力されるのであれば、訓練データの出力部分として、第１属性値を識別可能なＩＤ又は番号が訓練データに含まれてもよい。 The output portion of the training data has the same format as the data output from the trained question-answering model M3. Therefore, the format of the first attribute value and the format of the third attribute value are the same. In this embodiment, the first attribute value and the third attribute value are both in character format, but they may be in any format, such as numbers, other symbols, or a combination of these and characters. For example, if the question-answering model M3 does not output a character string indicating the attribute value, but rather outputs an ID or number that can identify the attribute value, the ID or number that can identify the first attribute value may be included in the training data as the output portion of the training data.

本実施形態では、オンラインショッピングモールで実際に販売されている商品が第１商品に相当する場合を説明する。即ち、オンラインショッピングモールに実際にアップロードされた商品データが第１商品データに相当する。例えば、第１商品データは、オンラインショッピングモールで販売される商品のタイトル、説明文、又はこれらの組み合わせである。第１商品データは、ユーザが入力した文字を含む。なお、訓練データは、オンラインショッピングモールで実際に販売されている商品に基づいて作成されるのではなく、属性値抽出システムＳの管理者が手動で作成してもよい。 In this embodiment, a case will be described in which a product actually sold at an online shopping mall corresponds to the first product. That is, product data actually uploaded to the online shopping mall corresponds to the first product data. For example, the first product data is the title, description, or a combination of these of a product sold at the online shopping mall. The first product data includes characters entered by the user. Note that the training data may be created manually by an administrator of the attribute value extraction system S, rather than being created based on products actually sold at the online shopping mall.

第１クエリは、第１商品の属性値を問うための質問である。例えば、第１クエリは、第１商品の属性と、当該属性に対応する属性値と、の組み合わせである。第１商品の属性は、第１商品に関連付けられた属性である。例えば、第１クエリに含まれる属性は、ユーザが指定した属性を示す文字を含む。第１クエリに含まれる属性値は、ユーザが指定した属性値であってもよいし、第１クエリに含まれる属性に対して用意された複数の属性値の中からランダムに選択された属性値であってもよい。 The first query is a question for inquiring about an attribute value of the first product. For example, the first query is a combination of an attribute of the first product and an attribute value corresponding to the attribute. The attribute of the first product is an attribute associated with the first product. For example, the attribute included in the first query includes characters indicating an attribute specified by the user. The attribute value included in the first query may be an attribute value specified by the user, or may be an attribute value randomly selected from multiple attribute values prepared for the attribute included in the first query.

第１属性値は、第１商品の正解となる属性値である。第１属性値は、第１クエリに含まれる属性に対して用意された複数の属性値のうちの何れかである。第１属性値は、第１クエリに含まれる属性値と同じであってもよいし異なってもよい。例えば、第１属性値は、ユーザが指定した属性値を示す文字を含む。ユーザが誤った属性値を指定した場合には、属性値抽出システムＳの管理者によって正しい属性値が指定され、当該属性値が第１属性値として用いられてもよい。 The first attribute value is an attribute value that is the correct answer for the first product. The first attribute value is any one of a plurality of attribute values prepared for the attribute included in the first query. The first attribute value may be the same as or different from the attribute value included in the first query. For example, the first attribute value includes characters that indicate the attribute value specified by the user. If the user specifies an incorrect attribute value, a correct attribute value may be specified by an administrator of the attribute value extraction system S, and the attribute value may be used as the first attribute value.

図７は、属性データベースＤＢ２の一例を示す図である。属性データベースＤＢ２は、複数の属性の各々に対して予め用意された属性値が格納されたデータベースである。例えば、属性データベースＤＢ２には、複数の属性の各々と、予め用意された複数の属性値と、が格納される。属性データベースＤＢ２は、第２属性値を取得するために利用される。 Figure 7 is a diagram showing an example of attribute database DB2. Attribute database DB2 is a database in which pre-prepared attribute values for each of a plurality of attributes are stored. For example, attribute database DB2 stores each of a plurality of attributes and a plurality of pre-prepared attribute values. Attribute database DB2 is used to obtain the second attribute value.

本実施形態では、属性データベースＤＢ２は、訓練データに基づいて作成されるものとする。例えば、サーバ１０は、訓練データの入力部分に含まれる属性と、当該入力部分に対応する出力部分である第１属性値と、を関連付けて属性データベースＤＢ２に格納する。サーバ１０は、ある属性に関連付ける属性値として、所定数以上の訓練データで登場した第１属性値だけを属性データベースＤＢ２に格納してもよい。 In this embodiment, the attribute database DB2 is created based on training data. For example, the server 10 associates an attribute included in the input portion of the training data with a first attribute value, which is the output portion corresponding to the input portion, and stores them in the attribute database DB2. The server 10 may store in the attribute database DB2 only first attribute values that appear in a predetermined number or more of training data as attribute values to be associated with a certain attribute.

なお、属性データベースＤＢ２は、任意の方法によって作成可能であり、訓練データに基づいて作成されなくてもよい。例えば、サーバ１０は、オンラインショッピングモールで販売される商品に関する商品データベースに基づいて、属性データベースＤＢ２を作成してもよい。この場合、サーバ１０は、訓練データとして利用しない商品の属性及び属性値のペアを属性データベースＤＢ２に格納してもよいし、所定数以上の商品で登場した属性及び属性値のペアを属性データベースＤＢ２に格納してもよい。例えば、属性データベースＤＢ２は、属性値抽出システムＳの管理者が手動で作成してもよい。例えば、オンラインショッピングモールで販売される商品に対して付与可能な属性及び属性値のリストが属性データベースＤＢ２として利用されてもよい。 The attribute database DB2 can be created by any method and does not have to be created based on training data. For example, the server 10 may create the attribute database DB2 based on a product database related to products sold in an online shopping mall. In this case, the server 10 may store in the attribute database DB2 pairs of attributes and attribute values of products that are not used as training data, or may store in the attribute database DB2 pairs of attributes and attribute values that appear in a predetermined number or more of products. For example, the attribute database DB2 may be manually created by an administrator of the attribute value extraction system S. For example, a list of attributes and attribute values that can be assigned to products sold in an online shopping mall may be used as the attribute database DB2.

［３－２．学習部］
学習部１０１は、訓練データベースＤＢ１に格納された訓練データに基づいて、質問応答モデルＭ３の学習処理を実行する。例えば、学習部１０１は、訓練データの入力部分が入力された場合に、訓練データの出力部分が出力されるように、質問応答モデルＭ３の学習処理を実行する。学習処理自体は、質問応答モデルＭ３に応じた種々のアルゴリズムを利用可能であり、例えば、ＢＥＲＴ等の事前学習済みのＴｒａｎｓｆｏｒｍｅｒベースのモデルで行われるファインチューニングの態様をとってよく、他のモデルの最適なパラメータを求める際は誤差逆伝播法又は勾配降下法といったアルゴリズムを利用してもよい。 [3-2. Learning section]
The learning unit 101 executes a learning process of the question answering model M3 based on the training data stored in the training database DB1. For example, the learning unit 101 executes the learning process of the question answering model M3 such that, when an input portion of training data is input, an output portion of training data is output. The learning process itself can use various algorithms according to the question answering model M3, and may take the form of fine tuning performed on a pre-trained Transformer-based model such as BERT, and an algorithm such as backpropagation or gradient descent may be used to find optimal parameters for other models.

例えば、学習部１０１は、訓練データの入力部分に含まれる第１商品データと、訓練データの入力部分に含まれる第１クエリと、を結合する。第１商品データには、第１商品データであることを示す語が付与されてもよい。第１クエリには、第１クエリであることを示す語が付与されてもよい。更に、第１クエリの中には、属性及び属性値の各々であることを示す語が付与されてもよい。第１商品データ及び第１クエリの間には、語の区切りを示す語が配置されてもよい。学習部１０１は、第１商品データ及び第１クエリが入力された場合に、第１商品データ及び第１クエリに関連付けられた第１属性値が出力されるように、学習処理を実行する。 For example, the learning unit 101 combines first product data included in the input portion of the training data with a first query included in the input portion of the training data. A word indicating that it is the first product data may be assigned to the first product data. A word indicating that it is the first query may be assigned to the first query. Furthermore, a word indicating that it is an attribute and an attribute value may be assigned to the first query. A word indicating a word separator may be placed between the first product data and the first query. The learning unit 101 executes a learning process such that, when the first product data and the first query are input, a first attribute value associated with the first product data and the first query is output.

［３－３．第２属性取得部］
第２属性取得部１０２は、第２商品に関する第２商品データに関連付けられた第２属性を取得する。第２商品は、第２アイテムの一例である。第２商品データは、第２アイテムデータの一例である。このため、第２商品と記載した箇所は、第２アイテムと読み替えることができる。第２商品データと記載した箇所は、第２アイテムデータと読み替えることができる。第２アイテムは、質問応答モデルＭ３による推定用のアイテムである。アイテムの意味は、先述した通りである。第２アイテムデータは、第２アイテムに関する何らかの内容を含むアイテムデータである。アイテムデータの意味も、先述した通りである。 [3-3. Second attribute acquisition part]
The second attribute acquisition unit 102 acquires a second attribute associated with second product data related to the second product. The second product is an example of a second item. The second product data is an example of second item data. Therefore, the section describing the second product can be read as the second item. The section describing the second product data can be read as the second item data. The second item is an item for estimation by the question answering model M3. The meaning of the item is as described above. The second item data is item data including some content related to the second item. The meaning of the item data is also as described above.

本実施形態では、第２属性取得部１０２は、ユーザが指定した第２属性を取得するものとする。例えば、第２属性取得部１０２は、ユーザ端末２０から、第２属性を取得する。第２属性は、任意の方法によって取得可能であり、ユーザが指定しなくてもよい。例えば、適切な属性及び属性値が商品に付与されているかを判定する目的で属性値抽出システムＳを利用するのであれば、第２属性取得部１０２は、商品データベースを参照し、商品に関連付けられた属性を、第２属性として取得してもよい。即ち、第２属性取得部１０２は、商品のインデックスとして付与されている属性を、第２属性として取得してもよい。 In this embodiment, the second attribute acquisition unit 102 acquires the second attribute specified by the user. For example, the second attribute acquisition unit 102 acquires the second attribute from the user terminal 20. The second attribute can be acquired by any method and does not have to be specified by the user. For example, if the attribute value extraction system S is used for the purpose of determining whether appropriate attributes and attribute values are assigned to a product, the second attribute acquisition unit 102 may refer to the product database and acquire an attribute associated with the product as the second attribute. In other words, the second attribute acquisition unit 102 may acquire an attribute assigned as an index of the product as the second attribute.

他にも例えば、第２属性取得部１０２は、予め定められた文字が第２商品データにある場合に、当該文字を第２属性として取得してもよい。第２属性取得部１０２は、第２商品データの中に、属性データベースＤＢ２に定義された属性の名前と一致する文字があるか否かを判定してもよい。第２属性取得部１０２は、第２商品データの中から、属性データベースＤＢ２に定義された属性の名前と一致する文字を、第２属性として取得してもよい。 For another example, when a predetermined character is present in the second product data, the second attribute acquisition unit 102 may acquire the character as the second attribute. The second attribute acquisition unit 102 may determine whether or not there is a character in the second product data that matches the name of an attribute defined in the attribute database DB2. The second attribute acquisition unit 102 may acquire, from the second product data, a character that matches the name of an attribute defined in the attribute database DB2 as the second attribute.

［３－４．第２属性値取得部］
第２属性値取得部１０３は、第２商品に関する第２属性に関連付けられた第２属性値を取得する。本実施形態では、属性及び属性値の関係が属性データベースＤＢ２に定義されているので、第２属性値取得部１０３は、属性データベースＤＢ２を参照し、第２属性値を取得する。本実施形態では、第２属性取得部１０２により第２属性が取得されるので、第２属性値取得部１０３は、第２属性取得部１０２により取得された第２属性に関連付けられた第２属性値を取得する。なお、属性及び属性値の関係は、属性データベースＤＢ２以外のデータに定義されていてもよい。例えば、属性及び属性値の関係を示すリストがデータ記憶部１００に記憶されていてもよい。 [3-4. Second attribute value acquisition unit]
The second attribute value acquisition unit 103 acquires a second attribute value associated with a second attribute related to a second product. In this embodiment, since the relationship between the attribute and the attribute value is defined in the attribute database DB2, the second attribute value acquisition unit 103 refers to the attribute database DB2 to acquire the second attribute value. In this embodiment, since the second attribute is acquired by the second attribute acquisition unit 102, the second attribute value acquisition unit 103 acquires a second attribute value associated with the second attribute acquired by the second attribute acquisition unit 102. Note that the relationship between the attribute and the attribute value may be defined in data other than the attribute database DB2. For example, a list indicating the relationship between the attribute and the attribute value may be stored in the data storage unit 100.

本実施形態の属性データベースＤＢ２は、訓練データに基づいて作成されているので、属性データベースＤＢ２を参照して第２属性値を取得することは、第２属性値取得部１０３は、質問応答モデルＭ３の訓練データに基づいて、第２属性値を取得することに相当する。訓練データベースＤＢ１にも、属性及び属性値の関係性が定義されているので、第２属性値取得部１０３は、訓練データベースＤＢ１を参照し、第２属性に関連付けられた第２属性値を取得してもよい。 The attribute database DB2 in this embodiment is created based on the training data, so referring to the attribute database DB2 to obtain the second attribute value corresponds to the second attribute value acquisition unit 103 obtaining the second attribute value based on the training data of the question answering model M3. The relationship between the attribute and the attribute value is also defined in the training database DB1, so the second attribute value acquisition unit 103 may refer to the training database DB1 to obtain the second attribute value associated with the second attribute.

第２属性値取得部１０３は、第２属性値の候補に関する辞書データに基づいて、第２属性値を取得してもよい。この辞書データは、訓練データとは異なるデータである。辞書データは、属性値抽出システムＳの管理者が属性及び属性値の関係を手動で定義したデータであってもよいし、電子商取引サービス等で利用されている商品のカテゴリデータを辞書データとして流用してもよい。第２属性値取得部１０３は、辞書データを参照し、第２属性に関連付けられた第２属性値を取得してもよい。辞書データの中に第２属性が存在するか否かは、文字列一致により判定されるものとする。先述したリストは、辞書データの一例であってもよい。 The second attribute value acquisition unit 103 may acquire the second attribute value based on dictionary data on candidates for the second attribute value. This dictionary data is different from the training data. The dictionary data may be data in which the administrator of the attribute value extraction system S manually defines the relationship between the attribute and the attribute value, or product category data used in e-commerce services or the like may be used as the dictionary data. The second attribute value acquisition unit 103 may refer to the dictionary data and acquire the second attribute value associated with the second attribute. Whether or not the second attribute exists in the dictionary data is determined by string matching. The list described above may be an example of dictionary data.

本実施形態では、第２属性値取得部１０３は、第２属性に関連付けられた複数の第２属性値の中から、ランダムに第２属性値を取得する。例えば、第２属性値取得部１０３は、属性データベースＤＢ２を参照し、第２属性に関連付けられた複数の第２属性値を特定する。第２属性値取得部１０３は、当該特定された複数の第２属性値のうちの何れかをランダムに取得する。第２属性値をランダムに選択する方法自体は、種々の方法を利用可能であり、例えば、乱数を利用した方法であってもよい。 In this embodiment, the second attribute value acquisition unit 103 randomly acquires a second attribute value from among a plurality of second attribute values associated with the second attribute. For example, the second attribute value acquisition unit 103 refers to the attribute database DB2 and identifies a plurality of second attribute values associated with the second attribute. The second attribute value acquisition unit 103 randomly acquires one of the identified plurality of second attribute values. Various methods can be used as the method for randomly selecting the second attribute value, and may be, for example, a method that uses random numbers.

［３－５．第３属性値抽出部］
第３属性値抽出部１０４は、第２商品に関する第２商品データ、第２属性値に応じた第２クエリ、及び質問応答モデルＭ３に基づいて、第２商品データから第３属性値を抽出する。本実施形態では、第２クエリに第２属性及び第２属性値が含まれているので、第３属性値抽出部１０４は、第２属性及び第２属性値に応じた第２クエリに基づいて、第３属性値を抽出する。 [3-5. Third attribute value extraction unit]
The third attribute value extraction unit 104 extracts the third attribute value from the second product data based on the second product data related to the second product, the second query corresponding to the second attribute value, and the question answering model M3. In this embodiment, since the second query includes the second attribute and the second attribute value, the third attribute value extraction unit 104 extracts the third attribute value based on the second query corresponding to the second attribute and the second attribute value.

例えば、第３属性値抽出部１０４は、第２商品データ及び第２クエリを結合する。第２商品データには、第２商品データであることを示す語が付与されてもよい。第２クエリには、第２クエリであることを示す語が付与されてもよい。更に、第２クエリの中には、第２属性及び第２属性値の各々であることを示す語が付与されてもよい。第２商品データ及び第２クエリの間には、語の区切りを示す語が配置されてもよい。質問応答モデルＭ３は、当該入力された第２商品データ及び第２クエリの各々の特徴量を計算し、当該計算された特徴量に基づいて、第３属性値を出力する。入力された語の特徴量に応じた属性値を出力する方法自体は、先行技術文献として挙げた非特許文献１、非特許文献２、又はこれらに記載された参考文献を参照されたい。 For example, the third attribute value extraction unit 104 combines the second product data and the second query. The second product data may be assigned a word indicating that it is the second product data. The second query may be assigned a word indicating that it is the second query. Furthermore, the second query may be assigned a word indicating that it is the second attribute and the second attribute value. A word indicating a word separator may be placed between the second product data and the second query. The question answering model M3 calculates the feature amounts of each of the input second product data and the second query, and outputs the third attribute value based on the calculated feature amounts. For the method of outputting the attribute value according to the feature amount of the input word, please refer to Non-Patent Document 1 and Non-Patent Document 2 listed as prior art documents, or reference documents described therein.

本実施形態では、第３属性値抽出部１０４は、質問応答モデルＭ３により第３属性値として推定された第２商品データに含まれる語を、第３属性値として抽出する。質問応答モデルＭ３は、第２商品データのうち、第３属性値として適切な部分を識別する情報を出力する。例えば、質問応答モデルＭ３は、第３属性値として適切な部分の開始部分と終了部分とを出力する。第２商品データの１０文字目から１３文字目が第３属性値として適切だったとすると、質問応答モデルＭ３は、１０文字目が開始部分であることを示す情報と、１３文字目が終了部分であることを示す情報と、を出力する。第３属性値抽出部１０４は、１０文字目から１３文字目の部分を、第３属性値として取得する。 In this embodiment, the third attribute value extraction unit 104 extracts, as the third attribute value, a word included in the second product data estimated as the third attribute value by the question answering model M3. The question answering model M3 outputs information that identifies a portion of the second product data that is appropriate as the third attribute value. For example, the question answering model M3 outputs the start portion and the end portion of the portion that is appropriate as the third attribute value. If the 10th to 13th characters of the second product data are appropriate as the third attribute value, the question answering model M3 outputs information indicating that the 10th character is the start portion and information indicating that the 13th character is the end portion. The third attribute value extraction unit 104 acquires the portion from the 10th to 13th characters as the third attribute value.

本実施形態では、質問応答モデルＭ３により、第３属性値の開始部分と終了部分が出力されるので、第３属性値抽出部１０４は、第２商品データからこれらの開始部分と終了部分で挟まれた部分を、第３属性値として抽出する。質問応答モデルＭ３は、開始部分と終了部分とを出力するのではなく、第２商品データから第３属性値を切り出したうえで出力してもよい。なお、第３属性値は、第２商品データ内の文言として直接的な形で含まれていなくてもよい。質問応答モデルＭ３にクエリとして入力される第２属性に対応する属性値が、エンティティである第２商品データに含まれていなくてもよい。 In this embodiment, the question answering model M3 outputs the start and end parts of the third attribute value, and the third attribute value extraction unit 104 extracts the part between these start and end parts from the second product data as the third attribute value. The question answering model M3 may extract the third attribute value from the second product data and output it, rather than outputting the start and end parts. Note that the third attribute value does not have to be included directly as a wording in the second product data. An attribute value corresponding to the second attribute input as a query to the question answering model M3 does not have to be included in the second product data, which is an entity.

［３－６．第３属性値付与部］
第３属性値付与部１０５は、第３属性値抽出部１０４により抽出された第３属性値を、第２商品に付与する。第２商品に第３属性値を付与するとは、第２商品データに第３属性値を関連付けることである。第３属性値付与部１０５は、質問応答モデルＭ３に入力した第２商品データと、第３属性値抽出部１０４により取得された第３属性値と、を関連付けてデータ記憶部１００に記録することによって、第２商品に第３属性値を付与する。第３属性値は、ユーザによって適否が確認されたうえで、第２商品に付与されてもよい。 [3-6. Third attribute value assignment unit]
The third attribute value assigning unit 105 assigns the third attribute value extracted by the third attribute value extracting unit 104 to the second product. Assigning the third attribute value to the second product means associating the third attribute value with the second product data. The third attribute value assigning unit 105 assigns the third attribute value to the second product by associating the second product data input to the question answering model M3 with the third attribute value acquired by the third attribute value extracting unit 104 and recording them in the data storage unit 100. The third attribute value may be assigned to the second product after the user has confirmed whether it is appropriate.

［４．属性値抽出システムで実行される処理］
図８は、属性値抽出システムＳで実行される処理の一例を示すフロー図である。図８の処理は、サーバ１０の制御部１１が記憶部１２に記憶されたプログラムに従って動作することによって実行される。図８の処理が実行されるにあたり、訓練データベースＤＢ１及び属性データベースＤＢ２が予め作成されているものとする。 [4. Processing Executed in the Attribute Value Extraction System]
Fig. 8 is a flow diagram showing an example of processing executed by the attribute value extraction system S. The processing in Fig. 8 is executed by the control unit 11 of the server 10 operating in accordance with a program stored in the storage unit 12. It is assumed that the training database DB1 and the attribute database DB2 have been created in advance before the processing in Fig. 8 is executed.

図８のように、サーバ１０は、訓練データベースＤＢ１に基づいて、質問応答モデルＭ３の学習処理を実行する（Ｓ１）。Ｓ１では、サーバ１０は、訓練データベースＤＢ１に格納された複数の訓練データの各々の入力部分が入力された場合に、当該訓練データの出力部分が出力されるように、質問応答モデルＭ３のパラメータを調整する。Ｓ１の学習処理が完了すると、ユーザは、学習済みの質問応答モデルＭ３を利用できるようになる。 As shown in FIG. 8, the server 10 executes a learning process of the question-answering model M3 based on the training database DB1 (S1). In S1, the server 10 adjusts the parameters of the question-answering model M3 so that when an input portion of each of the multiple training data stored in the training database DB1 is input, an output portion of the training data is output. When the learning process of S1 is completed, the user becomes able to use the learned question-answering model M3.

サーバ１０は、ユーザ端末２０から、ユーザにより入力された第２商品データ及び第２属性を取得する（Ｓ２）。サーバ１０は、属性データベースＤＢ２に基づいて、Ｓ２で取得した第２属性に関連付けられた複数の属性値の中から、ランダムに第２属性値を取得する（Ｓ３）。サーバ１０は、Ｓ２で取得した第２属性と、Ｓ３で取得した第２属性値と、に基づいて、第２クエリを取得する（Ｓ４）。 The server 10 acquires the second product data and the second attribute input by the user from the user terminal 20 (S2). The server 10 acquires the second attribute value randomly from among the multiple attribute values associated with the second attribute acquired in S2 based on the attribute database DB2 (S3). The server 10 acquires the second query based on the second attribute acquired in S2 and the second attribute value acquired in S3 (S4).

サーバ１０は、Ｓ２で取得した第２商品データ、Ｓ４で取得した第２クエリ、及び質問応答モデルＭ３に基づいて、第２商品データから第３属性値を抽出する（Ｓ５）。Ｓ５では、サーバ１０は、第２商品データ及び第２クエリを質問応答モデルＭ３に入力する。質問応答モデルＭ３は、第２商品データ及び第２クエリに基づいて特徴量を計算し、当該計算された特徴量に応じた第３属性値の開始部分と終了部分を出力する。サーバ１０は、開始部分と終了部分で挟まれた文字列を、第３属性値として抽出する。サーバ１０は、第２商品に、Ｓ５で抽出した第３属性値を付与し（Ｓ６）、本処理は終了する。 The server 10 extracts a third attribute value from the second product data based on the second product data acquired in S2, the second query acquired in S4, and the question answering model M3 (S5). In S5, the server 10 inputs the second product data and the second query to the question answering model M3. The question answering model M3 calculates features based on the second product data and the second query, and outputs the start and end parts of the third attribute value according to the calculated features. The server 10 extracts the character string between the start and end parts as the third attribute value. The server 10 assigns the third attribute value extracted in S5 to the second product (S6), and the process ends.

本実施形態の属性値抽出システムＳによれば、第２商品データ、第２属性値に応じた第２クエリ、及び質問応答モデルＭ３に基づいて、第３属性値を抽出する。質問応答モデルＭ３が第２クエリの意味を認識しやすい第２属性値を利用することによって、質問応答モデルＭ３からの出力の精度が高まる。例えば、「機能」といった抽象的な属性の第２商品だったとしても、適切な第３属性値を出力可能な質問応答モデルＭ３とすることができる。質問応答モデルＭ３からの出力の精度が高まることによって、より適切な第３属性値を第２商品に付与することができる。例えば、従来の質問応答モデルＭ１は、比較的長いクエリが入力されることを想定して作成されているため、従来の質問応答モデルＭ２のように比較的短いクエリが入力されると出力の精度が低下することがある。この点、第２属性値のように、より具体的な意味を有する第２クエリにすることによって、従来の質問応答モデルＭ２よりも精度の高い出力を得ることができる。 According to the attribute value extraction system S of this embodiment, the third attribute value is extracted based on the second product data, the second query corresponding to the second attribute value, and the question-answering model M3. By using the second attribute value that allows the question-answering model M3 to easily recognize the meaning of the second query, the accuracy of the output from the question-answering model M3 is improved. For example, even if the second product has an abstract attribute such as "function", the question-answering model M3 can output an appropriate third attribute value. By improving the accuracy of the output from the question-answering model M3, a more appropriate third attribute value can be assigned to the second product. For example, since the conventional question-answering model M1 is created assuming that a relatively long query is input, the accuracy of the output may decrease when a relatively short query is input as in the conventional question-answering model M2. In this regard, by making the second query have a more specific meaning, such as the second attribute value, it is possible to obtain an output with higher accuracy than the conventional question-answering model M2.

また、属性値抽出システムＳは、第２属性及び第２属性値に応じた第２クエリに基づいて、第３属性値を抽出する。これにより、質問応答モデルＭ３からの出力の精度がより高まる。第２属性を第２クエリに含めることにより、質問応答モデルＭ３から出力された第３属性値がどの属性のものなのかを容易に特定できる。 The attribute value extraction system S also extracts a third attribute value based on the second attribute and a second query corresponding to the second attribute value. This further improves the accuracy of the output from the question answering model M3. By including the second attribute in the second query, it is easy to identify which attribute the third attribute value output from the question answering model M3 belongs to.

また、属性値抽出システムＳは、質問応答モデルＭ３に関係を学習させるための訓練データに基づいて、第２属性値を取得する。これにより、質問応答モデルＭ３に学習させた内容に応じた第２クエリになるので、質問応答モデルＭ３が第２クエリの意味をより認識しやすい第２クエリとすることができる。このため、質問応答モデルＭ３からの出力の精度がより高まる。 The attribute value extraction system S also acquires the second attribute value based on training data for making the question-answering model M3 learn relationships. This results in a second query that corresponds to the content that the question-answering model M3 has been made to learn, making it easier for the question-answering model M3 to recognize the meaning of the second query. This further increases the accuracy of the output from the question-answering model M3.

また、属性値抽出システムＳは、第３属性値の候補に関する辞書データに基づいて、第２属性値を取得する。これにより、辞書データに定義された第２クエリになるので、質問応答モデルＭ３が第２クエリの意味をより認識しやすい第２クエリとすることができる。このため、質問応答モデルＭ３からの出力の精度がより高まる。 The attribute value extraction system S also acquires the second attribute value based on dictionary data related to candidates for the third attribute value. This results in a second query defined in the dictionary data, making it easier for the question-answering model M3 to recognize the meaning of the second query. This further increases the accuracy of the output from the question-answering model M3.

また、属性値抽出システムＳは、第２属性に関連付けられた複数の第２属性値の中から、ランダムに第２属性値を取得する。これにより、第２属性値を取得する処理を簡易化し、質問応答モデルＭ３からの出力を取得するまでの時間を短縮できる。その結果、サーバ１０の処理を高速化し、かつ、サーバ１０の処理負荷を軽減できる。 The attribute value extraction system S also randomly acquires the second attribute value from among a plurality of second attribute values associated with the second attribute. This simplifies the process of acquiring the second attribute value and shortens the time required to acquire the output from the question answering model M3. As a result, the processing speed of the server 10 can be increased and the processing load on the server 10 can be reduced.

また、属性値抽出システムＳは、質問応答モデルＭ３により第３属性値として推定された第２商品データに含まれる語を、第３属性値として抽出する。第２商品データに含まれる語を第３属性値とすることによって、より適切な第３属性値を第２商品に付与できる。 The attribute value extraction system S also extracts, as the third attribute value, a word contained in the second product data that is estimated as the third attribute value by the question answering model M3. By setting the word contained in the second product data as the third attribute value, a more appropriate third attribute value can be assigned to the second product.

また、属性値抽出システムＳは、第２商品データに関連付けられた第２属性を取得し、当該取得された第２属性に関連付けられた第２属性値を取得する。これにより、第２商品の属性に応じた適切な第３属性値を付与できる。 The attribute value extraction system S also acquires a second attribute associated with the second product data, and acquires a second attribute value associated with the acquired second attribute. This allows an appropriate third attribute value to be assigned according to the attribute of the second product.

また、第１アイテムは、第１商品であり、第１アイテムデータは、第１商品に関するデータである。第２アイテムは、第２商品であり、第２アイテムデータは、第２商品に関するデータである。これにより、適切な第３属性値を抽出できる。例えば、オンラインショッピングモールにおけるユーザが第３属性値を指定する手間を省くことができる。例えば、オンラインショッピングモールで販売される商品に誤った属性値が付与されていたとしても、この商品を第２商品として第３属性値を付与することによって、適切な属性値を付与することもできる。 The first item is a first product, and the first item data is data relating to the first product. The second item is a second product, and the second item data is data relating to the second product. This makes it possible to extract an appropriate third attribute value. For example, it is possible to eliminate the need for a user at an online shopping mall to specify a third attribute value. For example, even if an incorrect attribute value is assigned to a product sold at an online shopping mall, it is possible to assign an appropriate attribute value by treating this product as a second product and assigning the third attribute value.

［５．変形例］
本開示は、以上に説明した実施形態に限定されるものではない。本開示の趣旨を逸脱しない範囲で、適宜変更可能である。 5. Modifications
The present disclosure is not limited to the above-described embodiment, and may be modified as appropriate without departing from the spirit and scope of the present disclosure.

［５－１．変形例１］
例えば、第２属性値取得部１０３は、第２属性に関連付けられた複数の第２属性値の中から、相対的に頻度が高い第２属性値を取得してもよい。頻度は、商品データにおいて属性値が登場（出現）する頻度である。例えば、サーバ１０は、オンラインショッピングモールで販売されている商品の商品データのうち、個々の属性及び属性値が関連付けられた数を集計する。サーバ１０は、当該集計結果に基づいて、個々の第２属性値の頻度を取得する。頻度の集計方法は、任意の方法であってよく、例えば、訓練データベースＤＢ１の中での頻度であってもよいし、オンラインショッピングモールの全商品の商品データベースの中での頻度であってもよい。 [5-1. Modification 1]
For example, the second attribute value acquisition unit 103 may acquire a second attribute value with a relatively high frequency from among a plurality of second attribute values associated with the second attribute. The frequency is the frequency with which an attribute value appears (occurs) in product data. For example, the server 10 counts the number of times each attribute and attribute value are associated with each other in the product data of products sold in an online shopping mall. The server 10 acquires the frequency of each second attribute value based on the counting result. The method of counting the frequency may be any method, and may be, for example, the frequency in the training database DB1 or the frequency in the product database of all products in the online shopping mall.

相対的に頻度が高い第２属性値とは、自身よりも頻度が低い他の第２属性値が１つでも存在する第２属性値である。相対的に頻度が高い第２属性値は、頻度が最も低い第２属性値以外の第２属性値ということもできる。例えば、４つの第２属性値があったとすると、頻度が最も高い第２属性値、頻度が２番目に高い第２属性値、及び頻度が３番目に高い第２属性値は、相対的に頻度が高い第２属性値に相当する。 A second attribute value with a relatively high frequency is a second attribute value for which there is at least one other second attribute value with a lower frequency than itself. A second attribute value with a relatively high frequency can also be defined as a second attribute value other than the second attribute value with the lowest frequency. For example, if there are four second attribute values, the second attribute value with the highest frequency, the second attribute value with the second highest frequency, and the second attribute value with the third highest frequency correspond to second attribute values with a relatively high frequency.

変形例１では、第２属性値取得部１０３が、複数の第２属性値の中から、最も頻度が高い第２属性値を取得する場合を説明するが、第２属性値取得部１０３は、相対的に頻度が高い第２属性値を取得すればよく、頻度が２番目以降の第２属性値を取得してもよい。実施形態及び変形例１を組み合わせて、第２属性値取得部１０３は、相対的に頻度が高い第２属性値の中から、ランダムに第２属性値を取得してもよい。 In the first modification, a case is described in which the second attribute value acquisition unit 103 acquires the second attribute value with the highest frequency from among a plurality of second attribute values, but the second attribute value acquisition unit 103 only needs to acquire a second attribute value with a relatively high frequency, and may acquire a second attribute value with a frequency that is second or higher. By combining the embodiment and the first modification, the second attribute value acquisition unit 103 may randomly acquire a second attribute value from among the second attribute values with a relatively high frequency.

先述した４つの第２属性値の例のように、相対的に頻度が高い第２属性値が３つあったとすると、第２属性値取得部１０３は、相対的に頻度が高い３つの第２属性値の中から、ランダムに第２属性値を取得してもよい。この場合、頻度が２番目又は３番目に高い第２属性値が取得される可能性がある。第２属性値取得部１０３が第２属性値を取得した後の処理は、実施形態と同様であり、当該第２属性値に応じた第２クエリが取得され、質問応答モデルＭ３による推定が行われる。 Assuming that there are three second attribute values with relatively high frequencies, as in the example of four second attribute values described above, the second attribute value acquisition unit 103 may randomly acquire a second attribute value from among the three second attribute values with relatively high frequencies. In this case, the second or third most frequently occurring second attribute value may be acquired. The processing after the second attribute value acquisition unit 103 acquires the second attribute value is the same as in the embodiment, and a second query corresponding to the second attribute value is acquired and estimation is performed by the question answering model M3.

変形例１によれば、第２属性に関連付けられた複数の第２属性値の中から、相対的に頻度が高い第２属性値を取得する。相対的に頻度が高い第２属性値は、第２属性の意味が適切に表現されていることが多いので、質問応答モデルＭ３が第２クエリの意味を認識しやすい語である確率が高い。相対的に頻度が高い第２属性値に応じた第２クエリとすることによって、質問応答モデルＭ３がより意味を理解しやすい第２クエリになるので、質問応答モデルＭ３からの出力の精度がより高まる。 According to the first variant example, a second attribute value with a relatively high frequency is obtained from among a plurality of second attribute values associated with the second attribute. A second attribute value with a relatively high frequency often appropriately expresses the meaning of the second attribute, and therefore there is a high probability that the second query is a word that makes it easy for the question-answering model M3 to recognize the meaning of the second query. By making the second query correspond to a second attribute value with a relatively high frequency, the question-answering model M3 becomes a second query whose meaning is easier to understand, and the accuracy of the output from the question-answering model M3 is further improved.

また、複数の第２属性値の中から、最も頻度が高い第２属性値が取得される。最も頻度が高い第２属性値は、第２属性の意味がより適切に表現されていることが多いので、質問応答モデルＭ３が第２クエリの意味を認識しやすい語である確率がより高い。最も頻度が高い第２属性値に応じた第２クエリとすることによって、質問応答モデルＭ３がより意味を理解しやすい第２クエリになるので、質問応答モデルＭ３からの出力の精度がより高まる。 In addition, the most frequent second attribute value is obtained from among the multiple second attribute values. The most frequent second attribute value often more appropriately expresses the meaning of the second attribute, and therefore there is a higher probability that the second query is a word that makes it easier for the question-answering model M3 to recognize the meaning of the second query. By setting the second query according to the most frequent second attribute value, the question-answering model M3 becomes a second query whose meaning is easier to understand, and the accuracy of the output from the question-answering model M3 is further improved.

［５－２．変形例２］
例えば、第２属性値取得部１０３は、複数の第２属性値を取得してもよい。変形例２では、複数の第２属性値に応じた第２クエリが質問応答モデルＭ３に入力される。例えば、第２クエリは、第２属性と、複数の第２属性値と、を含む。例えば、第２クエリは、第２属性と、複数の第２属性値と、が結合されたテキストになる。個々の第２属性値には、実施形態と同様に、第２属性値であることを示す特別な語が付与されてもよい。更に、第２属性値の間には、区切りを示す特別な語が配置されてもよい。 [5-2. Modification 2]
For example, the second attribute value acquisition unit 103 may acquire a plurality of second attribute values. In the second modification example, a second query corresponding to a plurality of second attribute values is input to the question answering model M3. For example, the second query includes the second attribute and a plurality of second attribute values. For example, the second query is a text in which the second attribute and the plurality of second attribute values are combined. As in the embodiment, each second attribute value may be given a special word indicating that it is a second attribute value. Furthermore, a special word indicating a separator may be placed between the second attribute values.

変形例２では、第２属性値取得部１０３が、第２属性に関連付けられた複数の第２属性値の全てを取得する場合を説明するが、第２属性値取得部１０３は、第２属性に関連付けられた複数の第２属性値のうちの一部を取得してもよい。例えば、４つの第２属性値があったとすると、第２属性値取得部１０３は、４つの第２属性値のうちの２つ又は３つを取得してもよい。この場合、実施形態及び変形例２を組み合わせて、第２属性値取得部１０３は、４つの第２属性値の中から、ランダムに２つ又は３つの第２属性値を取得してもよい。他にも例えば、実施形態及び変形例１を組み合わせて、第２属性値取得部１０３は、４つの第２属性値の中から、頻度が高い順に２つ又は３つの第２属性値を取得してもよい。 In the second modification, the second attribute value acquisition unit 103 acquires all of the second attribute values associated with the second attribute, but the second attribute value acquisition unit 103 may acquire a part of the second attribute values associated with the second attribute. For example, if there are four second attribute values, the second attribute value acquisition unit 103 may acquire two or three of the four second attribute values. In this case, by combining the embodiment and the second modification, the second attribute value acquisition unit 103 may randomly acquire two or three second attribute values from among the four second attribute values. As another example, by combining the embodiment and the first modification, the second attribute value acquisition unit 103 may acquire two or three second attribute values from among the four second attribute values in order of frequency.

第３属性値抽出部１０４は、複数の第２属性値に応じた第２クエリに基づいて、第３属性値を抽出する。変形例２では、第２属性に関連付けられた複数の第２属性値の全てが取得されるので、第３属性値抽出部１０４は、複数の第２属性値の全てに応じた第２クエリに基づいて、第３属性値を抽出する。例えば、第３属性値抽出部１０４は、第２商品データと、第２属性値取得部１０３が取得した複数の第２属性値の全てを含む第２クエリと、を質問応答モデルＭ３に入力する。第２クエリが実施形態よりも長くなる可能性がある点で実施形態とは異なるが、質問応答モデルＭ３の処理自体は、実施形態と同様である。変形例２では、訓練データの入力部分に複数の属性値が含まれていてもよい。即ち、変形例２では、複数の第２属性値が入力部分として学習された質問応答モデルＭ３が利用されてもよい。 The third attribute value extraction unit 104 extracts the third attribute value based on the second query corresponding to the multiple second attribute values. In the second modification, all of the multiple second attribute values associated with the second attribute are acquired, so the third attribute value extraction unit 104 extracts the third attribute value based on the second query corresponding to all of the multiple second attribute values. For example, the third attribute value extraction unit 104 inputs the second product data and the second query including all of the multiple second attribute values acquired by the second attribute value acquisition unit 103 to the question answering model M3. This differs from the embodiment in that the second query may be longer than in the embodiment, but the processing of the question answering model M3 itself is the same as in the embodiment. In the second modification, the input portion of the training data may include multiple attribute values. That is, in the second modification, a question answering model M3 in which multiple second attribute values are learned as the input portion may be used.

変形例２によれば、複数の第２属性値に応じた第２クエリに基づいて、第３属性値を抽出する。これにより、第２クエリの情報量が増えるので、質問応答モデルＭ３が意味をより認識しやすい第２クエリとすることができる。複数の第２属性値を利用することによって、第２クエリの拡張を効果的に行うことができるので、質問応答モデルＭ３からの出力の精度がより高まる。 According to the second modification, a third attribute value is extracted based on a second query corresponding to a plurality of second attribute values. This increases the amount of information in the second query, making it possible to make the second query easier for the question-answering model M3 to recognize the meaning of. By utilizing a plurality of second attribute values, the second query can be effectively expanded, thereby further increasing the accuracy of the output from the question-answering model M3.

また、属性に関連付けられた複数の第２属性値の全てを取得し、複数の第２属性値の全てに応じた第２クエリに基づいて、第３属性値を抽出する。これにより、第２クエリの情報量が増えるので、質問応答モデルＭ３が意味をより認識しやすい第２クエリとすることができる。複数の第２属性値の全てを利用することによって、第２クエリの拡張をより効果的に行うことができるので、質問応答モデルＭ３からの出力の精度がより高まる。 In addition, all of the multiple second attribute values associated with the attribute are obtained, and the third attribute value is extracted based on a second query corresponding to all of the multiple second attribute values. This increases the amount of information in the second query, making it possible to make the second query easier for the question-answering model M3 to recognize the meaning of. By utilizing all of the multiple second attribute values, the second query can be expanded more effectively, thereby improving the accuracy of the output from the question-answering model M3.

［５－３．変形例３］
例えば、変形例２のように複数の第２属性値が取得された場合に、第３属性値抽出部１０４は、複数の第２属性値の各々に関する特徴量の平均を計算し、当該計算された平均に基づいて、第３属性値を抽出してもよい。特徴量は、語の特徴に関する情報である。変形例３では、特徴量がいわゆる埋め込みベクトルである場合を説明するが、特徴量は、ベクトル形式に限られず、任意の形式で表現可能である。例えば、特徴量は、単一の数値又は配列といった他の形式であってもよい。 [5-3. Modification 3]
For example, when a plurality of second attribute values are acquired as in the second modification, the third attribute value extraction unit 104 may calculate an average of the feature quantities for each of the second attribute values, and extract the third attribute value based on the calculated average. The feature quantity is information relating to the features of a word. In the third modification, a case will be described in which the feature quantity is a so-called embedded vector, but the feature quantity is not limited to a vector format and can be expressed in any format. For example, the feature quantity may be in another format such as a single numerical value or an array.

図９～図１１は、複数の第２属性値の各々に関する特徴量の平均を計算する処理の一例を示す図である。図９のように、質問応答モデルＭ３には、第２商品データと、第２属性及び複数の第２属性値を含む第２クエリと、が入力される。質問応答モデルＭ３の埋め込みレイヤは、入力された語の特徴量を計算する。埋め込みレイヤによる特徴量の計算方法自体は、従来における質問応答モデルＭ１及び質問応答モデルＭ２と同様である。 Figures 9 to 11 are diagrams showing an example of a process for calculating the average of features for each of multiple second attribute values. As shown in Figure 9, second product data and a second query including a second attribute and multiple second attribute values are input to the question answering model M3. The embedding layer of the question answering model M3 calculates the features of the input words. The method of calculating the features by the embedding layer is the same as that of the conventional question answering model M1 and question answering model M2.

第３属性値抽出部１０４は、埋め込みレイヤにより計算された特徴量の平均を計算する。例えば、第３属性値抽出部１０４は、複数の第２属性値の各々に含まれる語ごとに平均を計算し、当該語ごとに計算された平均に基づいて、第３属性値を抽出してもよい。図１０の例では、第２属性「色」に対し、第２属性値「dark blue」と、第２属性値「brilliant yellow」と、の２つの第２属性値が取得されたものとする。 The third attribute value extraction unit 104 calculates the average of the feature quantities calculated by the embedding layer. For example, the third attribute value extraction unit 104 may calculate the average for each word included in each of the multiple second attribute values, and extract the third attribute value based on the average calculated for each word. In the example of FIG. 10, it is assumed that two second attribute values, the second attribute value "dark blue" and the second attribute value "brilliant yellow", are obtained for the second attribute "color".

第３属性値抽出部１０４は、第２属性値「dark blue」の１語目である「dark」の特徴量と、第２属性値「brilliant yellow」の１語目である「brilliant」の特徴量と、の平均を、１語目の平均として計算する。第３属性値抽出部１０４は、第２属性値「dark blue」の２語目である「blue」の特徴量と、第２属性値「brilliant yellow」の２語目である「yellow」の特徴量と、の平均を、２語目の平均として計算する。 The third attribute value extraction unit 104 calculates the average of the feature amount of "dark", the first word of the second attribute value "dark blue", and the feature amount of "brilliant", the first word of the second attribute value "brilliant yellow", as the average of the first word. The third attribute value extraction unit 104 calculates the average of the feature amount of "blue", the second word of the second attribute value "dark blue", and the feature amount of "yellow", the second word of the second attribute value "brilliant yellow", as the average of the second word.

第３属性値抽出部１０４は、第２属性値「dark blue」と、第２属性値「brilliant yellow」と、の各々に含まれる語ごとの特徴量の平均を計算し、当該平均に基づいて、第３属性値を抽出する。３語以上の第２属性値が質問応答モデルＭ３に入力された場合も、第３属性値抽出部１０４は、図１０と同様にして、語ごとに特徴量の平均を計算し、当該平均に応じた出力を取得すればよい。３つ以上の第２属性値が質問応答モデルＭ３に入力された場合も、第３属性値抽出部１０４は、図１０と同様にして、語ごとに、３つ以上の第２属性値の特徴量の平均を計算し、当該平均に基づいて、第３属性値を抽出すればよい。 The third attribute value extraction unit 104 calculates the average of the feature amounts for each word included in each of the second attribute values "dark blue" and "brilliant yellow", and extracts the third attribute value based on the average. When three or more second attribute values are input to the question answering model M3, the third attribute value extraction unit 104 may also calculate the average of the feature amounts for each word in the same manner as in FIG. 10, and obtain an output according to the average. When three or more second attribute values are input to the question answering model M3, the third attribute value extraction unit 104 may also calculate the average of the feature amounts for three or more second attribute values for each word in the same manner as in FIG. 10, and extract the third attribute value based on the average.

なお、複数の第２属性値には、語の数が互いに異なる第２属性値の組み合わせが含まれていることがある。例えば、ある第２属性値は、１つの語だけを含み、他の第２属性値は、２つの語を含むといったことがある。この場合、出力部は、複数の第２属性値のうち、相対的に語の数が少ない第２属性値に関するパディングに基づいて、語ごとの平均を計算してもよい。パディングは、予め定められた文字である。図１１の例では、第２属性「色」に対し、第２属性値「dark blue」と、第２属性値「red」と、の２つの第２属性値が取得されたものとする。第２属性値「dark blue」は２語であるが、第２属性値「red」は１語である。 Note that the multiple second attribute values may include combinations of second attribute values with different numbers of words. For example, one second attribute value may include only one word, and another second attribute value may include two words. In this case, the output unit may calculate the average for each word based on padding for a second attribute value that has a relatively small number of words among the multiple second attribute values. The padding is a predetermined character. In the example of FIG. 11, it is assumed that two second attribute values, the second attribute value "dark blue" and the second attribute value "red", are acquired for the second attribute "color". The second attribute value "dark blue" is two words, while the second attribute value "red" is one word.

第３属性値抽出部１０４は、第２属性値「dark blue」の１語目である「dark」の特徴量と、第２属性値「red」の１語目である「red」の特徴量と、の平均を、１語目の平均として計算する。第２属性値「red」には２語目が存在しないので、第３属性値抽出部１０４は、第２属性値「dark blue」の２語目である「blue」の特徴量と、所定のパディングである「pad」の特徴量と、の平均を、２語目の平均として計算する。このように、第３属性値抽出部１０４は、第２属性値「red」に対し、語の数が最大である第２属性値「dark blue」の語の数に足りない分のパディングを補填したうえで特徴量の平均を計算し、当該平均に基づいて、第３属性値を抽出する。 The third attribute value extraction unit 104 calculates the average of the feature amount of "dark", which is the first word of the second attribute value "dark blue", and the feature amount of "red", which is the first word of the second attribute value "red", as the average of the first word. Since there is no second word in the second attribute value "red", the third attribute value extraction unit 104 calculates the average of the feature amount of "blue", which is the second word of the second attribute value "dark blue", and the feature amount of "pad", which is a predetermined padding, as the average of the second word. In this way, the third attribute value extraction unit 104 calculates the average of the feature amounts for the second attribute value "red" after compensating for the padding that is insufficient to the number of words in the second attribute value "dark blue", which has the largest number of words, and extracts the third attribute value based on this average.

変形例３によれば、複数の第２属性値の各々に関する特徴量の平均を計算し、当該計算された平均に応じた第２クエリに基づいて、第３属性値を抽出する。これにより、複数の第２属性値の特徴を平均化することによって、第２クエリの拡張を効果的に行うことができるので、質問応答モデルＭ３からの出力の精度がより高まる。 According to the third modification, the average of the feature quantities for each of the multiple second attribute values is calculated, and the third attribute value is extracted based on the second query corresponding to the calculated average. This makes it possible to effectively expand the second query by averaging the features of the multiple second attribute values, thereby further improving the accuracy of the output from the question-answering model M3.

また、複数の第２属性値の各々に含まれる語ごとに平均を計算し、当該語ごとに計算された平均に応じた第２クエリに基づいて、第３属性値を抽出する。これにより、複数の第２属性値の語ごとの特徴を平均化したうえで、質問応答モデルＭ３からの出力を取得できるので、質問応答モデルＭ３からの出力の精度がより高まる。例えば、特定の語数の第３属性値を第２商品に付与できるようになる。 In addition, an average is calculated for each word contained in each of the multiple second attribute values, and a third attribute value is extracted based on a second query corresponding to the average calculated for each word. This makes it possible to average the features for each word of the multiple second attribute values before obtaining an output from the question answering model M3, thereby further improving the accuracy of the output from the question answering model M3. For example, it becomes possible to assign a specific number of words of a third attribute value to a second product.

また、複数の第２属性値には、語の数が互いに異なる第２属性値の組み合わせが含まれている場合に、複数の第２属性値のうち、相対的に語の数が少ない第２属性値に関するパディングに基づいて、語ごとの平均を計算する。これにより、語の数が互いに異なる第２属性値だったとしても、質問応答モデルＭ３からの出力を取得できるので、質問応答モデルＭ３からの出力の精度がより高まる。例えば、複数の第２属性値のうち、最大の語の数の第３属性値を第２商品に付与できるようになる。 In addition, when the multiple second attribute values include a combination of second attribute values with different numbers of words, the average for each word is calculated based on the padding for the second attribute value with a relatively small number of words among the multiple second attribute values. This makes it possible to obtain output from the question answering model M3 even if the second attribute values have different numbers of words, thereby further improving the accuracy of the output from the question answering model M3. For example, it becomes possible to assign to the second product the third attribute value with the greatest number of words among the multiple second attribute values.

［５－４．変形例４］
例えば、第２商品に関連付けられた第２属性が複数存在する場合には、第２属性値取得部１０３は、複数の第２属性の各々に関連付けられた第２属性値を取得してもよい。個々の第２属性に関連付けられた第２属性値を取得する処理は、実施形態で説明した通りである。第２属性値取得部１０３は、第２属性ごとに、実施形態で説明した処理を実行し、第２属性値を取得する。第２属性値取得部１０３は、第２属性値ごとに、変形例１～３の何れかで説明した処理を実行し、第２属性値を取得してもよい。 [5-4. Modification 4]
For example, when there are multiple second attributes associated with the second product, the second attribute value acquisition unit 103 may acquire second attribute values associated with each of the multiple second attributes. The process of acquiring second attribute values associated with each second attribute is as described in the embodiment. The second attribute value acquisition unit 103 executes the process described in the embodiment for each second attribute to acquire a second attribute value. The second attribute value acquisition unit 103 may execute the process described in any of the modified examples 1 to 3 for each second attribute value to acquire a second attribute value.

第３属性値抽出部１０４は、第２属性ごとに、第３属性値を抽出する。個々の第２属性に応じた第３属性値を抽出する処理は、実施形態で説明した通りである。第３属性値付与部１０５は、第２属性ごとに、第２商品に第３属性値を付与する。個々の第２属性に応じた第３属性値を付与する処理は、実施形態で説明した通りである。例えば、第３属性値付与部１０５は、「ブランド」、「色」、「素材」、「サイズ」、「機能」といった第２属性ごとに、第２商品に第３属性値を付与してもよい。 The third attribute value extraction unit 104 extracts a third attribute value for each second attribute. The process of extracting third attribute values according to each second attribute is as described in the embodiment. The third attribute value assignment unit 105 assigns a third attribute value to the second product for each second attribute. The process of assigning a third attribute value according to each second attribute is as described in the embodiment. For example, the third attribute value assignment unit 105 may assign a third attribute value to the second product for each second attribute such as "brand", "color", "material", "size" and "function".

変形例４によれば、複数の第２属性の各々に関連付けられた第２属性値を取得し、第２属性ごとに、第３属性値を抽出する。これにより、複数の第２属性が関連付けられた第２商品だったとしても、第２属性ごとに、第３属性値を抽出できる。その結果、例えば、ユーザの負担をより軽減できる。 According to the fourth modification, a second attribute value associated with each of a plurality of second attributes is obtained, and a third attribute value is extracted for each second attribute. This makes it possible to extract a third attribute value for each second attribute even if a second product has a plurality of second attributes associated therewith. As a result, for example, the burden on the user can be further reduced.

例えば、サーバ１０で実現されるものとして説明した機能は、ユーザ端末２０等の他のコンピュータで実現されてもよいし、複数のコンピュータで分担されてもよい。例えば、データ記憶部１００に記憶されるものとしたデータは、データベースサーバに記憶されていてもよい。 For example, the functions described as being realized by the server 10 may be realized by another computer such as the user terminal 20, or may be shared among multiple computers. For example, data stored in the data storage unit 100 may be stored in a database server.

Ｓ属性値抽出システム、Ｎネットワーク、Ｐ商品ページ、１０サーバ、１１，２１制御部、１２，２２記憶部、１３，２３通信部、２０ユーザ端末、２４操作部、２５表示部、Ｍ１，Ｍ２，Ｍ３質問応答モデル、１００データ記憶部、１０１学習部、１０２第２属性取得部、１０３第２属性値取得部、１０４第３属性値抽出部、１０５第３属性値付与部、ＤＢ１訓練データベース、ＤＢ２属性データベース。 S Attribute value extraction system, N Network, P Product page, 10 Server, 11, 21 Control unit, 12, 22 Memory unit, 13, 23 Communication unit, 20 User terminal, 24 Operation unit, 25 Display unit, M1, M2, M3 Question answering model, 100 Data memory unit, 101 Learning unit, 102 Second attribute acquisition unit, 103 Second attribute value acquisition unit, 104 Third attribute value extraction unit, 105 Third attribute value assignment unit, DB1 Training database, DB2 Attribute database.

Claims

a storage unit that stores an attribute value extraction model in which a relationship between a first item data item and a first query item related to a first item and a first attribute value related to the first item is learned;
a second attribute value acquisition unit that acquires the second attribute value by referring to data in which a second attribute related to a second item and a second attribute value are associated with each other ;
a third attribute value extraction unit that extracts a third attribute value related to the second attribute from the second item data based on second item data related to the second item, a second query including the second attribute and the second attribute value, and the attribute value extraction model;
An attribute value extraction system comprising:

The second attribute value acquisition unit acquires the second attribute value based on training data of the attribute value extraction model.
The attribute value extraction system according to claim 1 .

the second attribute value acquisition unit acquires the second attribute value based on dictionary data regarding candidates for the second attribute value;
3. The attribute value extraction system according to claim 1 or 2 .

the second attribute value acquisition unit randomly acquires the second attribute value from among a plurality of second attribute values associated with the second attribute;
4. The attribute value extraction system according to claim 1 .

the second attribute value acquisition unit acquires the second attribute value having a relatively high frequency from among the plurality of second attribute values associated with the second attribute;
5. The attribute value extraction system according to claim 1 .

the second attribute value acquisition unit acquires the second attribute value that occurs most frequently from among the plurality of second attribute values;
The attribute value extraction system according to claim 5 .

the second attribute value acquisition unit acquires a plurality of the second attribute values;
the second query includes the plurality of second attribute values;
7. An attribute value extraction system according to claim 1 .

the second attribute value acquisition unit acquires all of the second attribute values associated with the second attribute;
the second query includes all of the plurality of second attribute values;
The attribute value extraction system according to claim 7 .

the third attribute value extraction unit calculates an average of the feature amounts calculated by the attribute value extraction model for each of the plurality of second attribute values, and extracts the third attribute value based on the calculated average;
9. The attribute value extraction system according to claim 7 or 8 .

the third attribute value extraction unit calculates the average for each word included in each of the plurality of second attribute values, and extracts the third attribute value based on the average calculated for each word.
The attribute value extraction system according to claim 9 .

The plurality of second attribute values include combinations of the second attribute values each having a different number of the words;
the third attribute value extraction unit calculates the average for each of the words based on padding for the second attribute value having a relatively small number of the words among the plurality of second attribute values;
The attribute value extraction system according to claim 10 .

the third attribute value extraction unit extracts, as the third attribute value, a word included in the second item data estimated as the third attribute value by the attribute value extraction model;
An attribute value extraction system according to any one of claims 1 to 11 .

the attribute value extraction system includes a second attribute acquisition unit that acquires the second attribute designated by a user ;
the second attribute value acquisition unit acquires the second attribute value associated with the second attribute acquired by the second attribute acquisition unit;
An attribute value extraction system according to any one of claims 1 to 12 .

the second attribute value acquisition unit acquires the second attribute value associated with each of the plurality of second attributes;
the third attribute value extraction unit extracts the third attribute value for each of the second attributes;
An attribute value extraction system according to any one of claims 1 to 13 .

the first item is a first commodity;
the first item data is first product data relating to the first product;
the second item is a second commodity;
The second item data is second product data related to the second product.
An attribute value extraction system according to any one of claims 1 to 14 .

The computer
a second attribute value acquisition step of acquiring the second attribute value by referring to data in which a second attribute related to a second item and a second attribute value are associated with each other ;
a third attribute value extraction step of extracting the third attribute value related to the second attribute from the second item data based on second item data related to the second item, a second query including the second attribute and the second attribute value, and an attribute value extraction model in which a relationship between first item data related to a first item, the first query, and the first attribute value related to the first item is learned;
The attribute value extraction method to perform .

a second attribute value acquisition unit that acquires the second attribute value by referring to data in which a second attribute related to a second item and a second attribute value are associated with each other ;
a third attribute value extraction unit that extracts the third attribute value related to the second attribute from the second item data based on an attribute value extraction model in which a relationship between first item data related to a first item and the first query and a first attribute value related to the first item is learned based on second item data related to the second item and a second query including the second attribute and the second attribute value;
A program that makes a computer function as a