JP7614738B2

JP7614738B2 - SEARCH DEVICE AND SEARCH METHOD, LEARNING DEVICE AND LEARNING METHOD, QUESTION ANSWER PREDICTION SYSTEM AND QUESTION ANSWER PREDICTION METHOD, AND PROGRAM

Info

Publication number: JP7614738B2
Application number: JP2020085091A
Authority: JP
Inventors: 隆人小柳; 小太郎山本; 健太郎鈴木; 涼君塚
Original assignee: NTT Comware Corp
Current assignee: NTT Comware Corp
Priority date: 2020-05-14
Filing date: 2020-05-14
Publication date: 2025-01-16
Anticipated expiration: 2040-05-14
Also published as: JP2021179827A

Description

本発明は、検索装置および検索方法、学習装置および学習方法、質問回答予測システムおよび質問回答予測方法、並びにプログラムに関する。 The present invention relates to a search device and a search method, a learning device and a learning method, a question answer prediction system and a question answer prediction method, and a program.

従来より、質問に対して回答を得るための技術として、「全文検索技術」と「機械読解技術」とが知られている。全文検索技術は、質問中の単語の、文書中の出現頻度に基づいて、質問にマッチする文書を検索する技術である。機械読解技術は、質問、回答、および文献のセットを学習データとして学習し、質問が入力された場合に回答が含まれる文献中の区間を出力する技術である。 Full-text search technology and machine reading comprehension technology are well known as technologies for obtaining answers to questions. Full-text search technology searches for documents that match a question based on the frequency of occurrence of words in the question within the document. Machine reading comprehension technology learns from a set of questions, answers, and documents as training data, and when a question is input, outputs the section of the document that contains the answer.

全文検索技術を用いた文書検索装置は、例えば、特許文献１に記載されている。この文書検索装置は、ユーザによって入力された検索条件に基づいて、文書データベースに記憶されている複数の文書に対する全文検索を行い、検索条件に一致する複数の文書を抽出する。 A document search device using full-text search technology is described in, for example, Patent Document 1. This document search device performs a full-text search on multiple documents stored in a document database based on search conditions input by a user, and extracts multiple documents that match the search conditions.

機械読解技術を用いた回答学習装置は、例えば、特許文献２に記載されている。この回答学習装置は、入力された文章及び質問文に基づいて、当該文章における当該質問文に対する回答の根拠となる範囲を推定するための予め学習された読解モデルを用いて、当該範囲の始端及び終端を推定する機械読解部を備える。 An answer learning device using machine reading technology is described in, for example, Patent Document 2. This answer learning device includes a machine reading unit that uses a pre-trained reading model to estimate the range in the text that is the basis for the answer to the question, based on the input text and question, and estimates the start and end of the range.

特開２０１９－８９６３４号公報JP 2019-89634 A 特開２０２０－０６１１７３号公報JP 2020-061173 A

全文検索技術は多数の文献から所望の文献を抽出する技術であるが、全文検索技術により抽出した文献中に質問の回答が含まれている場合でも、当該文献を抽出した理由は当該文献中に質問中の単語が存在しただけであり、その他の理由を提示することができない。 Full-text search technology is a technology that extracts desired documents from a large number of documents, but even if the answer to a question is contained in the documents extracted using full-text search technology, the reason for extracting the document is simply that the words in the question were present in the document, and no other reason can be presented.

機械読解技術は、質問、文献および回答を学習データとして学習するので、学習データに質問および文献が存在する場合には、文献中で質問の回答に対応した区間を抽出することができる。しかし、ユーザの質問に対する回答を含む文献が無い場合、回答を得ることができない。 Machine reading comprehension technology learns from questions, documents, and answers as training data, so if the question and document are present in the training data, it is possible to extract the section of the document that corresponds to the answer to the question. However, if there is no document that contains the answer to the user's question, the answer cannot be obtained.

本発明は、上記の課題に鑑みてなされたものであって、高い精度で質問に対する回答を出力することができる検索装置および検索方法、学習装置および学習方法、質問回答予測システムおよび質問回答予測方法、並びにプログラムを提供することを目的としている。 The present invention has been made in consideration of the above problems, and aims to provide a search device and search method, a learning device and learning method, a question answer prediction system and question answer prediction method, and a program that can output answers to questions with high accuracy.

（１）本発明の一態様は、学習用質問と当該学習用質問に対する学習用回答を含む学習用文献を特徴づける単語である特徴語とのセットを学習データとして、前記学習用質問に対応する特徴語を生成するように学習した生成モデルを有し、当該生成モデルを用いて、入力される質問に対応する特徴語を生成する特徴語生成部と、入力される前記質問に含まれる単語に基づいて当該単語に合致する単語を含む文献を文献データベースから検索する処理と、前記特徴語生成部により生成された特徴語に基づいて当該特徴語に合致する単語を含む文献を前記文献データベースから検索する処理とを実行する検索部と、学習用質問、当該学習用質問に対する学習用回答を含む学習用文献、および当該学習用回答のセットを学習データとして、当該学習用質問と当該学習用文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するように学習した機械読解モデルを有し、前記機械読解モデルを用いて、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力する機械読解部と、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを前記機械読解部から取得し、各回答候補の尤度を示すスコアの比較に基づいて当該質問に対する回答を予測した結果を出力する予測部と、を備える、検索装置である。
(1) One aspect of the present invention is a system for generating feature words corresponding to training questions by using a set of training questions and feature words that characterize training documents containing training answers to the training questions as training data, the system including a feature word generation unit that generates feature words corresponding to the training questions using the training model; a search unit that executes a process of searching a literature database for documents containing words that match a word included in the input question based on a word included in the input question, and a process of searching the literature database for documents containing words that match the feature words based on the feature words generated by the feature word generation unit; and a search unit that executes a search function that generates answer candidates corresponding to the training questions and the training documents and likelihoods of the answer candidates by using a set of training questions, training documents containing training answers to the training questions, and the training answers as training data. a machine reading comprehension unit that has a machine reading comprehension model trained to output a core, and uses the machine reading comprehension model to output answer candidates corresponding to an input question and documents searched based on words contained in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and a score indicating the likelihood of the answer candidates; and a prediction unit that obtains from the machine reading comprehension unit answer candidates corresponding to the input question and documents searched based on words contained in the question and a score indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and a score indicating the likelihood of the answer candidates, and outputs a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate.

（２）本発明の一態様は、上記の検索装置であって、前記予測部は、前記機械読解モデルにより出力された、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補と、当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補とが同じである場合において、当該同じ回答候補のそれぞれの尤度を示すスコアが閾値を超える場合、当該同じ回答候補同士のスコアを合計してよい。
( 2 ) One aspect of the present invention is the above-mentioned search device, wherein when an answer candidate output by the machine reading comprehension model corresponding to the input question and literature searched based on words contained in the question is the same as an answer candidate corresponding to the question and literature searched based on feature words corresponding to the question, and when a score indicating the likelihood of each of the same answer candidates exceeds a threshold, the prediction unit may add up the scores of the same answer candidates.

（３）本発明の一態様は、学習用質問と当該学習用質問に対する学習用回答を含む学習用文献を特徴づける単語である特徴語とのセットを学習データとして、前記学習用質問に対応する特徴語を生成するように学習した生成モデルを生成する第１の学習部と、学習用質問、当該学習用質問に対する学習用回答を含む学習用文献、および当該学習用回答のセットを学習データとして、当該学習用質問と当該学習用文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するように学習した機械読解モデルを生成する第２の学習部と、を備える、学習装置である。
(3) One aspect of the present invention is a learning device that includes a first learning unit that generates a generative model trained to generate feature words corresponding to a training question using a set of training questions and feature words that are words that characterize training literature including training answers to the training questions as training data, and a second learning unit that generates a machine reading comprehension model trained to output answer candidates corresponding to the training questions and the training literature and scores indicating the likelihood of the answer candidates using the set of training questions, training literature including training answers to the training questions, and the training answers as training data.

（４）本発明の一態様は、学習用質問と当該学習用質問に対する学習用回答を含む学習用文献を特徴づける単語である特徴語とのセットを学習データとして、前記学習用質問に対応する特徴語を生成するように学習した生成モデルを生成する第１の学習部と、学習用質問、当該学習用質問に対する学習用回答を含む学習用文献、および当該学習用回答のセットを学習データとして、当該学習用質問と当該学習用文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するように学習した機械読解モデルを生成する第２の学習部と、を備える、学習装置と、前記生成モデルを有し、当該生成モデルを用いて、入力される質問に対応する特徴語を生成する特徴語生成部と、入力される前記質問に含まれる単語に基づいて当該単語に合致する単語を含む文献を文献データベースから検索する処理と、前記特徴語生成部により生成された特徴語に基づいて当該特徴語に合致する単語を含む文献を前記文献データベースから検索する処理とを実行する検索部と、前記機械読解モデルを有し、前記機械読解モデルを用いて、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力する機械読解部と、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを前記機械読解部から取得し、各回答候補の尤度を示すスコアの比較に基づいて当該質問に対する回答を予測した結果を出力する予測部と、を備える、検索装置と、を備える、質問回答予測システムである。
(4) One aspect of the present invention provides a learning device including: a first learning unit that uses a set of training questions and feature words that characterize training documents including training answers to the training questions as training data to generate a generative model trained to generate feature words corresponding to the training questions; and a second learning unit that uses a set of training questions, training documents including training answers to the training questions, and the training answers as training data to generate a machine reading comprehension model trained to output answer candidates corresponding to the training questions and the training documents and scores indicating the likelihood of the answer candidates; a feature word generation unit that has the generative model and uses the generative model to generate feature words corresponding to an input question; a process of searching a literature database for documents including words that match words included in the input question based on words included in the input question; and a process of searching a literature database for documents including words that match words included in the input question based on the feature words generated by the feature word generation unit. from the literature database; a machine reading comprehension unit that has the machine reading comprehension model and uses the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on words contained in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and a score indicating the likelihood of the answer candidates; and a prediction unit that obtains from the machine reading comprehension unit the answer candidates corresponding to the input question and documents searched based on words contained in the question and the answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and outputs a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate.

（５）本発明の一態様は、学習装置が、学習用質問と当該学習用質問に対する学習用回答を含む学習用文献を特徴づける単語である特徴語とのセットを学習データとして、前記学習用質問に対応する特徴語を生成するように学習した生成モデルを生成するステップと、前記学習装置が、学習用質問、当該学習用質問に対する学習用回答を含む学習用文献、および当該学習用回答のセットを学習データとして、当該学習用質問と当該学習用文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するように学習した機械読解モデルを生成するステップと、検索装置が、前記生成モデルを用いて、入力される質問に対応する特徴語を生成するステップと、前記検索装置が、入力される前記質問に含まれる単語に基づいて当該単語に合致する単語を含む文献を文献データベースから検索する処理と、前記特徴語に基づいて当該特徴語に合致する単語を含む文献を前記文献データベースから検索する処理とを実行するステップと、前記検索装置が、前記機械読解モデルを用いて、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するステップと、前記検索装置が、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを取得し、各回答候補の尤度を示すスコアの比較に基づいて当該質問に対する回答を予測した結果を出力するステップと、を有する、質問回答予測方法である。
(5) One aspect of the present invention includes a step of a learning device generating a generative model trained to generate feature words corresponding to a learning question using a set of a learning question and feature words that characterize a learning document including a learning answer to the learning question as learning data; a step of the learning device generating a machine reading comprehension model trained to output answer candidates corresponding to the learning question and the learning document and scores indicating the likelihood of the answer candidates using the learning question, the learning document including a learning answer to the learning question, and the set of the learning answers as learning data; a step of a search device generating feature words corresponding to an input question using the generative model; a process of the search device searching a literature database for literature including words that match a word included in the input question based on the word; and a process of searching a literature database for literature including words that match the feature words based on the feature words. a step of the search device using the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on words contained in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and scores indicating the likelihood of the answer candidates; and a step of the search device acquiring answer candidates corresponding to the input question and documents searched based on words contained in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and outputting a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate.

（６）本発明の一態様は、検索装置が、学習用質問と当該学習用質問に対する学習用回答を含む学習用文献を特徴づける単語である特徴語とのセットを学習データとして、前記学習用質問に対応する特徴語を生成するように学習した生成モデルを有し、当該生成モデルを用いて、入力される質問に対応する特徴語を生成するステップと、前記検索装置が、入力される前記質問に含まれる単語に基づいて当該単語に合致する単語を含む文献を文献データベースから検索する処理と、生成された特徴語に基づいて当該特徴語に合致する単語を含む文献を前記文献データベースから検索する処理とを実行するステップと、前記検索装置が、学習用質問、当該学習用質問に対する学習用回答を含む学習用文献、および当該学習用回答のセットを学習データとして、当該学習用質問と当該学習用文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するように学習した機械読解モデルを有し、前記機械読解モデルを用いて、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するステップと、前記検索装置が、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを取得し、各回答候補の尤度を示すスコアの比較に基づいて当該質問に対する回答を予測した結果を出力するステップと、を有する、検索方法である。
(6) One aspect of the present invention is a search device having a generative model trained to generate feature words corresponding to a training question using a set of training questions and feature words that characterize training documents including training answers to the training questions as training data, and generating feature words corresponding to the training questions using the generative model; the search device executing a process of searching a literature database for documents including words that match a word included in the input question based on a word included in the input question, and a process of searching the literature database for documents including words that match the feature words based on the generated feature words; and the search device using the set of training questions, training documents including training answers to the training questions, and the training answers as training data to search for answer candidates corresponding to the training questions and the training documents and the likelihood of the answer candidates. a step of using the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on words contained in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and a score indicating the likelihood of the answer candidates; and a step of the search device acquiring answer candidates corresponding to the input question and documents searched based on words contained in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and outputting a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate.

（７）本発明の一態様は、検索装置のコンピュータに、学習用質問と当該学習用質問に対する学習用回答を含む学習用文献を特徴づける単語である特徴語とのセットを学習データとして、前記学習用質問に対応する特徴語を生成するように学習した生成モデルを有し、当該生成モデルを用いて、入力される質問に対応する特徴語を生成するステップと、入力される前記質問に含まれる単語に基づいて当該単語に合致する単語を含む文献を文献データベースから検索する処理と、生成された特徴語に基づいて当該特徴語に合致する単語を含む文献を前記文献データベースから検索する処理とを実行するステップと、学習用質問、当該学習用質問に対する学習用回答を含む学習用文献、および当該学習用回答のセットを学習データとして、当該学習用質問と当該学習用文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するように学習した機械読解モデルを有し、前記機械読解モデルを用いて、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するステップと、入力される前記質問と当該質問に含まれる単語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコア、および当該質問と当該質問に対応する特徴語に基づいて検索された文献とに対応する回答候補および当該回答候補の尤度を示すスコアを取得し、各回答候補の尤度を示すスコアの比較に基づいて当該質問に対する回答を予測した結果を出力するステップと、を実行させる、プログラムである。
(7) One aspect of the present invention is a search device having a computer having a generative model trained to generate feature words corresponding to training questions using a set of training questions and feature words that characterize training documents containing training answers to the training questions as training data, and performing the steps of: generating feature words corresponding to an input question using the generative model; searching a literature database for documents containing words that match words included in the input question based on the words included in the input question; and searching the literature database for documents containing words that match the generated feature words; and using the set of training questions, training documents containing training answers to the training questions, and the training answers as training data, searching for answer candidates corresponding to the training questions and the training documents and the likelihoods of the answer candidates. The program has a machine reading comprehension model trained to output a score indicating the likelihood of each answer candidate, and uses the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on words contained in the question, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and scores indicating the likelihood of the answer candidates; obtain answer candidates corresponding to the input question and documents searched based on words contained in the question, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and output a result of predicting the answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate.

（８）本発明の一態様は、学習装置が、学習用質問と当該学習用質問に対する学習用回答を含む学習用文献を特徴づける単語である特徴語とのセットを学習データとして、前記学習用質問に対応する特徴語を生成するように学習した生成モデルを生成するステップと、前記学習装置が、学習用質問、当該学習用質問に対する学習用回答を含む学習用文献、および当該学習用回答のセットを学習データとして、当該学習用質問と当該学習用文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するように学習した機械読解モデルを生成するステップと、を有する、学習方法である。
(8) One aspect of the present invention is a learning method comprising the steps of: a learning device generating a generative model trained to generate feature words corresponding to a training question using a set of training questions and feature words that are words that characterize training literature including training answers to the training questions as training data; and a learning device generating a machine reading comprehension model trained to output answer candidates corresponding to the training questions and the training literature and scores indicating the likelihood of the answer candidates using the set of training questions, training literature including training answers to the training questions, and the training answers as training data.

（９）本発明の一態様は、学習装置のコンピュータに、学習用質問と当該学習用質問に対する学習用回答を含む学習用文献を特徴づける単語である特徴語とのセットを学習データとして、前記学習用質問に対応する特徴語を生成するように学習した生成モデルを生成するステップと、前記学習装置が、学習用質問、当該学習用質問に対する学習用回答を含む学習用文献、および当該学習用回答のセットを学習データとして、当該学習用質問と当該学習用文献とに対応する回答候補および当該回答候補の尤度を示すスコアを出力するように学習した機械読解モデルを生成するステップと、を実行させる、プログラムである。 (9) One aspect of the present invention is a program that causes a computer of a learning device to execute the steps of: generating a generative model trained to generate feature words corresponding to a learning question, using as training data a set of learning questions and feature words that are words that characterize training literature including training answers to the training questions; and generating a machine reading comprehension model trained by the learning device to use as training data a set of training questions, training literature including training answers to the training questions , and the training answers, and to output answer candidates corresponding to the training questions and the training literature, and scores indicating the likelihood of the answer candidates.

本発明の一態様によれば、高い精度で質問に対する回答を出力することができる。 According to one aspect of the present invention, answers to questions can be output with high accuracy.

実施形態の質問回答予測システムの構成例を示すブロック図である。1 is a block diagram showing a configuration example of a question answer prediction system according to an embodiment. 実施形態の特徴語生成モデルの学習処理を説明するためのブロック図である。FIG. 2 is a block diagram for explaining a learning process of a feature word generation model according to an embodiment. 実施形態の特徴語生成モデルの構成の一例を示すブロック図である。FIG. 2 is a block diagram showing an example of the configuration of a feature word generation model according to the embodiment. 実施形態の機械読解モデルの学習処理を説明するためのブロック図である。FIG. 2 is a block diagram for explaining a learning process of a machine reading comprehension model according to an embodiment. 実施形態の機械読解モデルの構成の一例を示すブロック図である。FIG. 2 is a block diagram showing an example of a configuration of a machine reading comprehension model according to an embodiment. 実施形態の質問回答予測システムにおける処理手順の一例を示すフローチャートである。1 is a flowchart illustrating an example of a processing procedure in a question answer prediction system according to an embodiment. 実施形態の質問回答予測システムにおける処理手順の一例を示すブロック図である。1 is a block diagram showing an example of a processing procedure in a question answer prediction system according to an embodiment;

以下、本発明を適用した検索装置および検索方法、学習装置および学習方法、質問回答予測システムおよび質問回答予測方法、並びにプログラムを、図面を参照して説明する。 Below, a search device and search method, a learning device and learning method, a question answer prediction system and question answer prediction method, and a program to which the present invention is applied will be described with reference to the drawings.

＜質問回答予測システムの構成＞
図１は、実施形態の質問回答予測システムの構成例を示すブロック図である。実施形態の質問回答予測システム１は、未知の質問文を受け付けた場合に、当該質問に含まれる単語を用いて文献を検索し、検索した文献を用いて回答を予測し、予測した回答を提供するサービスを提供するためのＷｅｂシステムやアプリケーションシステムである。 <Configuration of question answer prediction system>
1 is a block diagram showing a configuration example of a question answer prediction system according to an embodiment. The question answer prediction system 1 according to the embodiment is a Web system or an application system for providing a service that, when an unknown question sentence is received, searches documents using words contained in the question, predicts an answer using the searched documents, and provides the predicted answer.

質問回答予測システム１は、例えば、一または複数の端末装置１００と、サービス提供サーバ２００と、ジョブキュー２３０および２４０と、全文検索サーバ３００と、文献データベース３１０と、特徴語生成部４００と、特徴語生成モデル４１０と、機械読解部５００と、機械読解モデル５１０とを備える。質問回答予測システム１における端末装置１００、サービス提供サーバ２００、ジョブキュー２３０および２４０、全文検索サーバ３００、文献データベース３１０、特徴語生成部４００、特徴語生成モデル４１０、機械読解部５００、および機械読解モデル５１０は、例えば、通信ネットワークに接続され、ＮＩＣ（ＮｅｔｗｏｒｋＩｎｔｅｒｆａｃｅＣａｒｄ）や無線通信モジュールなどの通信インターフェースを備える（図１では不図示）。通信ネットワークは、例えば、インターネット、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）、セルラー網、Ｗｉｆｉ（登録商標）などを含んでよい。 The question answer prediction system 1 includes, for example, one or more terminal devices 100, a service providing server 200, job queues 230 and 240, a full-text search server 300, a literature database 310, a feature word generation unit 400, a feature word generation model 410, a machine reading unit 500, and a machine reading model 510. The terminal device 100, the service providing server 200, the job queues 230 and 240, the full-text search server 300, the literature database 310, the feature word generation unit 400, the feature word generation model 410, the machine reading unit 500, and the machine reading model 510 in the question answer prediction system 1 are connected to, for example, a communication network and include a communication interface such as a NIC (Network Interface Card) or a wireless communication module (not shown in FIG. 1). The communication network may include, for example, the Internet, a Wide Area Network (WAN), a Local Area Network (LAN), a cellular network, Wi-Fi (registered trademark), etc.

端末装置１００は、例えば、表示部、操作部、およびＣＰＵ等を備えたパーソナルコンピュータやスマートフォン等のコンピュータである。端末装置１００は、コンテンツの表示や操作内容の受け付け、および情報の送受信等を行うためにブラウザ１１０を備える。端末装置１００は、ブラウザ１１０に限らず、アプリケーションプログラムなどのＵＡ（ＵｓｅｒＡｇｅｎｔ）を起動してよい。ＵＡは、例えば、サービス提供サーバ２００が提供するサービスを受けるためのアプリケーションである。端末装置１００は、ＵＡとしてのアプリケーションを利用して、サービス提供サーバ２００から受信したコンテンツを用いて表示処理や操作の受け付け処理などを行う。実施形態の端末装置１００は、ユーザの操作に基づく質問文を含む要求をサービス提供サーバ２００に提供し、サービス提供サーバ２００から質問文に対する回答を含む応答を受け付ける。 The terminal device 100 is, for example, a computer such as a personal computer or a smartphone equipped with a display unit, an operation unit, and a CPU. The terminal device 100 is equipped with a browser 110 for displaying content, accepting operation contents, and transmitting and receiving information. The terminal device 100 may launch a UA (User Agent) such as an application program, not limited to the browser 110. The UA is, for example, an application for receiving a service provided by the service providing server 200. The terminal device 100 uses the application as the UA to perform display processing and operation acceptance processing using content received from the service providing server 200. The terminal device 100 of the embodiment provides a request including a question based on a user's operation to the service providing server 200, and accepts a response including an answer to the question from the service providing server 200.

サービス提供サーバ２００は、端末装置１００からの要求に応じ、質問文に対する回答を提供するサービスを実現するサーバ装置である。サービス提供サーバ２００は、例えば、ジョブ管理部２１０と、結果提供部２２０とを備える。ジョブ管理部２１０および結果提供部２２０といった機能部は、例えばＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）等のプロセッサがプログラムメモリに格納されたプログラムを実行することにより実現される。また、これらの機能部のうち一部または全部は、ＧＰＵ（ＧｒａｐｈｉｃｓＰｒｏｃｅｓｉｎｇＵｎｉｔ）やＴＰＵ（ＴｅｎｓｏｒＰｒｏｃｅｓｉｎｇ、Ｕｎｉｔ）など高速に機械学習処理（行列計算処理）を計算できるプロセッサや、ＧＰＵやＴＰＵなどのプロセッサの組み合わせ、ＬＳＩ（ＬａｒｇｅＳｃａｌｅＩｎｔｅｇｒａｔｉｏｎ）、ＡＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）、またはＦＰＧＡ（Ｆｉｅｌｄ-ＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）等のハードウェアにより実現されてもよいし、ソフトウェアとハードウェアが協働することで実現されてもよい。 The service providing server 200 is a server device that provides a service of providing answers to questions in response to requests from the terminal device 100. The service providing server 200 includes, for example, a job management unit 210 and a result providing unit 220. Functional units such as the job management unit 210 and the result providing unit 220 are realized by a processor, such as a CPU (Central Processing Unit), executing a program stored in a program memory. In addition, some or all of these functional units may be realized by hardware such as a processor capable of high-speed machine learning processing (matrix calculation processing), such as a GPU (Graphics Processing Unit) or a TPU (Tensor Processing Unit), a combination of processors such as a GPU or a TPU, an LSI (Large Scale Integration), an ASIC (Application Specific Integrated Circuit), or an FPGA (Field-Programmable Gate Array), or may be realized by software and hardware working together.

ジョブ管理部２１０は、端末装置１００からの要求に応じて、特徴語生成処理のジョブおよび機械読解処理のジョブを管理する。ジョブ管理部２１０は、端末装置１００から質問文を受け付けた場合に、特徴語生成処理のジョブをジョブキュー２３０に登録する。ジョブ管理部２１０は、機械読解処理の対象となる文書をジョブキュー２４０に登録する。結果提供部２２０は、処理結果としての回答を含む応答を端末装置１００に提供する。なお、ジョブキュー２３０は、特徴語生成部４００内の記憶部で構成されてよいが、サービス提供サーバ２００内の記憶部で構成されてもよい。ジョブキュー２４０は、機械読解部５００内の記憶部で構成されてよいが、サービス提供サーバ２００内の記憶部で構成されてもよい。 The job management unit 210 manages jobs for the feature word generation process and the machine comprehension process in response to requests from the terminal device 100. When the job management unit 210 receives a question from the terminal device 100, it registers a job for the feature word generation process in the job queue 230. The job management unit 210 registers a document to be subjected to the machine comprehension process in the job queue 240. The result providing unit 220 provides the terminal device 100 with a response including an answer as a processing result. The job queue 230 may be configured in a storage unit in the feature word generation unit 400, or may be configured in a storage unit in the service providing server 200. The job queue 240 may be configured in a storage unit in the machine comprehension unit 500, or may be configured in a storage unit in the service providing server 200.

特徴語生成部４００は、サービス提供サーバ２００から受け付けた質問文を単語に分割することで複数の質問語を生成し、各質問語を用いて特徴語を生成する特徴語生成処理を行う。特徴語生成部４００は、例えば、ＣＰＵ等のプロセッサであり、特徴語生成モデル４１０を実行することにより、特徴語生成処理を実現する。特徴語生成モデル４１０は、特徴語生成処理を実現するための各種のパラメータにより表現される。なお、プロセッサにはＧＰＵまたはＴＰＵなどの高速に行列計算処理を実行できるプロセッサの組み合わせが含まれてもよい。 The feature word generation unit 400 performs a feature word generation process in which it generates multiple question words by dividing the question sentence received from the service providing server 200 into words, and generates feature words using each question word. The feature word generation unit 400 is, for example, a processor such as a CPU, and realizes the feature word generation process by executing the feature word generation model 410. The feature word generation model 410 is expressed by various parameters for realizing the feature word generation process. The processor may include a combination of processors capable of performing high-speed matrix calculation processing, such as a GPU or a TPU.

全文検索サーバ３００は、サービス提供サーバ２００の要求に応じて、全文検索処理を実行するサーバ装置である。全文検索サーバ３００は、サービス提供サーバ２００から受け付けた質問文を単語（以下、質問語とも記載する。）に分割し、分割された単語群用いて全文検索処理を行う。全文検索処理は、文献データベース３１０に登録された文献に質問語と合致する単語が含まれるかを検索し、質問語と合致する単語を含む文献を抽出する処理である。文献データベース３１０は、多数の文献データが格納されたデータベースである。文献データベース３１０は、例えば、データベース管理等を行う処理部、ＨＤＤ（ＨａｒｄＤｉｓｃＤｒｉｖｅ）、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）、フラッシュメモリ、ＥＥＰＲＯＭ（ＥｌｅｃｔｒｉｃａｌｌｙＥｒａｓａｂｌｅＰｒｏｇｒａｍｍａｂｌｅＲｅａｄＯｎｌｙＭｅｍｏｒy），ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ），またはＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等により実現される。 The full-text search server 300 is a server device that executes full-text search processing in response to a request from the service providing server 200. The full-text search server 300 divides the question received from the service providing server 200 into words (hereinafter also referred to as question terms), and performs full-text search processing using the divided word groups. The full-text search processing is a process of searching whether documents registered in the literature database 310 contain words that match the question terms, and extracting documents that contain words that match the question terms. The literature database 310 is a database in which a large amount of literature data is stored. The literature database 310 is realized, for example, by a processing unit that performs database management, a hard disk drive (HDD), a solid state drive (SSD), a flash memory, an electrically erasable programmable read only memory (EEPROM), a read only memory (ROM), or a random access memory (RAM).

機械読解部５００は、サービス提供サーバ２００から受け付けた質問文を構成する複数の質問語、文献を用いて機械読解処理を行う。機械読解部５００は、例えば、ＣＰＵ等のプロセッサであり、機械読解モデル５１０を実行することにより、機械読解処理を実現する。機械読解モデル５１０は、機械読解処理を実現するための各種のパラメータにより表現される。機械読解部５００は、機械読解処理の処理結果として、質問に対する回答候補およびスコアをサービス提供サーバ２００に供給する。回答候補およびスコアは、結果提供部２２０により所定の処理が施されて端末装置１００に提供される。 The machine reading unit 500 performs machine comprehension processing using multiple question words and documents that make up the question received from the service providing server 200. The machine reading unit 500 is, for example, a processor such as a CPU, and realizes the machine reading comprehension processing by executing the machine reading comprehension model 510. The machine reading comprehension model 510 is expressed by various parameters for realizing the machine reading comprehension processing. The machine reading unit 500 supplies answer candidates and scores for the question to the service providing server 200 as the processing results of the machine reading comprehension processing. The answer candidates and scores are subjected to a predetermined processing by the result providing unit 220 and provided to the terminal device 100.

図２は、実施形態の特徴語生成モデルの学習処理を説明するためのブロック図である。特徴語生成モデル４１０は、機械読解モデル５１０を学習するために用いられる機械読解用学習データのうち、質問文と例えば後述の学習処理部６００により文献から抽出される特徴語を用いて学習される。特徴語生成モデル４１０の学習処理は、まず、特徴語生成モデル４１０に質問文を入力し、特徴語生成モデル４１０から当該質問文に対応する所定の特徴語を出力するように特徴語生成モデル４１０における処理パラメータを学習する。 Figure 2 is a block diagram for explaining the learning process of the feature word generation model of the embodiment. The feature word generation model 410 is trained using a question sentence and feature words extracted from documents by the learning processing unit 600 described below, for example, from machine reading comprehension learning data used to train the machine reading comprehension model 510. The learning process of the feature word generation model 410 first inputs a question sentence into the feature word generation model 410, and learns processing parameters in the feature word generation model 410 so that the feature word generation model 410 outputs a specified feature word corresponding to the question sentence.

学習処理部６００は、例えば以下のステップで処理を行う。（１）学習処理部６００は、例えば質問文と検索対象文献すべてについて単語と出現回数とのインデックスを作成する。（２）学習処理部６００は、作成したインデックスを用いて質問文と、質問文とペアになる回答を含む文献に対し、文献を特徴づける単語を、特徴語として抽出する。文献を特徴づける単語である特徴語は、全文献に対し、対象文献のＢＭ２５スコアが高くなる単語群である。（３）学習処理部６００は、上記ステップにより獲得した質問文と対応する特徴語のペアに対し、質問文を入力し例えばＢＭ２５スコアにおける上位１０件の単語を特徴語として出力するように特徴語生成モデル４１０の処理パラメータを学習する。学習処理部６００は、上記の（２）および（３）を学習用データの全質問文と文献のペアに対して１回もしくは複数回、特徴語生成モデル４１０の出力の誤差が収束するまで繰り返す。 The learning processing unit 600 performs processing in the following steps, for example. (1) The learning processing unit 600 creates an index of words and the number of occurrences for, for example, the question sentence and all documents to be searched. (2) The learning processing unit 600 uses the created index to extract words that characterize documents, as feature words, for documents including the question sentence and the answer that pairs with the question sentence. The feature words that characterize documents are a group of words that increase the BM25 score of the target document compared to all documents. (3) The learning processing unit 600 learns the processing parameters of the feature word generation model 410 so that, for the pair of the question sentence and the feature word corresponding to the question sentence obtained in the above steps, the question sentence is input and, for example, the top 10 words in the BM25 score are output as feature words. The learning processing unit 600 repeats the above (2) and (3) once or multiple times for all pairs of question sentences and documents in the learning data until the error in the output of the feature word generation model 410 converges.

図３は、実施形態の特徴語生成モデルの構成の一例を示すブロック図である。特徴語生成モデル４１０は、例えば、入力層、中間層、および出力層を備え、各層が複数のノードを持つ多層ニューラルネットワークである。学習処理部６００は、多層ニューラルネットワークにおける単語の埋め込み、注意、活性化関数、誤差関数、および勾配降下アルゴリズム、重み係数、および重み行列といった処理パラメータを更新することで、質問文に対応した特徴語を出力するように学習する。なお、実施形態における特徴語生成モデル４１０は多層ニューラルネットワークであるが、これに限定されず、他の機械学習モデルであってよい。さらに、特徴語生成モデル４１０は、質問文から特徴語を生成可能なモデルであれば、機械学習モデルに限定しない。 Figure 3 is a block diagram showing an example of the configuration of a feature word generation model according to an embodiment. The feature word generation model 410 is, for example, a multi-layer neural network including an input layer, an intermediate layer, and an output layer, with each layer having multiple nodes. The learning processing unit 600 learns to output feature words corresponding to a question sentence by updating processing parameters such as word embedding, attention, activation function, error function, gradient descent algorithm, weighting coefficient, and weighting matrix in the multi-layer neural network. Note that, although the feature word generation model 410 in the embodiment is a multi-layer neural network, this is not limited thereto, and other machine learning models may be used. Furthermore, the feature word generation model 410 is not limited to a machine learning model as long as it is a model capable of generating feature words from a question sentence.

図４は、実施形態の機械読解モデルの学習処理を説明するためのブロック図である。機械読解モデル５１０は、質問、当該質問の回答を含む文献、および回答のセットを複数含む機械読解処理用学習データを用いて学習される。回答は、例えば、文献中における区間を示す情報である。機械読解モデル５１０の学習処理は、まず、機械読解モデル５１０に質問および文献を入力し、機械読解モデル５１０から当該質問および文献に対応する回答を出力するように機械読解モデル５１０における処理パラメータを学習する。 Figure 4 is a block diagram for explaining the learning process of the machine reading comprehension model of the embodiment. The machine reading comprehension model 510 is trained using learning data for machine reading comprehension processing that includes a question, a document containing an answer to the question, and multiple sets of answers. The answer is, for example, information indicating a section in a document. The learning process of the machine reading comprehension model 510 first inputs a question and a document into the machine reading comprehension model 510, and learns processing parameters in the machine reading comprehension model 510 so that the machine reading comprehension model 510 outputs an answer corresponding to the question and document.

図５は、実施形態の機械読解モデルの構成の一例を示すブロック図である。機械読解モデル５１０は、例えば入力層、中間層、および出力層を備え、各層が複数のノードを持つ多層ニューラルネットワークである。学習処理部６１０は、多層ニューラルネットワークにおける単語の埋め込み、注意、活性化関数、誤差関数、及び勾配降下アルゴリズム、重み係数、および重み行列といった処理パラメータを更新することで、質問文に対応した特徴語を出力するように学習する。なお、実施形態における機械読解モデル５１０は多層ニューラルネットワークであるが、これに限定されず、他の機械学習モデルであってよい。さらに、機械読解モデル５１０は、質問文と文献から文献中に存在する質問文の回答となる箇所を出力可能なモデルであれば機械学習モデルに限定しない。 FIG. 5 is a block diagram showing an example of the configuration of a machine reading comprehension model according to an embodiment. The machine reading comprehension model 510 is, for example, a multi-layer neural network having an input layer, an intermediate layer, and an output layer, with each layer having multiple nodes. The learning processing unit 610 learns to output feature words corresponding to a question sentence by updating processing parameters such as word embedding, attention, activation function, error function, gradient descent algorithm, weighting coefficient, and weighting matrix in the multi-layer neural network. Note that, although the machine reading comprehension model 510 in the embodiment is a multi-layer neural network, it is not limited to this and may be another machine learning model. Furthermore, the machine reading comprehension model 510 is not limited to a machine learning model as long as it is a model that can output a part that is an answer to the question sentence that exists in the document from the question sentence and the document.

機械読解モデル５１０の出力層は、文献中に含まれる回答を示す区間を評価するためのスコアを出力することが可能なノードであればよい。文献中に含まれる回答を示す区間を評価するためのスコアは、例えば、すべての区間の値が０～１の所定範囲に収まり、他の区間との総和が考慮されない値で表される。このスコアは、例えば、区間同士の絶対的な尤度を示す値と読み替えてよい。この出力層に適用された活性化関数は、例えばＳｉｇｍｏｉｄ（シグモイド）関数と呼ばれる。機械読解モデル５１０は、例えば、複数の文献を用いて複数の区間が抽出された場合に、Ｓｉｇｍｏｉｄ関数を用いて区間およびスコアを出力する。複数の文献を用いて複数の区間が抽出された場合とは、例えば、質問語から全文検索処理により取得された文献Ａから機械読解処理により区間Ａが抽出され、特徴語から全文検索処理により取得された文献Ｂから機械読解処理により区間Ｂが抽出されることである。これにより、サービス提供サーバ２００は、異なる文献Ａ，Ｂであっても、区間Ａのスコアと区間Ｂのスコアとを比較して順位付けをすることができる。 The output layer of the machine reading comprehension model 510 may be any node capable of outputting a score for evaluating an interval indicating an answer contained in a document. The score for evaluating an interval indicating an answer contained in a document is expressed, for example, as a value in which the values of all intervals fall within a predetermined range of 0 to 1 and the sum with other intervals is not taken into consideration. This score may be interpreted, for example, as a value indicating the absolute likelihood between intervals. The activation function applied to this output layer is called, for example, a Sigmoid function. For example, when multiple intervals are extracted using multiple documents, the machine reading comprehension model 510 outputs the interval and score using the Sigmoid function. When multiple intervals are extracted using multiple documents, for example, interval A is extracted by machine reading comprehension processing from document A obtained by full-text search processing from a question word, and interval B is extracted by machine reading comprehension processing from document B obtained by full-text search processing from a feature word. This allows the service providing server 200 to compare the scores of section A and section B and rank them, even if documents A and B are different.

図６は、実施形態の質問回答予測システムにおける処理手順の一例を示すフローチャートである。図７は、実施形態の質問回答予測システムにおける処理手順の一例を示すブロック図である。
上述した質問回答予測システムは、まず、サービス提供サーバ２００が、端末装置１００から質問文を受け付ける（ステップＳ１００）。次に、サービス提供サーバ２００は、特徴語生成部４００に質問文を提供する。特徴語生成部４００は、質問文を単語に分割することで質問語を生成し、質問語に基づく特徴語を生成する（ステップＳ１０２）。サービス提供サーバ２００は、生成された特徴語を取得する。 6 is a flowchart illustrating an example of a processing procedure in the question answer prediction system according to the embodiment. FIG 7 is a block diagram illustrating an example of a processing procedure in the question answer prediction system according to the embodiment.
In the above-described question answer prediction system, first, the service providing server 200 accepts a question sentence from the terminal device 100 (step S100). Next, the service providing server 200 provides the question sentence to the feature word generating unit 400. The feature word generating unit 400 generates question words by dividing the question sentence into words, and generates feature words based on the question words (step S102). The service providing server 200 acquires the generated feature words.

次にサービス提供サーバ２００は、質問語および特徴語を全文検索サーバ３００に提供する。全文検索サーバ３００は、質問語を用いた全文検索処理、および特徴語を用いた全文検索処理を実行する（ステップＳ１０４）。サービス提供サーバ２００は、質問語を用いた全文検索処理の結果としての文献、および特徴語を用いた全文検索処理の結果としての文献を取得する。 Next, the service providing server 200 provides the question term and the characteristic words to the full-text search server 300. The full-text search server 300 executes a full-text search process using the question term and a full-text search process using the characteristic words (step S104). The service providing server 200 acquires documents as a result of the full-text search process using the question term and documents as a result of the full-text search process using the characteristic words.

次にサービス提供サーバ２００は、取得した文献を機械読解処理の処理単位の長さを持つ文書に分割し（ステップＳ１０６）、機械読解部５００は、各文書について機械読解処理を実行する（ステップＳ１０８）。サービス提供サーバ２００は、各質問に対応する文書から取得した回答候補および当該回答候補のスコアを取得する。 Next, the service providing server 200 divides the acquired literature into documents having the length of the processing unit of the machine reading comprehension process (step S106), and the machine reading unit 500 executes the machine reading comprehension process for each document (step S108). The service providing server 200 acquires the answer candidates acquired from the documents corresponding to each question and the scores of the answer candidates.

次にサービス提供サーバ２００は、機械読解部５００から取得した回答候補およびスコアに基づいて、処理結果としての回答を生成する（ステップＳ１１０）。このとき、サービス提供サーバ２００は、回答候補間におけるスコアの比較に基づいて回答を出力する。具体的に、サービス提供サーバ２００は、回答候補が同じである場合において、当該同じ回答候補のそれぞれのスコアが閾値を超える場合、当該同じ回答候補同士のスコアを合計する。 Next, the service providing server 200 generates an answer as a processing result based on the answer candidates and scores acquired from the machine reading unit 500 (step S110). At this time, the service providing server 200 outputs the answer based on a comparison of the scores between the answer candidates. Specifically, when the answer candidates are the same and the scores of the same answer candidates exceed a threshold, the service providing server 200 adds up the scores of the same answer candidates.

ここで、別々の文献から同一の回答が出力された場合に、正解である可能性を高くすることができる。しかし、機械読解モデル５１０が出力したスコアの値を考慮せずにスコアを加算すると頻出の単語等でスコアが低い多数の回答が出力された場合に、回答が正解である精度が低下してしまう。また、スコアの範囲に負値を含めた場合に、加算することでスコアが減少する可能性がある。そこで、サービス提供サーバ２００は、スコアの加算対象を限定するために閾値を設定し、閾値を超えるスコアを持つ回答候補同士のスコアを加算する。サービス提供サーバ２００は、加算されないスコアを持つ回答候補が複数存在する場合、複数の回答候補のスコアのうちで最大値のスコアを、当該回答候補のスコアとして決定する。 Here, when the same answer is output from different documents, the possibility that it is correct can be increased. However, if the scores are added without taking into account the score values output by the machine reading comprehension model 510, the accuracy of the answer being correct will decrease when a large number of answers with low scores are output for frequently occurring words, etc. Also, if negative values are included in the score range, there is a possibility that the score will decrease due to addition. Therefore, the service providing server 200 sets a threshold value to limit the score addition targets, and adds the scores of answer candidates with scores exceeding the threshold value. When there are multiple answer candidates with scores that are not added, the service providing server 200 determines the maximum score among the scores of the multiple answer candidates as the score of that answer candidate.

サービス提供サーバ２００は、複数の回答候補をスコアでランキングする。サービス提供サーバ２００は、スコアが大きい方の回答候補から降順に回答候補を並べた情報を結果として生成し（ステップＳ１１０）、生成した情報を端末装置１００に提供する（ステップＳ１１２）。 The service providing server 200 ranks the multiple answer candidates by score. The service providing server 200 generates result information in which the answer candidates are arranged in descending order from the answer candidate with the highest score (step S110), and provides the generated information to the terminal device 100 (step S112).

なお、実施形態の質問回答予測システム１は、回答を含む文献の文字情報を端末装置１００により表示させ、ユーザの操作に基づく回答を用いて機械読解モデル５１０のための学習処理を行ってよい。 In addition, the question answer prediction system 1 of the embodiment may display text information of a document including the answer on the terminal device 100, and perform a learning process for the machine reading comprehension model 510 using the answer based on the user's operation.

＜実施形態の効果＞
以上説明したように、実施形態の質問回答予測システム１によれば、質問を入力し、質問に含まれる単語に基づいて文献を検索する検索部（例えば全文検索サーバ３００）と、検索部により検索された文献に基づく情報を入力し、質問に対する回答を予測した結果を出力する予測モデル（例えば機械読解モデル５１０）を有する予測部（例えば機械読解部５００）と、備える、検索装置（例えばサービス提供サーバ２００、全文検索サーバ３００および機械読解部５００）を実現することができる。この検索装置において、予測モデルは、例えば、学習用質問、学習用回答、および前記学習用回答を含む学習用文献を含むセットを学習データとして学習したモデルである。
実施形態の質問回答予測システム１によれば、質問、質問の回答、および回答を含む文献を第１学習データとして学習し、未知の質問および文献を入力した場合に回答を出力する予測モデル（例えば機械読解モデル５１０）を生成する第１の学習部（例えば学習処理部６１０）と、第１学習データのうち質問および質問に関連する特徴語を学習データとして学習し、未知の質問を入力した場合に、当該未知の質問に関連する特徴語を出力する特徴語生成モデル（例えば特徴語生成モデル４１０）を生成する第２の学習部（例えば学習処理部６００）と、を備える学習装置（例えば学習処理部６００と学習処理部６１０との組み合わせ）と、特徴語生成モデルを有し、未知の質問を入力した場合に、当該未知の質問に関連する特徴語を生成する特徴語生成部（例えば特徴語生成部４００）と、未知の質問に含まれる単語に基づいて文献を検索する処理と、および特徴語生成部により生成された特徴語に基づいて文献を検索する処理とを実行する検索部（例えば全文検索サーバ３００）と、予測モデルを有し、検索部により検索された文献に基づく情報を入力し、未知の質問に対する回答を予測した結果を出力する予測部（例えば機械読解部５００）と、を備える、検索装置（例えば全文検索サーバ３００と特徴語生成部４００と機械読解部５００との組み合わせ）と、を備える、質問回答予測システムを実現することができる。
実施形態の質問回答予測システム１によれば、質問、質問の回答、および回答を含む文献を第１学習データとして学習し、未知の質問および文献を入力した場合に回答を出力する予測モデルを生成するステップと、第１学習データのうち質問および質問に関連する特徴語を学習データとして学習し、未知の質問を入力した場合に、当該未知の質問に関連する特徴語を出力する特徴語生成モデルを生成するステップと、特徴語生成モデルを用いて、未知の質問を入力した場合に、当該未知の質問に関連する特徴語を生成するステップと、未知の質問に含まれる単語に基づいて文献を検索する処理と、および特徴語生成部により生成された特徴語に基づいて文献を検索する処理とを実行するステップと、予測モデルを用いて、検索された文献に基づく情報を入力し、未知の質問に対する回答を予測した結果を出力するステップと、を有する、質問回答予測方法を実現することができる。
質問回答予測システム１によれば、高い精度で質問に対する回答を出力することができることができる。すなわち、質問回答予測システム１によれば、ユーザの質問に対して単純な文字列等の一致に基づく文献の検索結果ではなく、質問から検索した文献に含まれる文字列のうち回答の確率の高い情報を、回答の結果として提供することができる。この結果、質問回答予測システム１によれば、質問の文脈を考慮した回答を提供することができる。 Effects of the embodiment
As described above, according to the embodiment of the question answer prediction system 1, a search device (e.g., the service providing server 200, the full-text search server 300, and the machine reading unit 500) can be realized that includes a search unit (e.g., the full-text search server 300) that inputs a question and searches for literature based on words included in the question, and a prediction unit (e.g., the machine reading unit 500) that inputs information based on the literature searched by the search unit and has a prediction model (e.g., the machine reading comprehension model 510) that outputs a result of predicting an answer to the question. In this search device, the prediction model is, for example, a model trained using a set including learning questions, learning answers, and learning literature including the learning answers as learning data.
According to the question answer prediction system 1 of the embodiment, a learning device (for example, a combination of the learning processing unit 600 and the learning processing unit 610) including a first learning unit (for example, the learning processing unit 610) that learns questions, answers to the questions, and documents containing the answers as first learning data, and generates a prediction model (for example, the machine reading comprehension model 510) that outputs an answer when an unknown question and document are input, and a second learning unit (for example, the learning processing unit 600) that learns questions and feature words related to the questions from the first learning data as learning data, and generates a feature word generation model (for example, the feature word generation model 410) that outputs feature words related to the unknown question when an unknown question is input, and It is possible to realize a question answer prediction system comprising a search device (e.g., a combination of the full-text search server 300, the feature word generation unit 400, and the machine reading comprehension unit 500) that includes a feature word generation unit (e.g., the feature word generation unit 400) that generates feature words related to an unknown question when an unknown question is input, a search unit (e.g., the full-text search server 300) that performs a process of searching for literature based on words contained in the unknown question and a process of searching for literature based on the feature words generated by the feature word generation unit, and a prediction unit (e.g., the machine reading comprehension unit 500) that has a prediction model, inputs information based on the literature searched by the search unit, and outputs a result of predicting an answer to the unknown question.
According to the embodiment, the question answer prediction system 1 can realize a question answer prediction method including the steps of: learning questions, answers to the questions, and literature containing the answers as first learning data, and generating a prediction model that outputs an answer when an unknown question and literature are input; learning questions and feature words related to the questions from the first learning data as learning data, and generating a feature word generation model that outputs feature words related to the unknown question when an unknown question is input; generating feature words related to the unknown question using the feature word generation model when an unknown question is input; performing a process of searching for literature based on words included in the unknown question and a process of searching for literature based on the feature words generated by the feature word generation unit; and using the prediction model to input information based on the searched literature, and outputting a result of predicting the answer to the unknown question.
According to the question answer prediction system 1, it is possible to output an answer to a question with high accuracy. That is, according to the question answer prediction system 1, it is possible to provide information that has a high probability of being an answer among character strings contained in documents searched from a question as an answer result, rather than a document search result based on a simple match of character strings, etc., to a user's question. As a result, according to the question answer prediction system 1, it is possible to provide an answer that takes into account the context of the question.

実施形態の質問回答予測システム１によれば、質問に含まれる単語を入力し、当該質問に関連する特徴語を生成する生成モデルを有する特徴語生成部（例えば特徴語生成部４００）を備え、検索部により、質問に含まれる単語に基づいて文献を検索する処理と、および特徴語生成部により生成された特徴語に基づいて文献を検索する処理とを実行することができる。生成モデルは、例えば、学習データに含まれる学習用質問を入力し、入力した学習用質問に関連する特徴語を出力するように学習されたモデルである。
この質問回答予測システム１によれば、更に高い精度で質問に対する回答を出力することができることができる。すなわち、質問回答予測システム１によれば、質問に含まれる単語から検索した文献が存在しても当該文献に回答が含まれなくても、特徴語から検索した文献から回答を取得することができる。また、質問回答予測システム１によれば、ユーザの質問から検索した文献に正解となる回答を含んでいない場合でも、特徴語から検索した文献に正解となる回答を含んでいる場合に、質問に対して正解となる回答を提供することができる。 According to the embodiment of the question answer prediction system 1, a feature word generation unit (e.g., the feature word generation unit 400) having a generative model that receives input of words contained in a question and generates feature words related to the question is provided, and a search unit can execute a process of searching for documents based on the words contained in the question and a process of searching for documents based on the feature words generated by the feature word generation unit. The generative model is, for example, a model trained to receive input of a learning question contained in the learning data and output feature words related to the input learning question.
According to this question answer prediction system 1, it is possible to output an answer to a question with even higher accuracy. That is, according to the question answer prediction system 1, even if there is a document searched from a word included in a question but the document does not contain the answer, the answer can be obtained from the document searched from the characteristic word. Furthermore, according to the question answer prediction system 1, even if the document searched from the user's question does not contain the correct answer, if the document searched from the characteristic word contains the correct answer, the correct answer to the question can be provided.

実施形態の質問回答予測システム１によれば、予測モデルにより、結果の元となる情報である一または複数の回答候補、および当該一または複数の回答候補のスコアを出力する出力層を備え、予測部により、質問に含まれる単語に基づく文献中の回答候補およびスコア、特徴語に基づく文献中の回答候補およびスコアを出力し、回答候補間における前記スコアの比較に基づいて結果を出力することができる。また、実施形態の質問回答予測システム１によれば、予測モデルにより、結果の元となる情報である一または複数の回答候補、および当該一または複数の回答候補のスコアであって一または複数のスコアの上限および下限が設定されかつ総和が所定値に調整されないスコアを出力する出力層を備え、予測部により、質問に含まれる単語に基づく文献中の回答候補およびスコア、特徴語に基づく文献中の回答候補およびスコアを出力し、回答候補間における前記スコアの比較に基づいて結果を出力することができる。
この質問回答予測システム１によれば、更に高い精度で質問に対する回答を出力することができる。すなわち、質問回答予測システム１によれば、質問ならびに特徴語に基づく文献から抽出した複数の回答候補のスコアを比較することで、多数の回答候補から回答の確率の高い情報を提供することができる。 According to the embodiment of the question answer prediction system 1, the prediction model includes an output layer that outputs one or more answer candidates, which are information on which the results are based, and the scores of the one or more answer candidates, and the prediction unit outputs answer candidates and scores in documents based on words included in the question, and answer candidates and scores in documents based on characteristic words, and the result can be output based on a comparison of the scores between the answer candidates. Also, according to the embodiment of the question answer prediction system 1, the prediction model includes an output layer that outputs one or more answer candidates, which are information on which the results are based, and scores of the one or more answer candidates, in which upper and lower limits for one or more scores are set and the sum is not adjusted to a predetermined value, and the prediction unit outputs answer candidates and scores in documents based on words included in the question, and answer candidates and scores in documents based on characteristic words, and the result can be output based on a comparison of the scores between the answer candidates.
According to this question answer prediction system 1, it is possible to output answers to questions with even higher accuracy. That is, according to the question answer prediction system 1, by comparing the scores of multiple answer candidates extracted from documents based on the question and feature words, it is possible to provide information with a high probability of being the answer from a large number of answer candidates.

質問回答予測システム１によれば、予測部により、予測モデルにより出力された回答候補が同じである場合において、当該同じ回答候補のそれぞれのスコアが閾値を超える場合、当該同じ回答候補同士のスコアを合計することができる。これにより、質問回答予測システム１によれば、複数の文献で、回答の確率が低い多数の回答候補が出力された場合でも、ユーザに提供する回答の精度が低くなることを抑制することができる。具体的には、多くの文献に出現する情報であるが、質問の文脈とはあまり合致しない情報等を回答として提供することを抑制することができる。 According to the question answer prediction system 1, when the answer candidates output by the prediction model are the same, the prediction unit can add up the scores of the same answer candidates if the scores of the same answer candidates exceed a threshold. As a result, according to the question answer prediction system 1, even if a large number of answer candidates with low answer probability are output from multiple documents, it is possible to prevent the accuracy of the answer provided to the user from decreasing. Specifically, it is possible to prevent information that appears in many documents but does not match the context of the question from being provided as an answer.

なお、各実施形態および変形例について説明したが、一例であってこれらに限られず、例えば、各実施形態や各変形例のうちのいずれかや、各実施形態の一部や各変形例の一部を、他の１または複数の実施形態や他の１または複数の変形例と組み合わせて本発明の一態様を実現させてもよい。 Note that although each embodiment and each modified example have been described, these are merely examples and are not intended to be limiting. For example, any of the embodiments and modified examples, or a part of each embodiment or a part of each modified example, may be combined with one or more other embodiments or one or more other modified examples to realize one aspect of the present invention.

なお、本実施形態における個体推定装置１００の各処理を実行するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、当該記録媒体に記録されたプログラムを、コンピュータシステムに読み込ませ、実行することにより、端末装置１００、サービス提供サーバ２００、全文検索サーバ３００、特徴語生成部４００、機械読解部５００に係る上述した種々の処理を行ってもよい。 In addition, the programs for executing the various processes of the individual estimation device 100 in this embodiment may be recorded on a computer-readable recording medium, and the programs recorded on the recording medium may be read into and executed by a computer system to perform the various processes described above relating to the terminal device 100, the service providing server 200, the full-text search server 300, the feature word generation unit 400, and the machine reading unit 500.

なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器などのハードウェアを含むものであってもよい。また、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、フラッシュメモリなどの書き込み可能な不揮発性メモリ、ＣＤ－ＲＯＭなどの可搬媒体、コンピュータシステムに内蔵されるハードディスクなどの記憶装置のことをいう。 Note that the term "computer system" here may include hardware such as the OS and peripheral devices. In addition, if a WWW system is used, the term "computer system" also includes the homepage provision environment (or display environment). Furthermore, "computer-readable recording media" refers to storage devices such as flexible disks, magneto-optical disks, ROMs, writable non-volatile memories such as flash memory, portable media such as CD-ROMs, and hard disks built into computer systems.

さらに「コンピュータ読み取り可能な記録媒体」とは、インターネットなどのネットワークや電話回線などの通信回線を介してプログラムが送信された場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリ（例えばＤＲＡＭ（Ｄｙｎａｍｉｃ
ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ））のように、一定時間プログラムを保持しているものも含むものとする。また、上記プログラムは、このプログラムを記憶装置などに格納したコンピュータシステムから、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。 Furthermore, the term "computer-readable recording medium" refers to a volatile memory (e.g., DRAM (Dynamic Random Access Memory)) within a computer system that is a server or a client when a program is transmitted via a network such as the Internet or a communication line such as a telephone line.
This also includes those that hold a program for a certain period of time, such as a random access memory (RDMA). The program may be transmitted from a computer system in which the program is stored in a storage device to another computer system via a transmission medium, or by a transmission wave in the transmission medium.

ここで、プログラムを伝送する「伝送媒体」は、インターネットなどのネットワーク（通信網）や電話回線などの通信回線（通信線）のように情報を伝送する機能を有する媒体のことをいう。また、上記プログラムは、前述した機能の一部を実現するためのものであってもよい。さらに、前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるもの、いわゆる差分ファイル（差分プログラム）であってもよい。 Here, the "transmission medium" that transmits the program refers to a medium that has the function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line. The above program may also be one that realizes part of the above-mentioned functions. Furthermore, it may be one that can realize the above-mentioned functions in combination with a program already recorded in the computer system, a so-called differential file (differential program).

以上、本発明の実施形態について図面を参照して詳述したが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計なども含まれる。 The above describes an embodiment of the present invention in detail with reference to the drawings, but the specific configuration is not limited to this embodiment, and includes designs that do not deviate from the gist of the invention.

１質問回答予測システム
１００端末装置
１１０ブラウザ
２００サービス提供サーバ
２１０ジョブ管理部
２２０結果提供部
２３０、２４０ジョブキュー
３００全文検索サーバ
３１０文献データベース
４００特徴語生成部
４１０特徴語生成モデル
５００機械読解部
５１０機械読解モデル
６００、６１０学習処理部 1 Question answer prediction system 100 Terminal device 110 Browser 200 Service providing server 210 Job management unit 220 Result providing unit 230, 240 Job queue 300 Full-text search server 310 Literature database 400 Feature word generation unit 410 Feature word generation model 500 Machine reading comprehension unit 510 Machine reading comprehension model 600, 610 Learning processing unit

Claims

a feature word generation unit that has a generation model trained to generate feature words corresponding to the training questions using a set of training questions and feature words that characterize training documents including training answers to the training questions as training data, and that uses the generation model to generate feature words corresponding to the input questions;
a search unit that executes a process of searching a literature database for documents containing words that match a word included in the input question, based on the word included in the question, and a process of searching the literature database for documents containing words that match the feature word, based on the feature word generated by the feature word generation unit;
a machine reading comprehension unit that has a machine reading comprehension model trained to output answer candidates corresponding to the learning questions and the learning documents and scores indicating the likelihood of the answer candidates, using a set of learning questions, learning documents including learning answers to the learning questions, and the learning answers as learning data, and that uses the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on words included in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and scores indicating the likelihood of the answer candidates;
a prediction unit that obtains from the machine comprehension unit answer candidates corresponding to the input question and documents searched based on words included in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and scores indicating the likelihood of the answer candidates, and outputs a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate;
A search device comprising:

the prediction unit, when an answer candidate output by the machine reading comprehension model, which corresponds to the input question and a document searched based on a word included in the question, is the same as an answer candidate corresponding to the question and a document searched based on a feature word corresponding to the question, and when a score indicating the likelihood of each of the same answer candidates exceeds a threshold, adds up the scores of the same answer candidates;
The search device according to claim 1 .

a first learning unit that uses a set of training questions and feature words that characterize training documents including training answers to the training questions as training data to generate a generative model trained to generate feature words corresponding to the training questions;
a second learning unit that uses a set of learning questions, learning documents including learning answers to the learning questions, and the learning answers as learning data to generate a machine reading comprehension model that has been trained to output answer candidates corresponding to the learning questions and the learning documents and scores indicating the likelihoods of the answer candidates;
Learning device.

a learning device including: a first learning unit that uses a set of learning questions and feature words that characterize learning documents including learning answers to the learning questions as learning data to generate a generation model trained to generate feature words corresponding to the learning questions; and a second learning unit that uses a set of learning questions, learning documents including learning answers to the learning questions, and the learning answers as learning data to generate a machine reading comprehension model trained to output answer candidates corresponding to the learning questions and the learning documents and scores indicating the likelihood of the answer candidates;
a feature word generation unit having the generative model and using the generative model to generate feature words corresponding to an input question; a search unit executing a process of searching a literature database for documents containing words matching a word included in the input question based on the word included in the question, and a process of searching the literature database for documents containing words matching the feature words based on the feature words generated by the feature word generation unit; a machine reading comprehension unit having the machine reading comprehension model and using the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on the words included in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on the feature words corresponding to the question; and a prediction unit obtaining from the machine reading comprehension unit answer candidates corresponding to the input question and documents searched based on the words included in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on the feature words corresponding to the question, and scores indicating the likelihood of the answer candidates from the machine reading comprehension unit, and outputting a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate.
A question answer prediction system comprising:

a step in which a learning device uses a set of training questions and feature words that characterize training documents including training answers to the training questions as training data to generate a generative model trained to generate feature words corresponding to the training questions;
a step of generating a machine reading comprehension model trained by the learning device to use a set of learning questions, learning documents including learning answers to the learning questions, and the learning answers as learning data, and to output answer candidates corresponding to the learning questions and the learning documents and scores indicating the likelihoods of the answer candidates;
A step in which the search device generates feature words corresponding to an input question using the generation model;
the search device executes a process of searching a literature database for documents including words that match a word included in the input question based on the word, and a process of searching the literature database for documents including words that match the characteristic word based on the characteristic word;
a step of the search device using the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on words included in the question and scores indicating the likelihood of the answer candidates, and outputting answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and scores indicating the likelihood of the answer candidates;
the search device acquires answer candidates corresponding to the input question and documents searched based on words included in the question, and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and scores indicating the likelihood of the answer candidates, and outputs a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate;
The method for predicting an answer to a question includes the steps of:

a step in which the search device has a generation model trained to generate feature words corresponding to the training questions using a set of training questions and feature words that characterize training documents including training answers to the training questions as training data, and generates feature words corresponding to the input question using the generation model;
The search device executes a process of searching a literature database for documents containing words that match a word included in the input question based on the word, and a process of searching the literature database for documents containing words that match the generated characteristic word based on the generated characteristic word;
the search device has a machine reading comprehension model trained to output answer candidates corresponding to the learning questions and the learning documents and scores indicating the likelihood of the answer candidates, using a set of learning questions , learning documents including learning answers to the learning questions, and the learning answers as learning data, and uses the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on words contained in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and scores indicating the likelihood of the answer candidates;
the search device acquires answer candidates corresponding to the input question and documents searched based on words included in the question, and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and scores indicating the likelihood of the answer candidates, and outputs a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate;
A search method comprising:

The computer of the search device
a step of having a generative model trained to generate feature words corresponding to training questions using a set of training questions and feature words that characterize training documents including training answers to the training questions as training data, and generating feature words corresponding to an input question using the generative model;
A step of executing a process of searching a literature database for documents containing words that match a word included in the input question based on the word, and a process of searching the literature database for documents containing words that match the generated characteristic word based on the generated characteristic word;
a step of having a machine reading comprehension model trained to output answer candidates corresponding to the learning questions and the learning documents and scores indicating the likelihood of the answer candidates, using a set of learning questions, learning documents including learning answers to the learning questions, and the learning answers as learning data, and using the machine reading comprehension model to output answer candidates corresponding to the input question and documents searched based on words contained in the question and scores indicating the likelihood of the answer candidates, and answer candidates corresponding to the question and documents searched based on feature words corresponding to the question and scores indicating the likelihood of the answer candidates;
obtaining answer candidates corresponding to the input question and documents searched based on words included in the question, and scores indicating the likelihood of the answer candidates, and obtaining answer candidates corresponding to the question and documents searched based on feature words corresponding to the question, and scores indicating the likelihood of the answer candidates, and outputting a result of predicting an answer to the question based on a comparison of the scores indicating the likelihood of each answer candidate;
A program to execute.

a step in which a learning device uses a set of training questions and feature words that characterize training documents including training answers to the training questions as training data to generate a generative model trained to generate feature words corresponding to the training questions;
a step of generating a machine reading comprehension model trained by the learning device to use a set of learning questions, learning documents including learning answers to the learning questions, and the learning answers as learning data, and to output answer candidates corresponding to the learning questions and the learning documents and scores indicating the likelihoods of the answer candidates;
A learning method having:

The learning device computer
generating a generative model trained to generate feature words corresponding to the training questions, using a set of training questions and feature words that characterize training documents including training answers to the training questions as training data;
a step of generating a machine reading comprehension model trained by the learning device to use a set of learning questions, learning documents including learning answers to the learning questions, and the learning answers as learning data, and to output answer candidates corresponding to the learning questions and the learning documents and scores indicating the likelihoods of the answer candidates;
A program to execute.