JP7629844B2 - ANSWER CANDIDATE PROPOSAL SYSTEM AND ANSWER CANDIDATE PROPOSAL METHOD

Info

Publication number: JP7629844B2
Application number: JP2021206569A
Authority: JP
Inventors: 剛齊藤; 敦荻野
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2021-12-21
Filing date: 2021-12-21
Publication date: 2025-02-14
Anticipated expiration: 2041-12-21
Also published as: JP2023091791A

Description

本発明は、新規質問文章に対する回答文章の候補となる回答候補文章を生成する回答候補提案システムおよび回答候補提案方法に関する。 The present invention relates to an answer candidate suggestion system and an answer candidate suggestion method that generate answer candidate sentences that are candidates for answers to new question sentences.

インターネットや公衆通信網を介して得られたユーザの新規質問文章に対する回答文章の候補となる回答候補文章を生成する技術がある。例えば、特許文献１には、質問文字列を複数の形態素（単語）に分解し、得られた複数の形態素に基づいて、蓄積された過去の回答文字列群から回答文字列を選択して出力する技術が開示されている。 There is a technology that generates candidate answer sentences that serve as answers to new questions from users obtained via the Internet or public communication networks. For example, Patent Document 1 discloses a technology that breaks down a question string into multiple morphemes (words) and selects and outputs an answer string from a group of accumulated past answer strings based on the obtained multiple morphemes.

また、特許文献２には、質問文を形態素解析して複数の形態素に分解し、得られた複数の形態素から生成した検索クエリを用いて検索処理を行い、検索結果から回答候補の文の集合を抽出し、抽出した回答候補の文の集合に含まれる回答候補の文をランキングする技術が開示されている。 Patent Document 2 also discloses a technology that performs morphological analysis on a question sentence, breaks it down into multiple morphemes, performs a search process using a search query generated from the multiple morphemes obtained, extracts a set of candidate answer sentences from the search results, and ranks the candidate answer sentences included in the extracted set of candidate answer sentences.

特開２０１８－１８１０３３号公報JP 2018-181033 A 特開２０１３－２５４４２０号公報JP 2013-254420 A

ところで、新規質問文章が、質問の意図と関係のない単語を多く含む場合には、質問文書を形態素解析して得られる複数の形態素は、質問の意図と関係のない単語を多く含む。この場合に、特許文献１に記載の技術では、質問文字列を形態素解析して得られる、質問の意図と関係のない単語を多く含む複数の形態素に基づいて、過去の回答文字列群から回答文字列を選択して出力する。このため、出力する回答文字列は、質問の意図と関係のない多くの単語に関連する回答文字列となる。従って、特許文献１に記載の技術では、出力する回答文字列は、ユーザの質問の意図に沿わない、不適切な回答文字列となるおそれがある。 However, when a new question sentence contains many words unrelated to the intent of the question, the morphemes obtained by morphological analysis of that sentence also include many such words. In this case, the technology described in Patent Document 1 selects and outputs an answer string from the group of past answer strings based on those morphemes. The output answer string is therefore one that relates to many words unrelated to the intent of the question. Consequently, with the technology described in Patent Document 1, the output may be an inappropriate answer string that does not match the intent of the user's question.

また、上記の場合に、特許文献２に記載の技術では、質問文を形態素解析して得られる、質問の意図と関係のない単語を多く含む複数の形態素から検索クエリ生成し、検索クエリを用いて検索処理を行う。検索結果は、検索クエリに含まれる、複数の形態素に多く含まれる質問の意図と関係のない単語の影響を受ける。このため、検索結果から抽出される回答候補の文は、内容が質問の意図と関係が弱いおそれがある。従って、特許文献２に記載の技術では、回答候補の文は不適切な回答候補の文となるおそれがある。 In the same situation, the technology described in Patent Document 2 generates a search query from the morphemes obtained by morphological analysis of the question sentence, which include many words unrelated to the intent of the question, and performs a search using that query. The search results are affected by the many intent-unrelated words among the morphemes contained in the search query. As a result, the answer candidate sentences extracted from the search results may be only weakly related to the intent of the question. Therefore, with the technology described in Patent Document 2, the answer candidate sentences may be inappropriate.

そこで、本発明の目的は、新規質問文章に対して好適な回答候補文章を出力する回答候補提案システムおよび回答候補提案方法を提供することを目的とする。 The object of the present invention is to provide an answer candidate suggestion system and an answer candidate suggestion method that output suitable answer candidate sentences for new question sentences.

上記目的を達成するため、本発明の回答候補提案システムの一態様は、新規質問文章に対する回答文章の候補となる回答候補文章を生成する回答候補提案システムであって、プロセッサと、記憶装置とを備え、前記記憶装置は、過去の質問文章と、当該過去の質問文章に対する過去の回答文章を対応付けて保存する質問回答データベースを格納し、前記プロセッサは、前記新規質問文章が入力されると、前記質問回答データベースに保存された前記過去の質問文章および前記新規質問文章に基づいて項目候補単語群を生成し、さらに、生成された項目候補単語群からユーザが選択した項目単語群が入力されると、前記項目単語群と前記新規質問文章に基づいて、前記項目単語群を含む質問情報を生成し、前記質問回答データベースに保存された前記過去の質問文章それぞれに対して、前記質問情報との類似度を算出し、前記質問情報との類似度に基づいて前記質問回答データベースから前記質問情報に類似する過去の質問文章を抽出し、抽出した前記質問情報に類似する過去の質問文章に対応付けられた過去の回答文章を前記質問回答データベースから抽出して、第１の回答候補文章とする。 In order to achieve the above object, one aspect of the answer candidate suggestion system of the present invention is an answer candidate suggestion system that generates answer candidate sentences that are candidates for an answer sentence to a new question sentence, the system comprising a processor and a storage device. The storage device stores a question and answer database that stores past question sentences in association with the past answer sentences to those past question sentences. When the new question sentence is input, the processor generates an item candidate word group based on the new question sentence and the past question sentences stored in the question and answer database. Further, when an item word group selected by the user from the generated item candidate word group is input, the processor generates question information including the item word group based on the item word group and the new question sentence; calculates, for each of the past question sentences stored in the question and answer database, a similarity to the question information; extracts, based on that similarity, past question sentences similar to the question information from the question and answer database; and extracts from the question and answer database the past answer sentences associated with the extracted past question sentences, using them as first answer candidate sentences.

また、本発明の回答候補提案システムの回答候補提案方法の一態様は、新規質問文章に対する回答文章の候補となる回答候補文章を生成する回答候補提案システムにおける回答候補提案方法であって、回答候補提案システムの記憶装置は、過去の質問文章と、当該過去の質問文章に対する過去の回答文章を対応付けて保存する質問回答データベースを格納し、前記新規質問文章が入力されると、前記質問回答データベースに保存された前記過去の質問文章および前記新規質問文章に基づいて項目候補単語群を生成し、さらに、生成された項目候補単語群からユーザが選択した項目単語群が入力されると、前記項目単語群と前記新規質問文章に基づいて、前記項目単語群を含む質問情報を生成し、前記質問回答データベースに保存された前記過去の質問文章それぞれに対して、前記質問情報との類似度を算出し、前記質問情報との類似度に基づいて前記質問回答データベースから前記質問情報に類似する過去の質問文章を抽出し、抽出した前記質問情報に類似する過去の質問文章に対応付けられた過去の回答文章を前記質問回答データベースから抽出して、第１の回答候補文章とする。 One aspect of the answer candidate suggestion method of the present invention is an answer candidate suggestion method in an answer candidate suggestion system that generates answer candidate sentences that are candidates for an answer sentence to a new question sentence. A storage device of the answer candidate suggestion system stores a question and answer database that stores past question sentences in association with the past answer sentences to those past question sentences. In the method, when the new question sentence is input, an item candidate word group is generated based on the new question sentence and the past question sentences stored in the question and answer database. Further, when an item word group selected by a user from the generated item candidate word group is input, question information including the item word group is generated based on the item word group and the new question sentence; a similarity to the question information is calculated for each of the past question sentences stored in the question and answer database; past question sentences similar to the question information are extracted from the question and answer database based on that similarity; and the past answer sentences associated with the extracted past question sentences are extracted from the question and answer database and used as first answer candidate sentences.

本発明によれば、新規質問文章に対して好適な回答候補文章を出力できる。 The present invention makes it possible to output suitable answer candidate sentences for new question sentences.

実施例における回答候補提案システムの機能ブロック図の一例を示す図である。 FIG. 1 is a diagram showing an example of a functional block diagram of the answer candidate suggestion system in the embodiment.
実施例における回答候補提案システムのハードウェア構成例を示すブロック図である。 FIG. 2 is a block diagram showing an example of the hardware configuration of the answer candidate suggestion system in the embodiment.
質問回答データベースの一例を示す図である。 FIG. 3 is a diagram showing an example of the question and answer database.
疑問語要望語リストの一例を示す図である。 FIG. 4 is a diagram showing an example of the interrogative word/request word list.
個人情報語リストの一例を示す図である。 FIG. 5 is a diagram showing an example of the personal information word list.
補足単語リストの一例を示す図である。 FIG. 6 is a diagram showing an example of the supplementary word list.
項目候補単語テーブルの一例を示す図である。 FIG. 7 is a diagram showing an example of the item candidate word table.
実施例の項目候補単語群生成処理の例を示すフローチャートである。 FIG. 8 is a flowchart showing an example of the item candidate word group generation process of the embodiment.
ユーザ端末に表示される新規質問文章入力画面の一例を示す説明図である。 FIG. 9 is an explanatory diagram showing an example of a new question sentence input screen displayed on the user terminal.
ユーザ端末に表示される項目単語群選択画面の一例を示す説明図である。 FIG. 10 is an explanatory diagram showing an example of an item word group selection screen displayed on the user terminal.
オペレータ端末に表示される回答候補生成選択画面の一例を示す説明図である。 FIG. 11 is an explanatory diagram showing an example of an answer candidate generation selection screen displayed on the operator terminal.
回答候補提案システム１の回答候補文章生成処理の一例を示すフローチャートである。 FIG. 12 is a flowchart showing an example of the answer candidate sentence generation process of the answer candidate suggestion system 1.
オペレータ端末に表示される回答候補表示画面の一例を示す説明図である。 FIG. 13 is an explanatory diagram showing an example of an answer candidate display screen displayed on the operator terminal.

以下、図面を参照して本発明の実施の形態を説明する。実施例は、本発明を説明するための例示であって、説明の明確化のため、適宜、省略および簡略化がなされている。本発明は、他の種々の形態でも実施することが可能である。特に限定しない限り、各構成要素は単数でも複数でも構わない。 The following describes embodiments of the present invention with reference to the drawings. The embodiments are illustrative for explaining the present invention, and some parts have been omitted or simplified as appropriate for clarity of explanation. The present invention can also be implemented in various other forms. Unless otherwise specified, each component may be singular or plural.

図面において示す各構成要素の位置、大きさ、形状、範囲などは、発明の理解を容易にするため、実際の位置、大きさ、形状、範囲などを表していない場合がある。このため、本発明は、必ずしも、図面に開示された位置、大きさ、形状、範囲などに限定されない。 The position, size, shape, range, etc. of each component shown in the drawings may not represent the actual position, size, shape, range, etc., in order to facilitate understanding of the invention. Therefore, the present invention is not necessarily limited to the position, size, shape, range, etc. disclosed in the drawings.

各種情報の例として、「テーブル」、「リスト」、「キュー」等の表現にて説明することがあるが、各種情報はこれら以外のデータ構造で表現されてもよい。例えば、「ＸＸテーブル」、「ＸＸリスト」、「ＸＸキュー」等の各種情報は、「ＸＸ情報」としてもよい。識別情報について説明する際に、「識別情報」、「識別子」、「名」、「ＩＤ」、「番号」等の表現を用いるが、これらについてはお互いに置換が可能である。 As examples of various types of information, expressions such as "table," "list," and "queue" may be used, but the various types of information may be expressed in other data structures. For example, various types of information such as "XX table," "XX list," and "XX queue" may be expressed as "XX information." When explaining identification information, expressions such as "identification information," "identifier," "name," "ID," and "number" are used, but these are interchangeable.

同一あるいは同様の機能を有する構成要素が複数ある場合には、同一の符号に異なる添字を付して説明する場合がある。また、これらの複数の構成要素を区別する必要がない場合には、添字を省略して説明する場合がある。 When there are multiple components with the same or similar functions, they may be described using the same reference numerals with different subscripts. Also, when there is no need to distinguish between these multiple components, the subscripts may be omitted.

実施例において、プログラムを実行して行う処理について説明する場合がある。ここで、計算機は、プロセッサ（例えばＣＰＵ、ＧＰＵ）によりプログラムを実行し、記憶資源（例えばメモリ）やインターフェースデバイス（例えば通信ポート）等を用いながら、プログラムで定められた処理を行う。そのため、プログラムを実行して行う処理の主体を、プロセッサとしてもよい。同様に、プログラムを実行して行う処理の主体が、プロセッサを有するコントローラ、装置、システム、計算機、ノードであってもよい。プログラムを実行して行う処理の主体は、演算部であれば良く、特定の処理を行う専用回路を含んでいてもよい。ここで、専用回路とは、例えばＦＰＧＡ（Field Programmable Gate Array）やＡＳＩＣ（Application Specific Integrated Circuit）、ＣＰＬＤ（Complex Programmable Logic Device）等である。 In the embodiments, the processing performed by executing a program may be described. Here, the computer executes the program using a processor (e.g., CPU, GPU), and performs the processing defined by the program using storage resources (e.g., memory) and interface devices (e.g., communication ports). Therefore, the subject of the processing performed by executing the program may be the processor. Similarly, the subject of the processing performed by executing the program may be a controller, device, system, computer, or node having a processor. The subject of the processing performed by executing the program may be a calculation unit, and may include a dedicated circuit that performs specific processing. Here, the dedicated circuit is, for example, an FPGA (Field Programmable Gate Array), an ASIC (Application Specific Integrated Circuit), or a CPLD (Complex Programmable Logic Device).

プログラムは、プログラムソースから計算機にインストールされてもよい。プログラムソースは、例えば、プログラム配布サーバまたは計算機が読み取り可能な記憶メディアであってもよい。プログラムソースがプログラム配布サーバの場合、プログラム配布サーバはプロセッサと配布対象のプログラムを記憶する記憶資源を含み、プログラム配布サーバのプロセッサが配布対象のプログラムを他の計算機に配布してもよい。また、実施例において、２以上のプログラムが１つのプログラムとして実現されてもよいし、１つのプログラムが２以上のプログラムとして実現されてもよい。 The program may be installed on the computer from a program source. The program source may be, for example, a program distribution server or a computer-readable storage medium. When the program source is a program distribution server, the program distribution server may include a processor and a storage resource that stores the program to be distributed, and the processor of the program distribution server may distribute the program to be distributed to other computers. In addition, in the embodiments, two or more programs may be realized as one program, and one program may be realized as two or more programs.

実施例の回答候補提案システム１は、新規質問文章および項目単語群が入力されると、新規質問文章および項目単語群に基づいて、新規質問文章に対する回答文章の候補となる回答候補文章（以下で説明する、第１の回答候補文章、第２の回答候補文章）を生成する。 In the embodiment, when a new question sentence and an item word group are input, the answer candidate suggestion system 1 generates, based on the new question sentence and the item word group, answer candidate sentences (the first answer candidate sentences and second answer candidate sentences described below) that are candidates for the answer sentence to the new question sentence.
項目単語群とは、項目候補単語群から選択された、新規質問文章に関する単語である。 The item word group consists of words related to the new question sentence that have been selected from the item candidate word group.
項目候補単語群とは、質問回答データベース２１(後述)に保存された過去の質問文章および新規質問文章に基づいて生成された、新規質問文章に関する複数の単語（項目候補単語）である。なお、「～単語群」との記載は、少なくとも１つの「～単語」を意味する。 The item candidate word group is a set of words (item candidate words) related to the new question sentence, generated based on the new question sentence and the past question sentences stored in the question and answer database 21 (described later). Note that the expression "... word group" means at least one "... word."

＜システム構成＞ <System Configuration>
図１は、実施例における回答候補提案システム１の機能ブロック図の一例を示す図である。図１に示すように、回答候補提案システム１は、ユーザ端末２と、オペレータ端末３と、ウェブ検索エンジン４とに、ネットワークＮＷを介して接続されている。 FIG. 1 is a diagram showing an example of a functional block diagram of the answer candidate suggestion system 1 in the embodiment. As shown in FIG. 1, the answer candidate suggestion system 1 is connected to a user terminal 2, an operator terminal 3, and a web search engine 4 via a network NW.

ユーザ端末２は、問題文章を入力するユーザに操作される。ユーザ端末２は、ユーザから入力を受け付ける入力装置と、ディスプレイやタッチパネルなどの情報を表示する出力装置を備える。ユーザ端末２は、ネットワークＮＷを介して、回答候補提案システム１やオペレータ端末３と情報の送受信ができる。また、ユーザ端末２は、回答候補提案システム１やオペレータ端末３から受信した情報を表示できる。そして、ユーザ端末２は、ユーザから入力された情報を回答候補提案システム１やオペレータ端末３に送信できる。 The user terminal 2 is operated by a user who inputs a question sentence. The user terminal 2 has an input device that accepts input from the user, and an output device such as a display or touch panel that displays information. The user terminal 2 can send and receive information to and from the answer candidate suggestion system 1 and the operator terminal 3 via the network NW. The user terminal 2 can also display information received from the answer candidate suggestion system 1 and the operator terminal 3. The user terminal 2 can then transmit information input by the user to the answer candidate suggestion system 1 and the operator terminal 3.

オペレータ端末３は、オペレータに操作され、オペレータからの入力を受け付ける入力装置と、ディスプレイやタッチパネルなどの情報を表示する出力装置を備えている。オペレータ端末３は、ネットワークＮＷを介して、回答候補提案システム１やユーザ端末２と情報の送受信ができる。オペレータ端末３は、回答候補提案システム１を利用するヘルプデスクに設置されるほか、例えばヘルプデスクの委託業者等が保有してもよい。オペレータ端末３として、例えば、パーソナルコンピュータ等の電子機器が用いられる。 The operator terminal 3 is operated by an operator and includes an input device that accepts input from the operator, and an output device such as a display or touch panel that displays information. The operator terminal 3 can send and receive information to and from the answer candidate suggestion system 1 and the user terminal 2 via the network NW. The operator terminal 3 is installed in a help desk that uses the answer candidate suggestion system 1, and may also be owned by, for example, a contractor of the help desk. For example, an electronic device such as a personal computer is used as the operator terminal 3.

ウェブ検索エンジン４は、ネットワークＮＷを介して、少なくとも１つの単語を受信すると、受信した単語に関するＷＥＢサイトの情報を含む検索結果を返す。検索結果に含まれるＷＥＢサイトの情報には、ＷＥＢサイトの概要文やＵＲＬが含まれる。ここで、概要文とは、ウェブ検索エンジン等にて生成された、各ＷＥＢサイトの概要文章（例えば、１００字程度）であり、スニペットと称される場合もある。 When the web search engine 4 receives at least one word via the network NW, it returns search results including information about websites related to the received word. The website information included in the search results includes a summary of the website and the URL. Here, the summary is a summary sentence (e.g., about 100 characters) of each website generated by a web search engine, etc., and is sometimes called a snippet.

ネットワークＮＷは、有線のネットワークでもよいし、無線のネットワークでもよい。また、ネットワークＮＷは、インターネットのようなグローバルネットワークであってもよいし、構内ネットワーク（ＬＡＮ：Local Area Network）であってもよい。 The network NW may be a wired network or a wireless network. The network NW may also be a global network such as the Internet, or a local area network (LAN).

回答候補提案システム１は、項目候補単語群生成部１１と、回答候補文章生成部１２とを備えている。また、回答候補提案システム１は、質問回答データベース２１と、疑問詞要望語リスト２２と、個人情報単語リスト２３と、補足単語リスト２４と、項目候補単語テーブル２５と、を格納している。 The answer candidate suggestion system 1 includes an item candidate word group generation unit 11 and an answer candidate sentence generation unit 12. The answer candidate suggestion system 1 also stores a question and answer database 21, an interrogative word/request word list 22, a personal information word list 23, a supplementary word list 24, and an item candidate word table 25.

項目候補単語群生成部１１は、詳細は図８のフローチャートを用いて後述するが、ユーザ端末２のユーザが入力した新規質問文章が回答候補提案システム１に入力された場合に、新規質問文章に関する項目候補単語群を生成する。さらに、項目候補単語群生成部１１は、生成した項目候補単語群を後述するネットワークＩ／Ｆ３６（送受信装置）に出力して、ネットワークＩ／Ｆ３６に項目候補単語群をネットワークＮＷ介してユーザ端末２に送信させる。 The item candidate word group generation unit 11, the details of which will be described later using the flowchart in FIG. 8, generates an item candidate word group related to a new question sentence when the new question sentence input by the user of the user terminal 2 is input to the answer candidate suggestion system 1. Furthermore, the item candidate word group generation unit 11 outputs the generated item candidate word group to a network I/F 36 (transmitting/receiving device) described later, and causes the network I/F 36 to transmit the item candidate word group to the user terminal 2 via the network NW.

回答候補文章生成部１２は、詳細は図１２のフローチャートを用いて後述するが、項目単語群および新規質問文章が回答候補提案システム１に入力された場合に、回答候補文章（第１の回答候補文章、第２の回答候補文章）を生成する。そして、回答候補文章生成部１２は、生成した回答候補文章を、後述するネットワークＩ／Ｆ３６（送受信装置）に出力して、ネットワークＩ／Ｆ３６に、回答候補文章をネットワークＮＷ介してオペレータ端末３に送信させる。 The answer candidate sentence generation unit 12, the details of which will be described later using the flowchart in FIG. 12, generates answer candidate sentences (first answer candidate sentence, second answer candidate sentence) when a group of item words and a new question sentence are input to the answer candidate suggestion system 1. The answer candidate sentence generation unit 12 then outputs the generated answer candidate sentences to a network I/F 36 (transmitting/receiving device) described later, and causes the network I/F 36 to transmit the answer candidate sentences to the operator terminal 3 via the network NW.

質問回答データベース２１は、詳細は図３を用いて後述するが、過去の質問文章と、過去の質問文章に対する過去の回答文章と、過去の質問文章のｔｆｉｄｆベクトルと、を対応付けて格納する。 The question and answer database 21, the details of which will be described later with reference to FIG. 3, stores past question sentences, past answer sentences to the past question sentences, and tfidf vectors of the past question sentences in association with each other.

疑問詞要望語リスト２２は、詳細は図４を用いて後述するが、疑問があることを表す疑問詞および要望があることを表す要望語を保存するデータベースである。 The interrogative word/request word list 22, details of which will be described later with reference to FIG. 4, is a database that stores interrogative words that express doubts and request words that express requests.

個人情報単語リスト２３は、詳細は図５を用いて後述するが、個人情報を表す複数の個人情報単語を保存するデータベースである。 The personal information word list 23 is a database that stores multiple personal information words that represent personal information, details of which will be described later using Figure 5.

補足単語リスト２４は、詳細は図６を用いて後述するが、補足単語を保存するデータベースである。 The supplementary word list 24 is a database that stores supplementary words, the details of which will be described later using FIG. 6.

項目候補単語テーブル２５は、詳細は図７を用いて後述するが、過去の質問文章に含まれる単語（特に動詞）と、項目候補単語群とを対応付けて保存しているデータベースである。 The item candidate word table 25, details of which will be described later with reference to FIG. 7, is a database that stores words (particularly verbs) contained in past question sentences in association with item candidate word groups.

図２は、回答候補提案システム１のハードウェア構成例を示すブロック図である。図２に示すように、回答候補提案システム１は、プロセッサ３１、主記憶装置３２、副記憶装置３３、入力装置３４、出力装置３５、ネットワークＩ／Ｆ３６、これらを接続するバス３７を有している。回答候補提案システム１は、例えばＰＣやサーバーコンピューターのような一般的な情報処理装置で実現できる。 Figure 2 is a block diagram showing an example of the hardware configuration of the answer candidate suggestion system 1. As shown in Figure 2, the answer candidate suggestion system 1 has a processor 31, a main memory device 32, a sub-memory device 33, an input device 34, an output device 35, a network I/F 36, and a bus 37 that connects these. The answer candidate suggestion system 1 can be realized by a general information processing device such as a PC or a server computer.

プロセッサ３１は、副記憶装置３３に記憶されたデータやプログラムを主記憶装置３２に読み出して、プログラムによって定められた処理を実行する。 The processor 31 reads data and programs stored in the secondary storage device 33 into the main storage device 32 and executes the processing defined by the programs.

主記憶装置３２は、ＲＡＭなどで、揮発性記憶素子を有し、プロセッサ３１が実行するプログラムや、データを記憶する。 The main memory device 32 has volatile memory elements such as RAM, and stores the programs and data executed by the processor 31.

副記憶装置３３は、ＨＤＤ（Hard Disk Drive）やＳＳＤ（Solid State Drive）などで、不揮発性記憶素子を有し、プログラムやデータ等を記憶する装置である。副記憶装置３３には、上述した、質問回答データベース２１と、疑問詞要望語リスト２２と、個人情報単語リスト２３と、補足単語リスト２４と、項目候補単語テーブル２５と、を格納している。 The secondary storage device 33 is a device that has non-volatile memory elements, such as a hard disk drive (HDD) or a solid state drive (SSD), and stores programs, data, etc. The secondary storage device 33 stores the above-mentioned question and answer database 21, interrogative word and request word list 22, personal information word list 23, supplementary word list 24, and item candidate word table 25.

また、副記憶装置３３には、項目候補単語群生成プログラム１１ａと、回答候補文章生成プログラム１２ａと、がインストールされている。図１を用いて上述した、項目候補単語群生成部１１と、回答候補文章生成部１２とは、副記憶装置３３に記憶されている項目候補単語群生成プログラム１１ａと、回答候補文章生成プログラム１２ａとを、プロセッサ３１が主記憶装置３２に読み出して実行することにより実現される。 In addition, the sub-storage device 33 has installed therein an item candidate word group generation program 11a and an answer candidate sentence generation program 12a. The item candidate word group generation unit 11 and the answer candidate sentence generation unit 12 described above with reference to FIG. 1 are realized by the processor 31 reading the item candidate word group generation program 11a and the answer candidate sentence generation program 12a stored in the sub-storage device 33 into the main storage device 32 and executing them.

入力装置３４は、キーボードやマウスなどのユーザの操作を受け付ける装置であり、ユーザの操作により入力された情報を取得する。出力装置３５は、ディスプレイなど情報を出力する装置であり、例えば画面への表示により情報をユーザに提示する。 The input device 34 is a device that accepts user operations such as a keyboard or mouse, and acquires information input by user operations. The output device 35 is a device that outputs information such as a display, and presents information to the user by displaying it on a screen, for example.

ネットワークＩ／Ｆ３６は、ユーザ端末２や、オペレータ端末３や、ウェブ検索エンジン４等の装置と、ネットワークＮＷを介してデータを送受信するためのインターフェースである。すなわち、ネットワークＩ／Ｆ３６は、ネットワークＮＷを介して、ユーザ端末２、オペレータ端末３、ウェブ検索エンジン４に情報の送受信が可能な送受信装置である。回答候補提案システム１は、ネットワークＩ／Ｆ３６を用いて、ネットワークＮＷに接続されているユーザ端末２や、オペレータ端末３や、ウェブ検索エンジン４等の装置とデータの送受信を行うことができる。 The network I/F 36 is an interface for transmitting and receiving data via the network NW with devices such as the user terminal 2, the operator terminal 3, and the web search engine 4. In other words, the network I/F 36 is a transmitting/receiving device capable of transmitting and receiving information to and from the user terminal 2, the operator terminal 3, and the web search engine 4 via the network NW. The answer candidate suggestion system 1 can use the network I/F 36 to transmit and receive data with devices such as the user terminal 2, the operator terminal 3, and the web search engine 4 that are connected to the network NW.

ユーザ端末２およびオペレータ端末３は、回答候補提案システム１と同様のハードウェア資源を使用することで構成できる。 The user terminal 2 and the operator terminal 3 can be configured using hardware resources similar to those of the answer candidate suggestion system 1.

＜各種データ構造＞ <Various Data Structures>
図３は、質問回答データベース２１の一例を示す図である。図３に示す質問回答データベース２１では、質問回答ＩＤ３０１は、過去の質問文章３０２を識別するＩＤである。回答文章３０３、ｔｆｉｄｆベクトル３０４（詳細は後述する）は、過去の質問文章３０２に対応付けられている。この様に、質問回答データベース２１は、過去の質問文章３０２と、当該過去の質問文章３０２に対する過去の回答文章３０３を対応付けて保存する。ｔｆｉｄｆベクトル３０４は、質問回答データベース２１に、新たに過去の質問文章と過去の回答文章との組が保存される度に、質問回答データベース２１に保存された全ての過去の質問文章に対して生成してもよい。また、例えば、過去の質問文章と過去の回答文章との組が所定の数、質問回答データベース２１に保存される毎等、あらかじめ設定したタイミングでｔｆｉｄｆベクトル３０４を生成し直しても良い。 FIG. 3 is a diagram showing an example of the question and answer database 21. In the question and answer database 21 shown in FIG. 3, a question and answer ID 301 is an ID that identifies a past question sentence 302. An answer sentence 303 and a tfidf vector 304 (described in detail later) are associated with the past question sentence 302. In this manner, the question and answer database 21 stores each past question sentence 302 in association with the past answer sentence 303 to that question. The tfidf vectors 304 may be generated for all past question sentences stored in the question and answer database 21 each time a new pair of a past question sentence and a past answer sentence is stored. Alternatively, the tfidf vectors 304 may be regenerated at a preset timing, for example, each time a predetermined number of such pairs have been stored in the question and answer database 21.
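As an illustration only, the record layout of FIG. 3 and the two regeneration timings mentioned above could be modeled as follows. This is a minimal sketch: the names `QARecord`, `QADatabase`, `regenerate_every`, and all field names are assumptions introduced here, and representing the tfidf vector 304 as a word-to-weight dictionary is likewise an assumption rather than something the description specifies.

```python
from dataclasses import dataclass, field


@dataclass
class QARecord:
    """One row of the question and answer database (hypothetical field names)."""
    qa_id: int        # question and answer ID 301
    question: str     # past question sentence 302
    answer: str       # past answer sentence 303
    tfidf: dict[str, float] = field(default_factory=dict)  # tfidf vector 304 (word -> weight)


class QADatabase:
    """In-memory stand-in for the question and answer database 21."""

    def __init__(self, regenerate_every: int = 1):
        self.records: list[QARecord] = []
        # 1 = regenerate on every insertion; k > 1 = regenerate after every k-th pair.
        self.regenerate_every = regenerate_every
        self.regenerations = 0

    def add(self, question: str, answer: str) -> QARecord:
        record = QARecord(len(self.records) + 1, question, answer)
        self.records.append(record)
        if len(self.records) % self.regenerate_every == 0:
            self.regenerate_vectors()
        return record

    def regenerate_vectors(self) -> None:
        # Placeholder: a real system would recompute tfidf vector 304
        # for all stored past question sentences at this point.
        self.regenerations += 1
```

With `regenerate_every=1` the sketch follows the "every time a new pair is stored" option; a larger value follows the "every predetermined number of pairs" option.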

図４は、疑問詞要望語リスト２２の一例を示す図である。図４に示す疑問詞要望語リスト２２では、疑問詞要望語ＩＤ４０１は、疑問詞要望語４０２を識別するＩＤである。疑問詞要望語４０２は、疑問があることを表す疑問詞または要望があることを表す要望語である。図４には、疑問詞要望語４０２の例として、「しょうか」と「下さい」を示した。他の疑問詞要望語４０２の例として、「すか」、「のか」、「んか」、「なの」、「だれ」、「なに」、「何」、「どこ」、「いつ」、「いくつ」、「いくら」、「どう」、「なぜ」、「いか」、「どの」、「だれ」、「誰」、「どなた」、「何」、「どれ」、「どんな」、「いかなる」、「ほしい」、「欲しい」、「ください」、「たい」、「求」、「頼」、「？」が挙げられる。 FIG. 4 is a diagram showing an example of the interrogative word/request word list 22. In the interrogative word/request word list 22 shown in FIG. 4, an interrogative/request word ID 401 is an ID that identifies an interrogative/request word 402. An interrogative/request word 402 is either an interrogative word, which indicates that there is a question, or a request word, which indicates that there is a request. FIG. 4 shows 「しょうか」 ("shouka") and 「下さい」 ("kudasai") as examples of the interrogative/request word 402. Other examples of the interrogative/request word 402 include 「すか」, 「のか」, 「んか」, 「なの」, 「だれ」, 「なに」, 「何」, 「どこ」, 「いつ」, 「いくつ」, 「いくら」, 「どう」, 「なぜ」, 「いか」, 「どの」, 「だれ」, 「誰」, 「どなた」, 「何」, 「どれ」, 「どんな」, 「いかなる」, 「ほしい」, 「欲しい」, 「ください」, 「たい」, 「求」, 「頼」, and 「？」. These cover question endings, interrogatives such as "who", "what", "where", "when", "how many", "how much", "how", and "why", and request expressions such as "want" and "please".

図５は、個人情報単語リスト２３の一例を示す図である。図５に示す個人情報単語リスト２３では、個人情報単語ＩＤ５０１は、個人情報単語５０２を識別するＩＤである。個人情報単語５０２は、個人情報を表す複数の個人情報単語である。図５には、個人情報単語５０２の例として、郵便番号として「＊＊＊－＊＊＊＊」と、電話番号として「＊＊＊－＊＊＊＊－＊＊＊＊」を示した。個人情報単語５０２の他の例として、郵便番号として「＊＊＊＊＊＊＊」、電話番号として「＊＊＊＊＊＊＊＊＊＊＊」、「カード番号」、「生年月日」、「メールアドレス」、「人名」、「住所」が挙げられる。なお、以上で「＊」は、一文字の数字を表す。 FIG. 5 is a diagram showing an example of the personal information word list 23. In the personal information word list 23 shown in FIG. 5, a personal information word ID 501 is an ID that identifies a personal information word 502. The personal information words 502 are a plurality of words that represent personal information. As examples of the personal information word 502, FIG. 5 shows "***-****" as a postal code and "***-****-****" as a telephone number. Other examples of the personal information word 502 include "*******" as a postal code, "***********" as a telephone number, "card number", "date of birth", "email address", "personal name", and "address". In the above, "*" represents a single digit.
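Because "*" stands for a single digit, the mask-style entries of the personal information word list 23 map naturally onto regular expressions. The following is a minimal sketch under assumptions: the helper names `mask_to_regex` and `find_personal_info` are hypothetical, and using the list to detect matching substrings is an assumption made for illustration, since this passage does not state how the list is applied.

```python
import re

# Mask-style entries taken from the text above; "*" stands for exactly one digit.
MASKS = ["***-****", "***-****-****", "*******", "***********"]


def mask_to_regex(mask: str) -> re.Pattern:
    """Convert a mask such as '***-****' into a compiled regular expression."""
    body = "".join(r"\d" if ch == "*" else re.escape(ch) for ch in mask)
    # The lookarounds keep a mask from matching inside a longer digit/hyphen run.
    return re.compile(rf"(?<![\d-]){body}(?![\d-])")


PATTERNS = [mask_to_regex(m) for m in MASKS]


def find_personal_info(text: str) -> list[str]:
    """Return every substring of text that matches one of the masks."""
    hits = []
    for pattern in PATTERNS:
        hits.extend(m.group(0) for m in pattern.finditer(text))
    return hits
```

Keyword-style entries such as "card number" or "email address" would need a different treatment (for example, dictionary lookup or a named-entity recognizer) and are outside this sketch.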

図６は、補足単語リスト２４の一例を示す図である。図６に示す補足単語リスト２４では、補足単語ＩＤ６０１は、補足単語６０２を識別するＩＤである。補足単語６０２とは、新規質問文章に含まれる質問の意図に関して重要な意味をもつ場合が多いと考えられる単語である。なおかつ、補足単語６０２は、ｔｆ－ｉｄｆ法の後述する「（Ａ）文章に含まれる単語の重要度を算出し、文章に含まれる重要単語を抽出する重要単語抽出方法」で、重要度が低く算出され、重要単語として抽出されない場合が多いと考えられる単語（形態素）である。図６には、補足単語６０２の例として、「ない」を示した。 FIG. 6 is a diagram showing an example of the supplementary word list 24. In the supplementary word list 24 shown in FIG. 6, a supplementary word ID 601 is an ID that identifies a supplementary word 602. A supplementary word 602 is a word that is considered to often carry important meaning with respect to the intent of the question contained in a new question sentence. At the same time, it is a word (morpheme) that is considered likely to receive a low importance score, and therefore not to be extracted as an important word, in "(A) the important word extraction method that calculates the importance of words contained in a sentence and extracts the important words contained in the sentence" of the tf-idf method described later. FIG. 6 shows 「ない」 ("nai", a negation corresponding to "not") as an example of a supplementary word 602.

図７は、項目候補単語テーブル２５の一例を示す図である。図７に示す項目候補単語テーブル２５では、項目候補単語ＩＤ７０１は、単語７０２を識別するＩＤである。項目候補単語群７０３は、単語７０２に対応付けられている。項目候補単語テーブル２５は、例えば、次の様に、質問回答データベース２１に保存された過去の質問文章に基づいて生成される。まず、質問回答データベース２１に保存されている、過去の質問文章それぞれに対して、疑問詞要望語リスト２２に保存されている疑問詞および要望語を少なくとも１つ含む疑問要望文を抽出する。次に、抽出した疑問要望文それぞれを形態素解析し、疑問要望文に含まれる動詞を項目候補単語テーブル２５の単語７０２とし、疑問要望文に含まれる少なくとも１つの名詞を項目候補単語群７０３として、項目候補単語テーブル２５に保存する。この様に、項目候補単語テーブル２５の項目候補単語群７０３は、質問回答データベース２１に保存された過去の質問文章に基づいて生成されている。 FIG. 7 is a diagram showing an example of the item candidate word table 25. In the item candidate word table 25 shown in FIG. 7, an item candidate word ID 701 is an ID that identifies a word 702. An item candidate word group 703 is associated with the word 702. The item candidate word table 25 is generated from the past question sentences stored in the question and answer database 21, for example, as follows. First, from each past question sentence stored in the question and answer database 21, interrogative/request sentences are extracted, that is, sentences containing at least one of the interrogative words and request words stored in the interrogative word/request word list 22. Next, each extracted interrogative/request sentence is subjected to morphological analysis; a verb contained in the sentence is stored in the item candidate word table 25 as the word 702, and at least one noun contained in the sentence is stored as the item candidate word group 703. In this way, the item candidate word groups 703 in the item candidate word table 25 are generated based on the past question sentences stored in the question and answer database 21.
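The two-step table construction described above can be sketched as follows. This is a hedged illustration, not the patented implementation: the function names are hypothetical, the input is assumed to be already segmented and tagged with parts of speech (in practice a Japanese morphological analyzer would supply the tokens), the word list is a small excerpt, and merging nouns from several sentences under the same verb is an assumption.

```python
from collections import defaultdict

# Small excerpt of the interrogative word/request word list 22 (illustrative only).
INTERROGATIVE_REQUEST_WORDS = ["しょうか", "下さい", "ください", "？"]

# (surface form, part of speech); the tags "verb"/"noun" are an assumed convention.
Token = tuple[str, str]


def is_interrogative_request(sentence: str) -> bool:
    """A sentence qualifies if it contains at least one listed word."""
    return any(word in sentence for word in INTERROGATIVE_REQUEST_WORDS)


def build_item_candidate_table(
    sentences: list[tuple[str, list[Token]]],
) -> dict[str, list[str]]:
    """Map each verb (word 702) to the nouns (item candidate word group 703)
    found in the same interrogative/request sentence."""
    table: dict[str, list[str]] = defaultdict(list)
    for text, tokens in sentences:
        if not is_interrogative_request(text):
            continue  # step 1: keep only interrogative/request sentences
        verbs = [surface for surface, pos in tokens if pos == "verb"]
        nouns = [surface for surface, pos in tokens if pos == "noun"]
        for verb in verbs:  # step 2: verb -> nouns
            for noun in nouns:
                if noun not in table[verb]:
                    table[verb].append(noun)
    return dict(table)
```

For example, a hand-tokenized sentence 「パスワードを変更する方法を教えて下さい」 with the verb 「変更する」 and the nouns 「パスワード」 and 「方法」 yields one table row keyed by the verb, while a sentence containing none of the listed words is skipped.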

＜ｔｆ－ｉｄｆ法＞ <tf-idf method>
回答候補提案システム１は、新規質問文章に基づいて、項目候補単語群や、第１の回答候補文章や、第２の回答候補文章を生成する過程で、ｔｆ－ｉｄｆ法（単語頻度逆文書頻度法）の重要度と、コサイン類似度を算出する。重要度は、文章に含まれる単語の重要度である。一方、コサイン類似度は、文章と文章の類似度である。以下では、ｔｆ－ｉｄｆ法において、（Ａ）文章に含まれる単語の重要度を算出し、文章に含まれる重要単語を抽出する重要単語抽出方法と、（Ｂ）文章と文章のコサイン類似度を算出し、対象とする文章に類似する類似文章を抽出する類似文書抽出方法と、の概要を以下に説明する。（Ａ）における重要度の算出、（Ｂ）におけるコサイン類似度の算出では、複数の文章が格納されたデータベース（本実施例では質問回答データベース２１）を使用する。 In the process of generating the item candidate word group, the first answer candidate sentence, and the second answer candidate sentence based on a new question sentence, the answer candidate suggestion system 1 calculates the importance and the cosine similarity of the tf-idf method (term frequency-inverse document frequency method). The importance is the importance of a word contained in a sentence, whereas the cosine similarity is the similarity between two sentences. The following gives an overview of two procedures of the tf-idf method: (A) an important word extraction method that calculates the importance of the words contained in a sentence and extracts the important words, and (B) a similar document extraction method that calculates the cosine similarity between sentences and extracts sentences similar to a target sentence. Both the calculation of the importance in (A) and the calculation of the cosine similarity in (B) use a database in which a plurality of sentences is stored (the question and answer database 21 in this embodiment).

（Ａ）文章に含まれる単語の重要度を算出し、文章に含まれる重要単語を抽出する重要単語抽出方法では、重要度を対象とする文章中の全ての単語に対して算出する。 (A) In a method for extracting important words that calculates the importance of words contained in a sentence and extracts important words contained in the sentence, the importance is calculated for all words in the target sentence.

単語の重要度（ｔｆｉｄｆ値とする）は、ｔｆとｉｄｆの積である。まず、文章を形態素解析し、文章を形態素（単語）に分解する。そして、ｔｆを算出する。文章中の全単語数をＮ、重要度算出対象の単語の文章中の出現回数をｎとすると、ｔｆは、例えばｔｆ＝ｎ／Ｎで表される。ｔｆは文章での単語の出現回数の多さを表す。またｔｆでは、文章中の出現回数ｎの多い単語程、重要とみなす。 The importance of a word (referred to as the tfidf value) is the product of tf and idf. First, the sentence is subjected to morphological analysis to break the sentence down into morphemes (words). Then, tf is calculated. If the total number of words in the sentence is N, and the number of times the word for which importance is to be calculated appears in the sentence is n, then tf can be expressed, for example, as tf = n/N. tf represents the number of times a word appears in a sentence. In addition, with tf, the more times a word appears in a sentence, n, the more important it is considered to be.

次に、ｉｄｆを算出する。データベースに格納された文章の数をＤとし、重要度算出対象の単語を含む文章の数をｄとする。ｉｄｆは、例えば、ｉｄｆ＝－ｌｏｇ（ｄ／Ｄ）で表される。これを、ｉｄｆ＝ｌｏｇ（Ｄ／ｄ）と表すこともできる。重要度算出対象の単語を含む文章の数ｄが小さい程、ｉｄｆ＝ｌｏｇ（Ｄ／ｄ）は大きくなる。ｉｄｆは、データベースに格納されている全文章中で、重要度算出対象の単語を含む文章の数ｄの少なさを表す。ｉｄｆでは、対象の単語を含む文章の数ｄが小さい単語程、重要とみなす。 Next, idf is calculated. Let D be the number of sentences stored in the database, and d be the number of sentences containing the word for which importance is to be calculated. idf is expressed, for example, as idf = -log(d/D). This can also be expressed as idf = log(D/d). The smaller the number d of sentences containing the word for which importance is to be calculated, the larger idf = log(D/d). idf indicates how few the number d of sentences containing the word for which importance is to be calculated is among all the sentences stored in the database. With idf, the smaller the number d of sentences containing the target word, the more important the word is considered to be.

単語の重要度は、ｔｆｉｄｆ値＝ｔｆ・ｉｄｆ＝ｎ／Ｎ・（－ｌｏｇ（ｄ／Ｄ））である。そして、文章中の全ての単語に対してｔｆｉｄｆ値（重要度）を算出する。そして、ｔｆｉｄｆ値の高い単語のうち、上位から所定の割合（または所定の数）の単語を、重要単語とする。 The importance of a word is tfidf value = tf * idf = n/N * (-log(d/D)). Then, the tfidf value (importance) is calculated for all words in the sentence. Then, among the words with high tfidf values, a certain percentage (or a certain number) of the top words are determined to be important words.
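As a rough illustration of procedure (A), the sketch below computes tf = n/N and idf = log(D/d) for a tokenized sentence and returns the top-ranked words. The function names are hypothetical, the input is assumed to be already split into morphemes, and the handling of a word that appears in no database sentence (d = 0) is an assumption, since the text does not address that case.

```python
import math
from collections import Counter


def tfidf_scores(target, database):
    """Return {word: tf * idf} for a tokenized target sentence.

    target:   list of words (morphemes) of the sentence being scored.
    database: list of tokenized sentences (the stored sentences).
    """
    counts = Counter(target)
    total_words = len(target)   # N
    num_docs = len(database)    # D
    scores = {}
    for word, n in counts.items():
        d = sum(1 for doc in database if word in doc)
        # Assumption: treat d == 0 as d == 1 to avoid division by zero.
        d = max(d, 1)
        scores[word] = (n / total_words) * math.log(num_docs / d)
    return scores


def important_words(target, database, top_k):
    """Return the top_k words by tfidf value (ties broken alphabetically)."""
    scores = tfidf_scores(target, database)
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]
```

A word that occurs in every stored sentence gets idf = log(1) = 0 and therefore a tfidf value of 0. This is how frequent function words end up ranked low, which is the reason the text maintains a separate supplementary word list.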

（Ｂ）文章と文章のコサイン類似度を算出し、対象とする文章に類似する類似文章を抽出する類似文書抽出方法では、以下で説明するように、データベースに格納されている文章それぞれと、対象とする文章とに、ｔｆｉｄｆベクトルを算出し、コサイン類似度を算出する。 (B) In the similar document extraction method, which calculates the cosine similarity between sentences and extracts sentences similar to a target sentence, a tfidf vector is calculated for each sentence stored in the database and for the target sentence, and the cosine similarity between these vectors is then calculated, as described below.

まず、データベースに格納されている全文章と、対象とする文章と、を形態素解析し、文章を単語（形態素）に分解する。次に、分解して得られた複数の単語から、単語の重複する分を削除し、単語それぞれを成分とする単語ベクトルを生成する。次に、データベースに格納されている文章それぞれと、対象とする文章に対して、ｔｆｉｄｆベクトルを算出する。ｔｆｉｄｆベクトルは、単語ベクトルの成分の単語に対するｔｆｉｄｆ値を成分とするベクトルである。 First, all sentences stored in the database and the target sentence are subjected to morphological analysis to break the sentences down into words (morphemes). Next, duplicate words are removed from the multiple words obtained from the breakdown, and a word vector with each word as a component is generated. Next, tfidf vectors are calculated for each sentence stored in the database and the target sentence. A tfidf vector is a vector whose components are the tfidf values for the words that are components of the word vector.

単語ベクトルと、ｔｆｉｄｆベクトルとの例を挙げると、「スマートフォンは軽い。」という文を、形態素解析して生成される単語ベクトルは、例えば、（スマートフォン，は，軽い，。）となる。これに対するｔｆｉｄｆベクトルは、例えば、（「スマートフォン」のｔｆｉｄｆ値，「は」のｔｆｉｄｆ値，「軽い」のｔｆｉｄｆ値，「。」のｔｆｉｄｆ値）となる。 To give an example of a word vector and a tfidf vector, the word vector generated by morphological analysis of the sentence "Smartphones are light." is, for example, (smartphone, is, light, .). The corresponding tfidf vector is, for example, (tfidf value of "smartphone", tfidf value of "is", tfidf value of "light", tfidf value of ".").

次に、データベースに格納されている文章のｔｆｉｄｆベクトルそれぞれと、対象とする文章のｔｆｉｄｆベクトルとのコサイン類似度（２つのｔｆｉｄｆベクトルの間の角度に対するコサインの値）を算出する。２つのｔｆｉｄｆベクトルＡ、Ｂのコサイン類似度は、コサイン類似度＝Ａ・Ｂ／（｜Ａ｜｜Ｂ｜）となる。対象文章とのコサイン類似度の値が大きい文章ほど（コサイン類似度が高い文章ほど）、類似度が高い文章とする。 Next, calculate the cosine similarity (the cosine value of the angle between two tfidf vectors) between each of the tfidf vectors of the sentences stored in the database and the tfidf vector of the target sentence. The cosine similarity of two tfidf vectors A and B is cosine similarity = A · B / (|A||B|). A sentence with a larger cosine similarity value with the target sentence (a sentence with higher cosine similarity) is considered to be more similar.

そして、データベースに含まれる文章のうちで、コサイン類似度の高さで上位から所定の割合（または所定の数）の文章を、類似度が高い類似文章とする。ここで、コサイン類似度の代わりに、データベースに格納されている文章のｔｆｉｄｆベクトルと、対象とする文章のｔｆｉｄｆベクトルと、の内積を用いても良い。 Then, among the sentences included in the database, a certain percentage (or a certain number) of the sentences that rank highest in terms of cosine similarity are determined to be similar sentences with high similarity. Here, instead of cosine similarity, the inner product of the tfidf vector of a sentence stored in the database and the tfidf vector of the target sentence may be used.
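Procedure (B) can be sketched in the same spirit. The code below builds tfidf vectors over a shared vocabulary, computes A · B / (|A||B|), and returns the indices of the most similar stored sentences. All function names are hypothetical, input is assumed to be pre-tokenized, and returning 0.0 for a zero-length vector is an assumption not stated in the text.

```python
import math
from collections import Counter


def tfidf_vector(tokens, vocabulary, database):
    """tfidf values of `tokens`, one component per vocabulary word."""
    counts = Counter(tokens)
    vector = []
    for word in vocabulary:
        # Assumption: d == 0 is treated as d == 1 to keep idf finite.
        d = max(sum(1 for doc in database if word in doc), 1)
        tf = counts[word] / len(tokens) if tokens else 0.0
        vector.append(tf * math.log(len(database) / d))
    return vector


def cosine_similarity(a, b):
    """cos of the angle between vectors a and b: A . B / (|A| |B|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # assumption: a zero vector is similar to nothing
    return dot / (norm_a * norm_b)


def most_similar(target, database, top_k):
    """Indices of the top_k database sentences most similar to `target`."""
    vocabulary = sorted({w for doc in database for w in doc} | set(target))
    target_vec = tfidf_vector(target, vocabulary, database)
    scored = [
        (cosine_similarity(target_vec, tfidf_vector(doc, vocabulary, database)), i)
        for i, doc in enumerate(database)
    ]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored[:top_k]]
```

Normalizing by the vector lengths keeps long sentences from scoring higher merely because they contain more words. The inner-product variant mentioned in the text drops that normalization.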

以上の説明は、ｔｆ－ｉｄｆ法の概要であり、ｔｆ－ｉｄｆ法を用いる際のｔｆ－ｉｄｆ法のアルゴリズムは、以上で説明した方法から適宜変更できる。また、ｔｆ－ｉｄｆ法の「（Ｂ）文章と文章のコサイン類似度を算出し、対象とする文章に類似する類似文章を抽出する類似文書抽出方法」に換えて、例えばＤｏｃ２Ｖｅｃ法等の文章の類似度を算出する他の方法を用いて類似文章を抽出しても良い。 The above explanation is an overview of the tf-idf method, and the algorithm of the tf-idf method when using the tf-idf method can be changed as appropriate from the method explained above. Also, instead of the tf-idf method's "(B) similar document extraction method that calculates the cosine similarity between sentences and extracts similar sentences that are similar to the target sentence," other methods that calculate the similarity of sentences, such as the Doc2Vec method, may be used to extract similar sentences.

＜処理手順＞ <Processing Procedure>
次に、回答候補提案システム１の処理手順について説明する。ユーザは、ユーザ端末２を操作して、ユーザ端末２に、回答候補提案システム１にアクセスさせる。回答候補提案システム１は、ユーザ端末２からアクセスされると、項目候補単語群生成部１１により実行される、項目候補単語群生成処理を開始する。以下では、図９及び図１０を参照しつつ、図８を用いて項目候補単語群生成処理について説明する。 Next, a description will be given of the processing procedure of the answer candidate suggestion system 1. A user operates the user terminal 2 to cause the user terminal 2 to access the answer candidate suggestion system 1. When the answer candidate suggestion system 1 is accessed from the user terminal 2, it starts an item candidate word group generation process executed by the item candidate word group generation unit 11. Below, the item candidate word group generation process will be described using FIG. 8 with reference to FIG. 9 and FIG. 10.

図８は、回答候補提案システム１の項目候補単語群生成処理の一例を示すフローチャートである。 Figure 8 is a flowchart showing an example of the process for generating a group of item candidate words in the answer candidate suggestion system 1.

回答候補提案システム１は、ユーザ端末２に新規質問文章入力画面情報を送信する（ステップＳ１０１）。新規質問文章入力画面情報は、新規質問文章入力画面の構成の情報と、ユーザ端末２に新規質問文章入力画面を表示させる旨の情報と、を含む。新規質問文章入力画面は、新規質問文章の入力と、入力された新規質問文章を回答候補提案システム１に送信する旨の入力と、を受け付けることができるように構成されている。 The answer candidate suggestion system 1 transmits new question text input screen information to the user terminal 2 (step S101). The new question text input screen information includes information on the configuration of the new question text input screen and information to display the new question text input screen on the user terminal 2. The new question text input screen is configured to be able to accept input of a new question text and input to transmit the input new question text to the answer candidate suggestion system 1.

図９は、ユーザ端末２に表示される新規質問文章入力画面の一例を示す説明図である。図９に示す新規質問文章入力画面９００は、新規質問文章入力欄９０１と、項目選択ボタン９０２とを備えている。新規質問文章入力欄９０１は、ユーザが新規質問文章を入力する欄である。項目選択ボタン９０２は、入力された新規質問文章を回答候補提案システム１に送信する旨を入力するボタンである。ユーザが、新規質問文章入力欄９０１に新規質問文章を入力し、さらに、項目選択ボタン９０２を押すと、ユーザ端末２は、新規質問文章入力欄９０１に入力された新規質問文章を、回答候補提案システム１に送信するようになっている。図９には、新規質問文章入力欄９０１に、「繋がらないから助けてほしい。私日立花子はＡ県Ｂ市Ｃ丁目に住んでいるが、自宅の椅子に座って本を読んでいた時に発覚した。」との新規質問文章が入力されており、項目選択ボタン９０２が押されると、入力された新規質問文章が、回答候補提案システム１に送信される。 FIG. 9 is an explanatory diagram showing an example of a new question sentence input screen displayed on the user terminal 2. The new question sentence input screen 900 shown in FIG. 9 includes a new question sentence input field 901 and an item selection button 902. The new question sentence input field 901 is a field in which the user inputs a new question sentence. The item selection button 902 is a button for instructing that the input new question sentence be sent to the answer candidate suggestion system 1. When the user inputs a new question sentence in the new question sentence input field 901 and then presses the item selection button 902, the user terminal 2 transmits the new question sentence input in the new question sentence input field 901 to the answer candidate suggestion system 1. In FIG. 9, the new question sentence "I can't connect, so please help me. I, Hitachi Hanako, live in C-chome, B-city, A-prefecture, and I discovered this when I was sitting in a chair at home reading a book." has been input in the new question sentence input field 901, and when the item selection button 902 is pressed, this new question sentence is transmitted to the answer candidate suggestion system 1.

図８に戻り、次に、回答候補提案システム１は、所定時間待機する（ステップＳ１０２）。 Returning to FIG. 8, next, the answer candidate suggestion system 1 waits for a predetermined time (step S102).

次に、回答候補提案システム１は、ユーザ端末２から新規質問文章を受信したか否かを判定する（ステップＳ１０３）。ユーザ端末２から新規質問文章を受信したと判定された場合（ステップＳ１０３：ＹＥＳ）はステップＳ１０４に進み、ユーザ端末２から新規質問文章を受信していないと判定された場合（ステップＳ１０３：ＮＯ）は、ステップＳ１０２に戻る。これにより、回答候補提案システム１は、ユーザ端末２から新規質問文章を受信するまで、ステップＳ１０２、ステップＳ１０３の処理を繰り返して、新規質問文章を待ち受ける。 Next, the answer candidate suggestion system 1 determines whether or not a new question text has been received from the user terminal 2 (step S103). If it is determined that a new question text has been received from the user terminal 2 (step S103: YES), the process proceeds to step S104, and if it is determined that a new question text has not been received from the user terminal 2 (step S103: NO), the process returns to step S102. As a result, the answer candidate suggestion system 1 waits for a new question text by repeating the processes of steps S102 and S103 until a new question text is received from the user terminal 2.
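The wait-and-check loop of steps S102 and S103 amounts to simple polling. The sketch below is only one possible reading: the in-memory `inbox` queue standing in for the network interface, the function name, and the injectable `sleep` parameter are all assumptions made for illustration.

```python
import time
from collections import deque


def wait_for_new_question(inbox, poll_interval=0.01, sleep=time.sleep):
    """Repeat S102 (wait) and S103 (check) until a new question sentence arrives.

    inbox: deque acting as a stand-in for sentences received from the user terminal.
    """
    while True:
        sleep(poll_interval)        # step S102: wait for a predetermined time
        if inbox:                   # step S103: has a new question been received?
            return inbox.popleft()  # YES: hand the sentence on to step S104
        # NO: fall through and return to step S102
```

Passing `sleep` as a parameter keeps the loop testable without real delays.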

次に、回答候補提案システム１は、ユーザ端末２から受信した新規質問文章を保存する（ステップＳ１０４）。ここで、回答候補提案システム１のネットワークＩ／Ｆ３６（送受信装置）は、ユーザ端末２から新規質問文章を受信する（入力される）と、プロセッサ３１は、新規質問文章を受信した旨をネットワークＩ／Ｆ３６から受け取り、受信した新規質問文章を主記憶装置３２に記憶させる。以上の様に回答候補提案システム１に新規質問文章が入力される。 Next, the answer candidate suggestion system 1 saves the new question text received from the user terminal 2 (step S104). Here, when the network I/F 36 (transmitter/receiver) of the answer candidate suggestion system 1 receives (inputs) the new question text from the user terminal 2, the processor 31 receives a notification from the network I/F 36 that the new question text has been received, and stores the received new question text in the main memory device 32. In this manner, the new question text is input to the answer candidate suggestion system 1.

次に、回答候補提案システム１は、質問回答データベース２１を用いｔｆ－ｉｄｆ法のコサイン類似度を算出して、新規質問文章に類似する過去の質問文章を抽出し、抽出した新規質問文章に類似する過去の質問文章を保存する（ステップＳ１０５）。ここで、回答候補提案システム１は、質問回答データベース２１を用い、質問回答データベース２１に保存された過去の質問文章それぞれに対して、質問情報との、上述したｔｆ－ｉｄｆ法のコサイン類似度を算出する。そして、質問回答データベース２１に保存されている過去の質問文章のうちで、コサイン類似度の高さで上位から所定の割合（例えば２０％）または所定の数（例えば３）の文章を抽出し、新規質問文章に類似する過去の質問文章として保存する。 Next, the answer candidate suggestion system 1 uses the question and answer database 21 to calculate the cosine similarity by the tf-idf method, extracts past question sentences similar to the new question sentence, and saves the extracted past question sentences similar to the new question sentence (step S105). Here, the answer candidate suggestion system 1 uses the question and answer database 21 to calculate the cosine similarity by the tf-idf method described above between the question information and each of the past question sentences stored in the question and answer database 21. Then, from the past question sentences stored in the question and answer database 21, a predetermined percentage (e.g., 20%) or a predetermined number (e.g., 3) of sentences are extracted from the top in terms of cosine similarity, and saved as past question sentences similar to the new question sentence.

次に、回答候補提案システム１は、質問回答データベース２１を用いて、上述したｔｆ－ｉｄｆ法の重要度を算出して、ステップＳ１０５にて抽出した新規質問文章に類似する過去の質問文章から高重要度単語群を生成し、保存する（ステップＳ１０６）。ここで、回答候補提案システム１は、ステップＳ１０５にて抽出した新規質問文章に類似する過去の質問文章を形態素解析して、複数の過去質問文章形態素を生成する。過去質問文章形態素とは、新規質問文章に類似する過去の質問文章を形態素解析して得られる形態素（単語）である。そして回答候補提案システム１は、複数の過去質問文章形態素それぞれに対して、上述したｔｆ－ｉｄｆ法の重要度を、質問回答データベース２１を用いて算出する。そして、複数の過去質問文章形態素のうちで、ｔｆ－ｉｄｆ法の重要度の高さで上位から所定の割合（例えば２０％）または所定の数（例えば１０）の過去質問文章形態素のうちの名詞を高重要度単語群として保存する。 Next, the answer candidate suggestion system 1 uses the question and answer database 21 to calculate the importance of the above-mentioned tf-idf method, and generates and saves a group of high importance words from past question sentences similar to the new question sentence extracted in step S105 (step S106). Here, the answer candidate suggestion system 1 performs morphological analysis on the past question sentences similar to the new question sentence extracted in step S105 to generate multiple past question sentence morphemes. Past question sentence morphemes are morphemes (words) obtained by morphological analysis of past question sentences similar to the new question sentence. Then, the answer candidate suggestion system 1 calculates the importance of the above-mentioned tf-idf method for each of the multiple past question sentence morphemes using the question and answer database 21. Then, among the multiple past question sentence morphemes, nouns from a predetermined percentage (e.g., 20%) or a predetermined number (e.g., 10) of past question sentence morphemes ranked in the top order of importance by the tf-idf method are saved as a group of high importance words.

次に、回答候補提案システム１は、高重要度単語群から、新規質問文章に含まれる単語を除いた単語群を項目候補単語群とし、保存する（ステップＳ１０７）。ここで、回答候補提案システム１は、新規質問文章を形態素解析して新規質問文章形態素（単語）を生成して保存する。新規質問文章形態素とは、新規質問文章を形態素解析して得られる形態素である。また、新規質問文章形態素を、「新規質問文章に含まれる単語」とする。そして、高重要度単語群から「新規質問文章に含まれる単語」（新規質問文章形態素）を除いて、項目候補単語群とする。なお、ステップＳ１０７を省略し、項重要度単語群を項目候補単語群としてもよい。 Next, the answer candidate suggestion system 1 removes the words contained in the new question sentence from the high importance word group, sets the remaining words as the item candidate word group, and saves it (step S107). Here, the answer candidate suggestion system 1 performs morphological analysis on the new question sentence to generate and save new question sentence morphemes (words). New question sentence morphemes are the morphemes obtained by morphological analysis of the new question sentence, and they are treated as the "words contained in the new question sentence." The item candidate word group is then obtained by removing these "words contained in the new question sentence" (new question sentence morphemes) from the high importance word group. Note that step S107 may be omitted, in which case the high importance word group is used as the item candidate word group as it is.
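Steps S106 and S107 can be pictured as a rank, filter, and subtract pipeline. The sketch below assumes the importance of each past question sentence morpheme has already been computed and that each morpheme appears once; the `(surface, part_of_speech, importance)` tuple layout, the `"noun"` tag, and the function name are hypothetical.

```python
def item_candidate_words(scored_morphemes, new_question_words, top_k):
    """Derive the item candidate word group from scored morphemes.

    scored_morphemes:   list of (surface, part_of_speech, importance) tuples
                        for morphemes of the similar past question sentences.
    new_question_words: morphemes of the new question sentence.
    top_k:              how many top-ranked morphemes to consider.
    """
    # S106: take the top_k morphemes by importance, then keep only the nouns.
    ranked = sorted(scored_morphemes, key=lambda m: (-m[2], m[0]))
    high_importance = [s for s, pos, _ in ranked[:top_k] if pos == "noun"]
    # S107: drop words that already appear in the new question sentence.
    excluded = set(new_question_words)
    return [word for word in high_importance if word not in excluded]
```

Removing words already present in the new question means the candidates offered to the user are terms the question did not mention, such as the device or the symptom's subject.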

次に、回答候補提案システム１は、項目候補単語群と、項目単語群選択画面情報とをネットワークＩ／Ｆ３６（送受信装置）に出力し、ネットワークＩ／Ｆ３６に、項目候補単語群と、項目単語群選択画面情報とをネットワークＮＷを介してユーザ端末２に送信させて、処理を終了する（ステップＳ１０８）。項目単語群選択画面情報は、項目単語群選択画面の構成の情報と、ユーザ端末２に項目単語群選択画面を表示させる旨の情報と、を含む。項目単語群選択画面は、図１０を用いて後述するが、項目候補単語群を表示でき、項目候補単語群から選択される項目単語群の入力と、入力された項目単語群および新規質問文章をオペレータ端末３に送信する旨の入力と、を受け付けることができるように構成されている。 Next, the answer candidate suggestion system 1 outputs the item candidate word group and the item word group selection screen information to the network I/F 36 (transmitting/receiving device), causes the network I/F 36 to transmit the item candidate word group and the item word group selection screen information to the user terminal 2 via the network NW, and ends the process (step S108). The item word group selection screen information includes information on the configuration of the item word group selection screen and information to cause the user terminal 2 to display the item word group selection screen. The item word group selection screen, which will be described later with reference to FIG. 10, is configured to be able to display the item candidate word group and to be able to accept input of an item word group selected from the item candidate word group and input to transmit the input item word group and new question sentence to the operator terminal 3.

図１０は、ユーザ端末２に表示される項目単語群選択画面の一例を示す説明図である。図１０に示す項目単語群選択画面１０００は、項目単語選択ボタン１００１～１００４と、項目単語投稿ボタン１００５を備えている。項目単語選択ボタン１００１～１００４は、項目候補単語が描かれたボタンである。項目単語選択ボタン１００１～１００４は、ユーザに押されると、枠を示す線の種類が切り替わる。項目単語選択ボタン１００１～１００４において、実線で描かれた枠は項目単語選択ボタンに書かれた項目候補単語をユーザが項目単語に選択したことを示し、破線で描かれた枠は項目単語選択ボタンに書かれた項目候補単語をユーザが項目単語に選択していないことを示す。 Figure 10 is an explanatory diagram showing an example of an item word group selection screen displayed on the user terminal 2. The item word group selection screen 1000 shown in Figure 10 includes item word selection buttons 1001-1004 and an item word submission button 1005. The item word selection buttons 1001-1004 are buttons on which item candidate words are drawn. When the item word selection buttons 1001-1004 are pressed by the user, the type of line indicating the frame changes. In the item word selection buttons 1001-1004, a frame drawn with a solid line indicates that the user has selected the item candidate word written in the item word selection button as an item word, and a frame drawn with a dashed line indicates that the user has not selected the item candidate word written in the item word selection button as an item word.

図１０の例では、項目単語選択ボタン１００１、１００２の枠は実線になっており、項目単語選択ボタン１００１の「スマートフォン」と、項目単語選択ボタン１００２の「電波」は項目単語に選択されている。また、項目単語選択ボタン１００３、１００４の枠は破線になっており、項目単語選択ボタン１００３の「コード」と、項目単語選択ボタン１００４の「電子書籍」は項目単語に選択されていない。 In the example of FIG. 10, the frames of the item word selection buttons 1001 and 1002 are solid lines, so "smartphone" on the item word selection button 1001 and "radio waves" on the item word selection button 1002 are selected as item words. The frames of the item word selection buttons 1003 and 1004 are dashed lines, so "cord" on the item word selection button 1003 and "electronic book" on the item word selection button 1004 are not selected as item words.

項目単語投稿ボタン１００５は、入力された項目単語群を回答候補提案システム１に送信する旨を入力するボタンである。ユーザが、項目単語選択ボタン１００１～１００４を押して項目単語を選択し、さらに、項目単語投稿ボタン１００５を押すと、ユーザ端末２は、項目単語選択ボタン１００１～１００４で選択された項目単語（項目単語群）と、新規質問文章と、回答候補生成選択画面情報とを、オペレータ端末３に送信するようになっている。 The item word submission button 1005 is a button for inputting that the input item word group is to be sent to the answer candidate suggestion system 1. When the user presses the item word selection buttons 1001 to 1004 to select an item word, and then presses the item word submission button 1005, the user terminal 2 transmits the item word (item word group) selected with the item word selection buttons 1001 to 1004, the new question text, and answer candidate generation selection screen information to the operator terminal 3.

回答候補生成選択画面情報は、回答候補生成選択画面の構成の情報と、オペレータ端末３に回答候補生成選択画面を表示させる旨の情報と、を含む。回答候補生成選択画面は、図１１を用いて後述するが、新規質問文章および項目単語群を表示でき、回答候補提案システム１に回答候補文章を生成させるか否かの情報の入力と、ウェブ検索で第２の回答候補文章を収集するか否かの情報であるＷＥＢ検索設定情報の入力と、を受け付けることができるように構成されている。 The answer candidate generation selection screen information includes information on the configuration of the answer candidate generation selection screen and information on displaying the answer candidate generation selection screen on the operator terminal 3. The answer candidate generation selection screen, which will be described later with reference to FIG. 11, is configured to be able to display a new question sentence and a group of item words, and to be able to accept input of information on whether or not to cause the answer candidate proposal system 1 to generate an answer candidate sentence, and input of web search setting information, which is information on whether or not to collect second answer candidate sentences by web search.

なお、ユーザ端末２は、項目単語群と、新規質問文章と、回答候補生成選択画面情報とを、オペレータ端末３に送信する代わりに、回答候補提案システム１に項目単語群および新規質問文章を送信してもよい。ここで、回答候補提案システム１は、ユーザ端末２から項目単語群および新規質問文章を受信すると、ＷＥＢから第２の回答候補文章を取得（詳細は後述）するか否かを適宜設定して、図１２に一例をフローチャートで示す回答候補文章生成処理を実行しても良い。 In addition, instead of transmitting the item word group, the new question sentence, and the answer candidate generation selection screen information to the operator terminal 3, the user terminal 2 may transmit the item word group and the new question sentence to the answer candidate suggestion system 1. Here, when the answer candidate suggestion system 1 receives the item word group and the new question sentence from the user terminal 2, it may appropriately set whether or not to acquire a second answer candidate sentence from the WEB (details will be described later), and execute an answer candidate sentence generation process, an example of which is shown in the flowchart of FIG. 12.

以上で説明した、図８のステップＳ１０５～Ｓ１０７では、質問回答データベース２１に保存された過去の質問文章それぞれに対して、新規質問文章との類似度（コサイン類似度）を算出し、新規質問文章との類似度に基づいて、新規質問文章に類似する過去の質問文章を抽出する（ステップＳ１０５）。抽出した新規質問文章に類似する過去の質問文章から生成した複数の過去質問文章形態素それぞれの重要度を算出し、複数の過去質問文章形態素から重要度の高い過去質問文章形態素を抽出して、項目候補単語群とする（ステップＳ１０６～Ｓ１０７）。これにより、項目候補単語群は、新規質問文章に類似する過去の質問文章において、重要度が高い、比較的重要な意味を持つ単語（過去質問文章形態素）となる。 In steps S105 to S107 in FIG. 8 described above, the similarity (cosine similarity) between each past question sentence stored in the question and answer database 21 and the new question sentence is calculated, and past question sentences similar to the new question sentence are extracted based on the similarity with the new question sentence (step S105). The importance of each of the past question sentence morphemes generated from the past question sentences similar to the extracted new question sentence is calculated, and past question sentence morphemes with high importance are extracted from the past question sentence morphemes to form a group of item candidate words (steps S106 to S107). As a result, the group of item candidate words becomes words (past question sentence morphemes) with high importance and relatively important meanings in past question sentences similar to the new question sentence.

また、回答候補文章（第１の回答候補文章、第２の回答候補文章）は、項目候補単語群から選択された項目候補単語と新規質問文章とに基づいて生成される。このため、項目候補単語群は、新規質問文章の質問に関して重要な意味を持つことが望ましい。これに対して、上述した様に、項目候補単語群は、新規質問文章に類似する過去の質問文章において、比較的重要な意味を持つ単語である。従って、回答候補提案システム１は、上記の様に項目候補単語群を生成することで、より適切な項目候補単語群を生成できる。 Furthermore, the answer candidate sentences (first answer candidate sentence, second answer candidate sentence) are generated based on the item candidate words selected from the item candidate word group and the new question sentence. For this reason, it is desirable that the item candidate word group has an important meaning with respect to the question of the new question sentence. In contrast, as described above, the item candidate word group is words that have a relatively important meaning in past question sentences similar to the new question sentence. Therefore, by generating the item candidate word group as described above, the answer candidate suggestion system 1 can generate a more appropriate item candidate word group.

また、図８のステップＳ１０５～Ｓ１０７に換えて、次のように、項目候補単語テーブル２５を用いて、新規質問文章から項目候補単語群を生成してもよい。まず、回答候補提案システム１は、新規質問文章に含まれる文から、疑問詞要望語リスト２２に保存されている疑問詞および要望語を少なくとも一つ含む疑問要望文を抽出する。次に、回答候補提案システム１は、抽出した疑問要望文を形態素解析して、疑問要望文に含まれる複数の単語（形態素）を得る。次に、回答候補提案システム１は、疑問要望文に含まれる複数の単語から、動詞を抽出する。次に、回答候補提案システム１は、項目候補単語テーブル２５（図７参照）を参照して、抽出した動詞に対応する項目候補単語群を項目候補単語テーブル２５から抽出し、項目候補単語群を得る。 In addition, instead of steps S105 to S107 in FIG. 8, the item candidate word group may be generated from the new question sentence using the item candidate word table 25 as follows. First, from the sentences included in the new question sentence, the answer candidate suggestion system 1 extracts an interrogative request sentence that contains at least one of the interrogative words and request words stored in the interrogative word request word list 22. Next, the answer candidate suggestion system 1 performs morphological analysis on the extracted interrogative request sentence to obtain the words (morphemes) it contains. Next, the answer candidate suggestion system 1 extracts the verbs from those words. Finally, the answer candidate suggestion system 1 refers to the item candidate word table 25 (see FIG. 7) and retrieves the item candidate word group associated with each extracted verb, thereby obtaining the item candidate word group.
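The table-based alternative reduces to a filter followed by a dictionary lookup. In the sketch below the table is a plain `dict` from verb to noun list, sentences are assumed to be pre-tokenized into `(surface, part_of_speech)` pairs, and the function name and the `"verb"` tag are hypothetical.

```python
def candidates_from_table(question_sentences, trigger_words, table):
    """Look up item candidate words for the verbs of interrogative/request sentences.

    question_sentences: sentences of the new question, each a list of
                        (surface, part_of_speech) pairs.
    trigger_words:      interrogative words and request words.
    table:              dict mapping a verb to its item candidate word group.
    """
    triggers = set(trigger_words)
    candidates = []
    for tokens in question_sentences:
        # Skip sentences that contain no interrogative or request word.
        if not triggers & {surface for surface, _ in tokens}:
            continue
        for surface, pos in tokens:
            if pos != "verb":
                continue
            for noun in table.get(surface, []):
                if noun not in candidates:  # keep first-seen order, no duplicates
                    candidates.append(noun)
    return candidates
```

Compared with steps S105 to S107, this path needs no similarity search at question time, because the table is built from the past question sentences in advance.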

項目候補単語テーブル２５に保存されている項目候補単語群は、図７を用いて上述した様に、質問回答データベース２１に保存された過去の質問文章に基づいて生成されている。従って、以上で説明した、項目候補単語テーブル２５を用いて項目候補単語群を生成する方法でも、項目候補単語群は、質問回答データベース２１に保存された過去の質問文章と新規質問文章に基づいて生成される。これにより、回答候補提案システム１は、より適切な項目候補単語群を生成できる。 The item candidate word groups stored in the item candidate word table 25 are generated based on past question sentences stored in the question and answer database 21, as described above with reference to FIG. 7. Therefore, even with the method described above that uses the item candidate word table 25, the item candidate word group is generated based on both the past question sentences stored in the question and answer database 21 and the new question sentence. This allows the answer candidate suggestion system 1 to generate a more appropriate item candidate word group.

図１１は、オペレータ端末３に表示される回答候補生成選択画面の一例を示す説明図である。図１１に示す回答候補生成選択画面１１００は、新規質問文章表示枠１１０１と、項目単語群表示枠１１０２と、ウェブ検索選択ボタン１１０３と、送信ボタン１１０４と、回答ボタン１１０５と、を含む。新規質問文章表示枠１１０１は、新規質問文章を表示する枠である。項目単語群表示枠１１０２は、項目単語群を表示する枠である。オペレータ端末３を操作するオペレータが、新規質問文章表示枠１１０１内を押す（クリック等する）と、オペレータ端末３は、オペレータからの入力を受け付けて、オペレータが新規質問文章表示枠１１０１内の新規質問文章を編集できるようになっている。これにより、オペレータ端末３は、オペレータが誤記の修正等の編集を加えた新規質問文章を回答候補提案システム１に送信することができる。その結果、回答候補提案システム１は、編集を加えた新規質問文章を新規質問文章とみなして回答候補文章を生成する。これにより、回答候補提案システム１は、より好適な第１の回答候補文章を生成し得る。 FIG. 11 is an explanatory diagram showing an example of an answer candidate generation selection screen displayed on the operator terminal 3. The answer candidate generation selection screen 1100 shown in FIG. 11 includes a new question sentence display frame 1101, an item word group display frame 1102, a web search selection button 1103, a send button 1104, and an answer button 1105. The new question sentence display frame 1101 is a frame for displaying a new question sentence. The item word group display frame 1102 is a frame for displaying an item word group. When the operator of the operator terminal 3 presses (clicks, etc.) inside the new question sentence display frame 1101, the operator terminal 3 accepts input from the operator so that the operator can edit the new question sentence in the new question sentence display frame 1101. This allows the operator terminal 3 to transmit to the answer candidate suggestion system 1 a new question sentence that the operator has edited, for example to correct typos. The answer candidate suggestion system 1 then treats the edited sentence as the new question sentence and generates answer candidate sentences from it. This allows the answer candidate suggestion system 1 to generate a more suitable first answer candidate sentence.

ウェブ検索選択ボタン１１０３は、回答候補提案システム１がウェブ検索で第２の回答候補文章を収集するか否かの情報であるＷＥＢ検索設定情報を入力するためのボタンである。ウェブ検索選択ボタン１１０３は、回答候補提案システム１にウェブ検索で第２の回答候補文章を収集させる場合には、図１１に示すように黒塗りになり、回答候補提案システム１にウェブ検索で第２の回答候補文章を収集させない場合には白塗りになる。ここで、黒塗りか、白塗りかは、オペレータがウェブ検索選択ボタン１１０３押す毎に、切り替わるようになっている。 The web search selection button 1103 is a button for inputting web search setting information, which is information on whether or not the answer candidate suggestion system 1 will collect second answer candidate sentences through a web search. The web search selection button 1103 is filled in black as shown in FIG. 11 if the answer candidate suggestion system 1 is to collect second answer candidate sentences through a web search, and is filled in white if the answer candidate suggestion system 1 is not to collect second answer candidate sentences through a web search. Here, whether it is filled in black or white is switched every time the operator presses the web search selection button 1103.

送信ボタン１１０４は、オペレータが押すと、オペレータ端末３が、回答候補提案システム１に、新規質問文章表示枠１１０１内の新規質問文章と、項目単語群と、ＷＥＢ検索設定情報と、回答候補提案システム１に回答候補文章の生成を指示する情報である回答候補文章生成開始情報と、を含む生成開始情報を送信するようになっている。ここで、生成開始情報に含まれる新規質問文章は、オペレータが送信ボタン１１０４を押した時点での新規質問文章表示枠１１０１内の新規質問文章である。従って、オペレータが送信ボタン１１０４を押す前に、新規質問文章表示枠１１０１内の新規質問文章を編集した場合には、編集後の新規質問文章が新規質問文章として開始情報に含まれる。なお、上述したように、オペレータが新規質問文章表示枠１１０１内の新規質問文章を編集する際に、オペレータが新規質問文章の一部（例えば、オペレータが質問で重要な意味を持つと思う部分）にアンダーラインや太字等の修飾を加えることができるとし、さらに、オペレータが修飾を加えた部分の文字の情報を重要文字情報として、生成開示情報に含めても良い。そして、後述するように、回答候補提案システム１は、重要文字情報を用いて、回答候補文章（第１の回答候補文章、第２の回答候補文章）を生成してもよい。 When the operator presses the send button 1104, the operator terminal 3 transmits generation start information to the answer candidate suggestion system 1. The generation start information includes the new question sentence in the new question sentence display frame 1101, the item word group, the web search setting information, and answer candidate sentence generation start information, which instructs the answer candidate suggestion system 1 to generate answer candidate sentences. Here, the new question sentence included in the generation start information is the new question sentence in the new question sentence display frame 1101 at the time the operator presses the send button 1104. Therefore, if the operator edits the new question sentence in the new question sentence display frame 1101 before pressing the send button 1104, the edited sentence is included in the generation start information as the new question sentence. As described above, when editing the new question sentence in the new question sentence display frame 1101, the operator may also be allowed to add emphasis such as underlining or bold to part of the new question sentence (for example, a part the operator considers important to the question), and the characters of the emphasized part may be included in the generation start information as important character information. Then, as described below, the answer candidate suggestion system 1 may generate answer candidate sentences (first answer candidate sentence, second answer candidate sentence) using the important character information.
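The contents of the generation start information can be pictured as a small record assembled at the moment the send button is pressed. The sketch below is only a guess at a convenient shape: the class name, field names, and the `on_send_pressed` helper are hypothetical and do not come from the source.

```python
from dataclasses import dataclass, field


@dataclass
class GenerationStartInfo:
    """Hypothetical container for what the operator terminal sends on 'send'."""
    new_question_text: str           # text in display frame 1101 at press time
    item_words: list[str]            # item word group chosen by the user
    web_search_enabled: bool         # web search setting information
    start_generation: bool = True    # answer candidate sentence generation start flag
    important_text: list[str] = field(default_factory=list)  # optional emphasized spans


def on_send_pressed(display_frame_text, item_words, web_search_enabled,
                    emphasized_spans=()):
    """Snapshot the current screen state into a GenerationStartInfo."""
    return GenerationStartInfo(
        new_question_text=display_frame_text,  # includes any operator edits
        item_words=list(item_words),
        web_search_enabled=web_search_enabled,
        important_text=list(emphasized_spans),
    )
```

Because the record is built from the display frame's current text, any edit the operator made before pressing send is what the system receives as the new question sentence.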

回答ボタン１１０５は、押されると、オペレータ端末３に表示されている画面が、回答候補生成選択画面から、オペレータが新規質問文章に対する回答文章を入力する画面に切り替わるように構成されている。 When the answer button 1105 is pressed, the screen displayed on the operator terminal 3 is switched from the answer candidate generation selection screen to a screen where the operator inputs an answer sentence to the new question sentence.

When the answer candidate suggestion system 1 receives the generation start information from the operator terminal 3 via the network I/F 36, it starts the answer candidate sentence generation process executed by the answer candidate sentence generation unit 12. Receiving the generation start information amounts to inputting the new question sentence it contains, together with the item word group that the user selected from the item candidate word group.

図１２は、回答候補提案システム１の回答候補文章生成処理の一例を示すフローチャートである。 Figure 12 is a flowchart showing an example of the answer candidate sentence generation process of the answer candidate suggestion system 1.

回答候補提案システム１は、オペレータ端末３から受信した生成開始情報に含まれる、ＷＥＢ検索設定情報と、新規質問文章と、項目単語群と、を保存する（ステップＳ２０１）。 The answer candidate suggestion system 1 saves the web search setting information, the new question text, and the group of item words contained in the generation start information received from the operator terminal 3 (step S201).

Next, the answer candidate suggestion system 1 uses the interrogative word and request word list 22 to extract an interrogative request sentence from the new question sentence, and generates question information by adding the item word group to the extracted sentence (step S202). Specifically, the system extracts from the new question sentence an interrogative request sentence, that is, a sentence containing at least one interrogative word or request word stored in the interrogative word and request word list 22. It then appends or prepends the item word group to the extracted interrogative request sentence to form the question information. For example, if the interrogative request sentence is "I can't connect, so please help me." and the item word group is "smartphone" and "radio waves", the question information is, for example, "I can't connect, so please help me. Smartphone, radio waves" or "Smartphone, radio waves, I can't connect, so please help me." As noted above, if the generation start information includes important character information, the question information may be formed by combining the interrogative request sentence, the item word group, and the characters in the important character information (parts that duplicate the interrogative request sentence may be removed). In this way, the characters that the operator marked in the new question sentence with underlining or similar emphasis can be added to the question information, which may allow the answer candidate suggestion system 1 to generate a more suitable first answer candidate sentence.
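Step S202 can be sketched as follows. This is a minimal illustration and not the patented implementation: the function names are hypothetical, sentences are split on punctuation, and cue words are matched as plain substrings (so a cue such as "how" would also match "show"), whereas a real system would presumably work on morphemes.

```python
import re


def extract_request_sentences(question_text, cue_words):
    """Return the sentences containing at least one interrogative or request word."""
    # Split after sentence-ending punctuation (ASCII and Japanese).
    parts = re.split(r"(?<=[.?!。？！])\s*", question_text)
    sentences = [s.strip() for s in parts if s.strip()]
    return [s for s in sentences if any(cue in s for cue in cue_words)]


def build_question_info(question_text, cue_words, item_words, important_text=()):
    """Append the item word group (and optional emphasized text) to the extracted sentences."""
    body = " ".join(extract_request_sentences(question_text, cue_words))
    # Skip emphasized spans that already appear in the extracted sentences.
    extras = [t for t in important_text if t not in body]
    suffix = ", ".join(list(item_words) + extras)
    return f"{body} {suffix}".strip()
```

This sketch places the item words after the extracted sentence; the text allows them to be placed before it instead.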

次に、回答候補提案システム１は、質問回答データベース２１を用い、ｔｆ－ｉｄｆ法のコサイン類似度を算出して、質問情報に類似する過去の質問文章を抽出し、抽出した過去の質問文章に対応付けられた過去の回答文章を第１の回答候補文章として、保存する（ステップＳ２０３）。ここで、回答候補提案システム１は、質問回答データベース２１を用い、質問回答データベース２１に保存された過去の質問文章それぞれに対して、質問情報との上述したｔｆ－ｉｄｆ法のコサイン類似度を算出する。また、質問回答データベース２１に保存されている過去の質問文章のうちで、コサイン類似度の高さで上位から所定の割合（例えば２０％）または所定の数（例えば３）の過去の質問文章を抽出する。そして、抽出した過去の質問文章に対応付けられた過去の回答文章を、質問回答データベース２１から抽出し、抽出した過去の回答文章を、第１の回答候補文章として保存する。 Next, the answer candidate suggestion system 1 uses the question and answer database 21 to calculate the cosine similarity by the tf-idf method, extracts past question sentences similar to the question information, and saves the past answer sentences associated with the extracted past question sentences as first answer candidate sentences (step S203). Here, the answer candidate suggestion system 1 uses the question and answer database 21 to calculate the cosine similarity between the question information and each of the past question sentences stored in the question and answer database 21 by the above-mentioned tf-idf method. In addition, among the past question sentences stored in the question and answer database 21, a predetermined percentage (e.g., 20%) or a predetermined number (e.g., 3) of past question sentences are extracted from the top in terms of cosine similarity. Then, past answer sentences associated with the extracted past question sentences are extracted from the question and answer database 21, and the extracted past answer sentences are saved as first answer candidate sentences.
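The retrieval in step S203 can be sketched as below. The exact tf-idf formula is the one defined earlier in the specification, which this sketch does not reproduce: it assumes a common smoothed-idf variant, and it uses lowercase whitespace splitting as a stand-in for morphological analysis. All function names are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    # Stand-in for morphological analysis: lowercase, split on whitespace.
    return text.lower().split()


def idf_table(documents):
    n = len(documents)
    df = Counter(term for doc in documents for term in set(tokenize(doc)))
    # Smoothed idf (an assumption) so that unseen terms get a finite weight.
    return lambda term: math.log((1 + n) / (1 + df[term])) + 1.0


def tfidf_vector(text, idf):
    tokens = tokenize(text)
    counts = Counter(tokens)
    return {t: (c / len(tokens)) * idf(t) for t, c in counts.items()}


def cosine(u, v):
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0


def first_answer_candidates(question_info, qa_pairs, top_n=3):
    """qa_pairs: list of (past_question, past_answer) tuples.

    Returns the answers paired with the top_n most similar past questions.
    """
    idf = idf_table([q for q, _ in qa_pairs])
    query = tfidf_vector(question_info, idf)
    scored = [(cosine(query, tfidf_vector(q, idf)), a) for q, a in qa_pairs]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [a for _, a in scored[:top_n]]
```

The `top_n` cutoff corresponds to the "predetermined number" in the text; a percentage-based cutoff would replace it with a count derived from `len(qa_pairs)`.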

Next, the answer candidate suggestion system 1 determines, based on the web search setting information saved in step S201, whether to collect second answer candidate sentences through a web search (step S204). If it determines that they are to be collected (step S204: YES), the process proceeds to step S205; if not (step S204: NO), the process proceeds to step S208. As described above, the web search setting information indicates whether second answer candidate sentences are to be collected through a web search, so the answer candidate suggestion system 1 can make this determination from it.

Next, the answer candidate suggestion system 1 performs morphological analysis on the question information to obtain a plurality of question information morphemes, calculates the tf-idf importance of each question information morpheme using the question and answer database 21, and extracts the question information morphemes with high tf-idf importance. It then removes any personal information words stored in the personal information word list 23 and saves the remaining question information morphemes as the search word group (step S205). Here, a question information morpheme is a morpheme obtained by morphological analysis of the question information. The method of calculating the tf-idf importance is as described above. The question information morphemes with high tf-idf importance are those ranked highest by importance, up to a predetermined percentage (e.g., 20%) or a predetermined number (e.g., 3).

また、ステップＳ２０５において、回答候補提案システム１は、補足単語リスト２４に保存されている補足単語（例えば、「ない」）が、質問情報内にある場合、質問情報内にある補足単語を、検索単語群に加えてもよい。これにより、より望ましい第２の回答候補を得ることができる場合がある。 In addition, in step S205, if a supplementary word (e.g., "not") stored in the supplementary word list 24 is present in the question information, the answer candidate suggestion system 1 may add the supplementary word in the question information to the search word group. This may result in a more desirable second answer candidate being obtained.

Among the words (morphemes) in the search word group, words that inflect are preferably kept in the inflected form in which they appear (for example, 「繋がら」 (tsunagara) and 「助け」 (tasuke)), but they may instead be converted to their base (dictionary) form with the inflection information removed (for example, 「繋がる」 (tsunagaru, "to connect") and 「助ける」 (tasukeru, "to help")).
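The construction of the search word group in step S205 can be sketched as follows. As before, this is an illustration under assumptions: the smoothed tf-idf weighting, the whitespace tokenizer standing in for morphological analysis, and the function names are not taken from the patent.

```python
import math
from collections import Counter


def tokenize(text):
    # Stand-in for morphological analysis; strips surrounding punctuation.
    return [t.strip(".,!?").lower() for t in text.split() if t.strip(".,!?")]


def search_words(question_info, past_questions, personal_words,
                 supplementary_words=(), top_n=3):
    tokens = tokenize(question_info)
    n = len(past_questions)
    df = Counter(t for q in past_questions for t in set(tokenize(q)))
    counts = Counter(tokens)
    # tf-idf importance of each morpheme of the question information.
    importance = {
        t: (c / len(tokens)) * (math.log((1 + n) / (1 + df[t])) + 1.0)
        for t, c in counts.items()
    }
    # Highest importance first; ties keep the order of first appearance.
    ranked = sorted(importance, key=lambda t: (-importance[t], tokens.index(t)))
    # Keep the top morphemes, then drop anything on the personal information list.
    selected = [t for t in ranked[:top_n] if t not in personal_words]
    # Optionally add supplementary words (e.g. negations) found in the question.
    for word in supplementary_words:
        if word in counts and word not in selected:
            selected.append(word)
    return selected
```

Filtering personal information words after the top-ranked morphemes are chosen follows the order given in the text; filtering before ranking would be a different design that keeps the group size constant.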

Next, the answer candidate suggestion system 1 transmits the search word group to the web search engine 4, obtains the search results for the search word group returned by the web search engine 4, and collects the web site summaries included in the search results (step S206). Specifically, the answer candidate suggestion system 1 (processor 31) outputs the search word group to the network I/F 36 (transmitting/receiving device) together with an instruction to transmit it to the web search engine 4. The network I/F 36 then transmits the search word group to the web search engine 4 via the network NW. On receiving the search word group, the web search engine 4 returns search results for it to the answer candidate suggestion system 1. These search results include summaries of web sites related to the search word group.

なお、回答候補提案システム１は、過去の質問文章とその回答文章との組が記載された少なくとも１つのＷＥＢページをあらかじめ記憶し、記憶したＷＥＢページを、ステップＳ２０６の処理にてウェブ検索エンジンで検索する対象のＷＥＢサイトに設定してもよい。これにより、ＷＥＢサイトの概要文をより容易に収集でき、ひいては、第２の回答候補文章（後述）をより容易に収集できる。 The answer candidate suggestion system 1 may store in advance at least one web page that lists pairs of past question sentences and their answers, and set the stored web page as the target web site to be searched by a web search engine in the process of step S206. This makes it easier to collect summary sentences of web sites, and therefore easier to collect second answer candidate sentences (described below).

また、ステップＳ２０６の処理にてウェブ検索エンジンを用いる代わりに、あらかじめ登録してあり記憶されている、所定の装置内のデータ（例えば、過去の質問文章とその回答文章との組のデータ等）を検索する検索装置を用いてもよい。ここで、検索装置は、例えば、ＷＥＢサイトの概要文と同様の概要文を生成し、記憶し、この概要文を、上記のＷＥＢサイトの概要文の代わりとしてもよい。これにより、概要文をより効率よく収集し得り、ひいては、第２の回答候補文章（後述）をより効率よく収集し得る。 In addition, instead of using a web search engine in the process of step S206, a search device may be used that searches for pre-registered and stored data in a specified device (e.g., data pairs of past question sentences and their answers). Here, the search device may, for example, generate and store a summary similar to the summary of the website, and use this summary instead of the summary of the website. This makes it possible to collect summaries more efficiently, and ultimately to collect second candidate answer sentences (described below) more efficiently.

Next, the answer candidate suggestion system 1 ranks the web site summaries obtained in step S206 using a predetermined ranking method, and saves a predetermined number (or a predetermined percentage) of the top-ranked summaries as second answer candidate sentences (step S207). The ranking may be based, for example, on the tf-idf cosine similarity between each web site summary and the new question sentence. It may instead use the ranking model described in Patent Document 2 or another known technique. Alternatively, instead of ranking in step S207, the summaries of the top search results as ranked by the web search engine in step S206 may be used as the second answer candidate sentences. The second answer candidate sentences may also include the URL of the web site. The operator of the operator terminal 3 can then, when reading a second answer candidate sentence, use the URL to access the web site and obtain further information related to the summary.
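One of the ranking options in step S207, tf-idf cosine similarity between each summary and the new question sentence, can be sketched as below. The text does not say which corpus supplies the idf statistics for this ranking; the sketch assumes the collected summaries themselves, uses a smoothed idf, and tokenizes on whitespace. Function names and the result format are hypothetical.

```python
import math
from collections import Counter


def _vec(tokens, idf):
    counts = Counter(tokens)
    return {t: c / len(tokens) * idf(t) for t, c in counts.items()}


def _cos(u, v):
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


def second_answer_candidates(new_question, results, top_n=2):
    """results: list of (url, summary). Returns the top_n results, best first."""
    docs = [summary.lower().split() for _, summary in results]
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    idf = lambda t: math.log((1 + n) / (1 + df[t])) + 1.0
    query = _vec(new_question.lower().split(), idf)
    scored = sorted(
        ((_cos(query, _vec(d, idf)), url, summary)
         for d, (url, summary) in zip(docs, results) if d),
        key=lambda item: item[0],
        reverse=True,
    )
    # Keep the URL with each summary so the operator can open the source page.
    return [{"url": url, "summary": summary} for _, url, summary in scored[:top_n]]
```

Returning the URL alongside each summary reflects the option, noted in the text, of including the web site URL in the second answer candidate sentence.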

次に、回答候補提案システム１は、回答候補文章および回答候補文章表示画面情報を、出力装置（ネットワークＩ／Ｆ３６）に出力して、出力装置（ネットワークＩ／Ｆ３６）に回答候補文章および回答候補文章表示画面情報をオペレータ端末３に送信させて、処理を終了する（ステップＳ２０８）。ここで、回答候補文章には、第１の回答候補文章と、第２の回答候補文章とを含む。言うまでもなく、ステップＳ２０４の処理で、ウェブ検索で第２の回答候補文章を収集しないと判定された場合（ステップＳ２０４：ＮＯ）には、回答候補文章は、第１の回答候補文章のみとなる。また、回答候補文章表示画面情報は、回答候補文章表示画面の構成の情報と、オペレータ端末３に回答候補文章表示画面を表示させる旨の情報と、を含む。回答候補文章表示画面は、回答候補文章を表示できるように構成されている。 Next, the answer candidate suggestion system 1 outputs the answer candidate sentence and the answer candidate sentence display screen information to the output device (network I/F 36), causes the output device (network I/F 36) to transmit the answer candidate sentence and the answer candidate sentence display screen information to the operator terminal 3, and ends the process (step S208). Here, the answer candidate sentence includes the first answer candidate sentence and the second answer candidate sentence. Needless to say, if it is determined in the process of step S204 that the second answer candidate sentence is not to be collected by the web search (step S204: NO), the answer candidate sentence will be only the first answer candidate sentence. In addition, the answer candidate sentence display screen information includes information on the configuration of the answer candidate sentence display screen and information to the effect that the answer candidate sentence display screen is to be displayed on the operator terminal 3. The answer candidate sentence display screen is configured to be able to display the answer candidate sentence.
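The control flow of FIG. 12 (steps S201 to S208), including the branch at step S204, can be summarized in a short sketch. The individual steps are passed in as callables so that the sketch stays independent of any particular similarity measure or search engine; the function name, parameter names, and dictionary keys are hypothetical, and step S202 is reduced to simple concatenation.

```python
def generate_answer_candidates(start_info, find_past_answers, make_search_words,
                               web_search, rank_summaries):
    """Orchestrate steps S201 to S208 with each step supplied as a callable."""
    # S201: keep the received settings, question text and item words.
    saved = dict(start_info)
    # S202: build the question information (sentence extraction omitted here).
    question_info = f"{saved['question']} {', '.join(saved['item_words'])}"
    # S203: answers to similar past questions become the first candidates.
    candidates = {"first": find_past_answers(question_info), "second": []}
    # S204: only query the web when the operator enabled it.
    if saved["web_search"]:
        words = make_search_words(question_info)           # S205
        summaries = web_search(words)                      # S206
        candidates["second"] = rank_summaries(summaries)   # S207
    # S208: hand the candidates back for transmission to the operator terminal.
    return candidates
```

When web search is disabled, the result contains only first answer candidate sentences, matching the S204: NO path in the text.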

図１３は、オペレータ端末３に表示される回答候補文章表示画面の一例を示す説明図である。図１３に示す回答候補文章表示画面１３００は、第１の枠１３０１と、第１の回答候補文章欄１３０２、１３０３と、第２の枠１３０４と、第２の回答候補文章欄１３０５、１３０６と、を備えている。第１の回答候補文章を示す「過去回答」と描かれた第１の枠１３０１の右に、第１の回答候補文章を表示する第１の回答候補文章欄１３０２、１３０３が示されている。同様に、第２の回答候補文章を示す「ウェブ検索」と描かれた第２の枠１３０４の右に、第２の回答候補文章を表示する第２の回答候補文章欄１３０５、１３０６が示されている。 Figure 13 is an explanatory diagram showing an example of an answer candidate sentence display screen displayed on the operator terminal 3. The answer candidate sentence display screen 1300 shown in Figure 13 includes a first frame 1301, first answer candidate sentence fields 1302 and 1303, a second frame 1304, and second answer candidate sentence fields 1305 and 1306. To the right of the first frame 1301, which is written "Past Answer" indicating the first answer candidate sentence, the first answer candidate sentence fields 1302 and 1303 displaying the first answer candidate sentence are shown. Similarly, to the right of the second frame 1304, which is written "Web Search" indicating the second answer candidate sentence, the second answer candidate sentence fields 1305 and 1306 displaying the second answer candidate sentence are shown.

オペレータ端末３に、回答候補文章表示画面で、回答候補文章が表示されることで、オペレータは、表示された回答候補文章を参考にして、新規質問文章に対する回答文章を生成できる。これにより、オペレータは、より容易に回答文章を生成できる。また、オペレータが回答文章を生成するために必要となるエネルギーや生成される二酸化炭素の排出量を減らすことができ、地球温暖化を抑制できる。 By displaying the answer candidate sentences on the answer candidate sentence display screen on the operator terminal 3, the operator can generate an answer sentence to a new question sentence by referring to the displayed answer candidate sentences. This allows the operator to generate an answer sentence more easily. In addition, the amount of energy required by the operator to generate an answer sentence and the amount of carbon dioxide emissions generated can be reduced, thereby curbing global warming.

このように、実施例において、回答候補提案システム１は、新規質問文章だけでなく、質問回答データベース２１に保存された過去の質問文章および新規質問文章に基づいて生成された項目候補単語群からユーザが選択した項目単語群に基づいて、回答候補文章（第１の回答候補文章及び第２の回答候補文章）を生成する。これにより、回答候補提案システム１は、新規質問文章だけに基づいて回答候補文章を生成する場合に比べて、新規質問文章の質問の意図により一層沿う、好適な回答候補文章を生成でき、出力できる。 In this way, in the embodiment, the answer candidate suggestion system 1 generates answer candidate sentences (first answer candidate sentence and second answer candidate sentence) based not only on the new question sentence but also on a group of item words selected by the user from a group of item candidate words generated based on the past question sentence and the new question sentence stored in the question and answer database 21. As a result, the answer candidate suggestion system 1 can generate and output suitable answer candidate sentences that are more in line with the intent of the question of the new question sentence compared to when answer candidate sentences are generated based only on the new question sentence.

また、質問回答データベース２１に保存された過去の質問文章と、過去の回答文章とを用いて、回答候補文章（第１の回答候補文章及び第２の回答候補文章）を生成する。これにより、回答候補提案システム１は、より容易に回答候補文章を生成できる。 In addition, answer candidate sentences (first answer candidate sentences and second answer candidate sentences) are generated using past question sentences and past answer sentences stored in the question and answer database 21. This allows the answer candidate suggestion system 1 to more easily generate answer candidate sentences.

The answer candidate suggestion system 1 also extracts from the new question sentence an interrogative request sentence containing at least one interrogative word or request word, and generates question information by adding the item word group to the extracted interrogative request sentence (step S202 in FIG. 12). This allows the answer candidate suggestion system 1 to generate the first answer candidate sentence from the interrogative request sentence, which is closely related to the intent of the question, with the less relevant parts of the new question sentence removed. The answer candidate suggestion system 1 can therefore generate a suitable first answer candidate sentence that better matches the intent of the user's question.

また、図１２のステップＳ２０３において、回答候補提案システム１が算出する、質問回答データベース２１に保存された過去の質問文章それぞれに対する、質問情報との類似度は、質問回答データベース２１を用いて算出されるｔｆ－ｉｄｆ法のコサイン類似度である。これにより、回答候補提案システム１は、類似度を容易に算出でき、ひいては、より容易に第１の回答候補文章を生成できる。 In addition, in step S203 of FIG. 12, the similarity between the question information and each of the past question sentences stored in the question and answer database 21, which is calculated by the answer candidate suggestion system 1, is the cosine similarity calculated by the tf-idf method using the question and answer database 21. This allows the answer candidate suggestion system 1 to easily calculate the similarity, and therefore to more easily generate the first answer candidate sentence.

また、回答候補提案システム１は、ネットワークＩ／Ｆ３６（送受信装置）に、回答候補文章（第１の回答候補文章及び第２の回答候補文章）を出力して、ネットワークＩ／Ｆ３６（送受信装置）に、回答候補文章を、ネットワークＮＷを介してオペレータ端末３に送信させる。これにより、オペレータ端末３を操作するオペレータは、容易に回答候補文章（第１の回答候補文章及び第２の回答候補文章）を読むことができる。 In addition, the answer candidate suggestion system 1 outputs the answer candidate sentences (first answer candidate sentence and second answer candidate sentence) to the network I/F 36 (transmitter/receiver device), and causes the network I/F 36 (transmitter/receiver device) to transmit the answer candidate sentences to the operator terminal 3 via the network NW. This allows the operator operating the operator terminal 3 to easily read the answer candidate sentences (first answer candidate sentence and second answer candidate sentence).

また、回答候補提案システム１は、ウェブ検索エンジン４に検索単語群を送信し、ウェブ検索エンジン４から返された検索単語群に関する検索結果に基づいて、第２の回答候補文章を生成する。これにより、回答候補提案システム１は、より容易に第２の回答候補文章を生成できる。 The answer candidate suggestion system 1 also transmits the search words to the web search engine 4, and generates the second answer candidate sentence based on the search results related to the search words returned from the web search engine 4. This allows the answer candidate suggestion system 1 to more easily generate the second answer candidate sentence.

また、回答候補提案システム１は、新規質問文章から疑問要望文を抽出し、疑問要望文に項目単語群を加えた質問情報を生成し（図１２のステップＳ２０２）、ｔｆ－ｉｄｆ法の重要度に基づいて質問情報の複数の質問情報形態素（質問情報の形態素）から検索単語群を生成する（図１２のステップＳ２０５）。これにより、回答候補提案システム１は、新規質問文章から質問の意図と関係の低い部分を除いた、質問の意図と関係の高い疑問要望文に基づいて、検索単語群を生成でき、ひいては、より適切な第２の回答候補文章を生成できる。また、検索単語群は、ｔｆ－ｉｄｆ法の重要度に基づいて生成されることにより、質問の意図により一層沿う検索単語群が生成できる。従って、回答候補提案システム１は、ユーザの質問の意図により一層沿う、好適な第２の回答候補文章を生成できる。 The answer candidate suggestion system 1 also extracts a question request sentence from the new question sentence, generates question information by adding a group of item words to the question request sentence (step S202 in FIG. 12), and generates a search word group from multiple question information morphemes (question information morphemes) of the question information based on the importance of the tf-idf method (step S205 in FIG. 12). As a result, the answer candidate suggestion system 1 can generate a search word group based on the question request sentence that is highly related to the intention of the question, excluding parts of the new question sentence that are less related to the intention of the question, and can thus generate a more appropriate second answer candidate sentence. In addition, the search word group is generated based on the importance of the tf-idf method, so that a search word group that is more in line with the intention of the question can be generated. Therefore, the answer candidate suggestion system 1 can generate a suitable second answer candidate sentence that is more in line with the intention of the user's question.

また、回答候補提案システム１は、ｔｆ－ｉｄｆ法の重要度が高い複数の質問情報形態素（質問情報の形態素）から、個人情報単語リストに保存されている個人情報単語を除いて、検索単語群を生成する（図１２のステップＳ２０５）。検索単語群は、ウェブ検索エンジン４に送信され、ウェブ検索エンジン４は、検索単語群で検索した検索結果を回答候補提案システム１に送信する。検索単語群には、ユーザのプライバシーに関わる個人情報単語が除かれているため、回答候補提案システム１は、ユーザのプライバシーを守った上で検索結果を取得でき、ひいてはユーザのプライバシーを守った上で第２の回答候補文章を生成できる。 The answer candidate suggestion system 1 also generates a search word group from multiple question information morphemes (question information morphemes) with high importance in the tf-idf method, excluding personal information words stored in the personal information word list (step S205 in FIG. 12). The search word group is sent to the web search engine 4, which sends search results searched for using the search word group to the answer candidate suggestion system 1. Because personal information words related to the user's privacy are excluded from the search word group, the answer candidate suggestion system 1 can obtain search results while protecting the user's privacy, and can therefore generate a second answer candidate sentence while protecting the user's privacy.

また、回答候補提案システム１は、検索単語群に関する検索結果に含まれるＷＥＢサイトの概要文に基づいて第２の回答候補文章を生成する（図１２のステップＳ２０６およびＳ２０７）。これにより、第２の回答候補文章の長さは、第２の回答候補文章の内容を把握することが容易になる程度に調整される。従って、オペレータが、第２の回答候補文章の内容を把握することが容易になる。 The answer candidate suggestion system 1 also generates second answer candidate sentences based on the summaries of the websites included in the search results related to the search word group (steps S206 and S207 in FIG. 12). This adjusts the length of the second answer candidate sentences to an extent that makes it easy to understand the content of the second answer candidate sentences. This makes it easy for the operator to understand the content of the second answer candidate sentences.

また、回答候補提案システム１は、ＷＥＢサイトの概要文を所定の順位付け方法で順位を付け、順位が上位のＷＥＢサイトの概要文を第２の回答候補文章とする（図１２のステップＳ２０７）。これにより、回答候補提案システム１は、より適切な第２の回答候補文章を生成できる。 The answer candidate suggestion system 1 also ranks the summaries of the websites using a predetermined ranking method, and sets the summaries of the top-ranked websites as the second answer candidate sentences (step S207 in FIG. 12). This allows the answer candidate suggestion system 1 to generate more appropriate second answer candidate sentences.

The answer candidate suggestion system 1 also calculates the similarity (cosine similarity) between the new question sentence and each past question sentence stored in the question and answer database 21, and extracts the past question sentences similar to the new question sentence (step S105 in FIG. 8). It then calculates the importance of each of the past question sentence morphemes generated from the extracted past question sentences, and extracts the past question sentence morphemes with high importance as the item candidate word group (steps S106 to S107 in FIG. 8). As a result, the item candidate word group consists of words (past question sentence morphemes) that have high importance, and thus relatively significant meaning, in past question sentences similar to the new question sentence. Because the item candidate word group is used to generate the answer candidate sentences (first answer candidate sentence and second answer candidate sentence), it is desirable that it consist of words that are significant to the question in the new question sentence. The answer candidate suggestion system 1 can therefore generate a more appropriate item candidate word group.

また、図８にフローチャートで一例を示す項目候補単語群生成処理において、質問回答データベース２１に保存された過去の質問文章それぞれに対する新規質問文章との類似度は、質問回答データベース２１を用いて算出されるｔｆ－ｉｄｆ法のコサイン類似度である（図８のステップＳ１０５）。また、複数の過去質問文章形態素それぞれの重要度は、質問回答データベース２１を用いて算出されるｔｆ－ｉｄｆ法の重要度である（図８のステップＳ１０６）。この様に、ｔｆ－ｉｄｆ法のコサイン類似度および重要度を用いることにより、回答候補提案システム１は、より容易に項目候補単語群を生成できる。 In addition, in the item candidate word group generation process, an example of which is shown in the flowchart in FIG. 8, the similarity between the new question sentence and each of the past question sentences stored in the question and answer database 21 is the cosine similarity calculated by the tf-idf method using the question and answer database 21 (step S105 in FIG. 8). In addition, the importance of each of the multiple past question sentence morphemes is the importance calculated by the tf-idf method using the question and answer database 21 (step S106 in FIG. 8). In this way, by using the cosine similarity and importance calculated by the tf-idf method, the answer candidate suggestion system 1 can more easily generate item candidate word groups.

なお、本発明は上述した実施例に限定されるものではなく、添付した特許請求の範囲の趣旨内における様々な変形例及び同等の構成が含まれる。たとえば、前述した実施例は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明したすべての構成を備えるものに本発明は限定されない。また、実施例の構成の一部について、他の構成の追加、削除、または置換をしてもよい。 The present invention is not limited to the above-described embodiments, but includes various modifications and equivalent configurations within the spirit of the appended claims. For example, the above-described embodiments have been described in detail to clearly explain the present invention, and the present invention is not necessarily limited to those having all of the configurations described. In addition, other configurations may be added to, deleted from, or substituted for part of the configuration of the embodiments.

1: Answer candidate suggestion system
2: User terminal
3: Operator terminal
4: Web search engine
11: Item candidate word group generation unit
11a: Item candidate word group generation program
12: Answer candidate sentence generation unit
12a: Answer candidate sentence generation program
21: Question and answer database
22: Interrogative word and request word list
23: Personal information word list
24: Supplementary word list
25: Item candidate word table
31: Processor
32: Main memory device
33: Sub-memory device
34: Input device
35: Output device
36: Network I/F
37: Bus

Claims

An answer candidate suggestion system that generates answer candidate sentences that are candidates for answer sentences to a new question sentence, the system comprising:
a processor and a storage device,
wherein the storage device stores a question and answer database that stores past question sentences and past answer sentences to the past question sentences in association with each other, and
wherein the processor,
when the new question sentence is input,
calculates a similarity between the new question sentence and each of the past question sentences stored in the question and answer database,
extracts the past question sentences similar to the new question sentence based on the calculated similarity with the new question sentence,
performs morphological analysis on the extracted past question sentences to generate a plurality of past question sentence morphemes,
calculates an importance of each of the plurality of past question sentence morphemes, and
extracts the past question sentence morphemes having high importance from the plurality of past question sentence morphemes and sets the extracted morphemes as an item candidate word group, and,
when an item word group selected by a user from the item candidate word group and the new question sentence are input,
generates question information including the item word group based on the item word group and the new question sentence,
calculates a similarity between the question information and each of the past question sentences stored in the question and answer database,
extracts past question sentences similar to the question information from the question and answer database based on the similarity with the question information, and
extracts from the question and answer database a past answer sentence associated with the extracted past question sentence similar to the question information, and sets the extracted past answer sentence as a first answer candidate sentence.

The answer candidate suggestion system according to claim 1,
wherein the storage device further stores an interrogative word and request word list that stores interrogative words expressing a question and request words expressing a request, and
wherein the processor
extracts, from the new question sentence, an interrogative request sentence including at least one of the interrogative words or the request words stored in the interrogative word and request word list, and
generates the question information by adding the item word group to the extracted interrogative request sentence.

The answer candidate suggestion system according to claim 1,
wherein the similarity with the question information is a cosine similarity calculated by a tf-idf method using the question and answer database.

The answer candidate suggestion system according to claim 1,
further comprising a transmitting/receiving device that is connected to a network and is capable of transmitting and receiving information via the network,
wherein the processor outputs the first answer candidate sentence to the transmitting/receiving device.

The answer candidate suggestion system according to claim 1,
The system further comprises a transmitting/receiving device that is connected to a network to which a web search engine is connected, the web search engine returning, upon receiving at least one word, search results including information on web sites related to the received word, and the transmitting/receiving device being capable of transmitting and receiving information to and from the web search engine via the network,
The storage device further stores an interrogative/request word list that stores interrogative words expressing questions and request words expressing requests,
The processor,
extracting, from the new question sentence, a question/request sentence including at least one of the interrogative words or the request words stored in the interrogative/request word list;
generating the question information by adding the item word group to the extracted question/request sentence;
morphologically analyzing the question information to generate a plurality of question information morphemes;
calculating a TF-IDF importance for each of the plurality of question information morphemes using the question and answer database;
generating a search word group from the plurality of question information morphemes based on the TF-IDF importance of each of the plurality of question information morphemes;
outputting the generated search word group to the transmitting/receiving device so that the transmitting/receiving device transmits the generated search word group to the web search engine;
obtaining, from the transmitting/receiving device, search results relating to the search word group that the transmitting/receiving device received from the web search engine;
generating a second answer candidate sentence based on the obtained search results relating to the search word group;
Answer candidate suggestion system.

The answer candidate suggestion system according to claim 5,
The storage device further stores a personal information word list storing a plurality of personal information words representing personal information;
The processor,
extracting, from the plurality of question information morphemes, a plurality of question information morphemes having a high TF-IDF importance, and generating the search word group by excluding, from the extracted morphemes, at least one of the personal information words stored in the personal information word list;
Answer candidate suggestion system.
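
The search word generation of claims 5 and 6 can be sketched as below: score each morpheme of the question information by TF-IDF against the past questions, keep the highest-scoring ones, and then drop any that appear in the personal information word list. Whitespace tokenization, the smoothed IDF variant, the `top_k` cutoff, and all names are assumptions for illustration.

```python
import math
from collections import Counter


def tfidf_importance(tokens: list[str], corpus: list[list[str]]) -> dict[str, float]:
    """TF-IDF importance of each token, with document frequencies taken from the corpus."""
    n = len(corpus)
    df = Counter(t for doc in corpus for t in set(doc))
    tf = Counter(tokens)
    return {t: c * (math.log((1 + n) / (1 + df.get(t, 0))) + 1.0)
            for t, c in tf.items()}


def search_word_group(question_info: str,
                      past_questions: list[str],
                      personal_words: list[str],
                      top_k: int = 3) -> list[str]:
    # Whitespace splitting stands in for morphological analysis (assumption).
    tokens = question_info.lower().split()
    corpus = [q.lower().split() for q in past_questions]
    scores = tfidf_importance(tokens, corpus)
    # Highest importance first; ties are broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    blocked = {w.lower() for w in personal_words}
    # Extract the high-importance morphemes first, then exclude personal words.
    return [t for t in ranked[:top_k] if t not in blocked]


past_questions = [
    "printer will not print",
    "printer shows error",
    "reset printer password",
]
print(search_word_group("yamada printer firmware error", past_questions, ["Yamada"]))
```

A personal name tends to be rare in the database and therefore scores high under TF-IDF, which is why the exclusion step matters before the words are sent to an external search engine.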

The answer candidate suggestion system according to claim 5,
the search results returned by the web search engine include a summary of each web site;
The processor generates the second answer candidate sentence based on the summaries of the web sites included in the search results relating to the search word group.
Answer candidate suggestion system.

The answer candidate suggestion system according to claim 7,
the processor ranks the summaries of the web sites included in the search results relating to the search word group using a predetermined ranking method, and sets the higher-ranked summaries as second answer candidate sentences.
Answer candidate suggestion system.
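
The claim leaves the ranking method open ("a predetermined ranking method"). Purely as an illustration, the sketch below ranks summaries by how many search words each contains. The overlap score, the dictionary layout of a search result, the example URLs, and the `top_n` parameter are all assumptions and are not taken from the claim.

```python
def rank_summaries(search_results: list[dict],
                   search_words: list[str],
                   top_n: int = 2) -> list[str]:
    """Return the top_n summaries, ranked by the number of distinct search words they contain."""
    words = {w.lower() for w in search_words}

    def score(result: dict) -> int:
        # Count distinct search words that occur in the summary text.
        return len(words & set(result["summary"].lower().split()))

    # sorted() is stable, so equally scored results keep the engine's order.
    ranked = sorted(search_results, key=score, reverse=True)
    return [r["summary"] for r in ranked[:top_n]]


results = [
    {"url": "https://example.com/a", "summary": "General printer news"},
    {"url": "https://example.com/b", "summary": "Fix a printer firmware error quickly"},
    {"url": "https://example.com/c", "summary": "Cooking recipes"},
]
print(rank_summaries(results, ["printer", "firmware", "error"]))
```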

The answer candidate suggestion system according to claim 1,
the similarity with the new question sentence is a cosine similarity calculated by a TF-IDF method using the question and answer database,
the importance of each of the plurality of past question sentence morphemes is an importance calculated by a TF-IDF method using the question and answer database;
Answer candidate suggestion system.

An answer candidate suggestion method in an answer candidate suggestion system that generates answer candidate sentences that are candidates for answer sentences to a new question sentence, comprising:
The answer candidate suggestion system includes a processor and a storage device,
The storage device stores a question and answer database that stores past question sentences and past answer sentences to the past question sentences in association with each other,
The processor,
When the new question sentence is input,
calculating a similarity between the new question sentence and each of the past question sentences stored in the question and answer database;
extracting the past question sentences similar to the new question sentence based on the calculated similarity with the new question sentence;
performing morphological analysis on the extracted past question sentences to generate a plurality of past question sentence morphemes;
calculating an importance of each of the plurality of past question sentence morphemes;
extracting the past question sentence morphemes having high importance from the plurality of past question sentence morphemes, and setting the extracted morphemes as an item candidate word group;
When the item word group selected by the user from the item candidate word group and the new question sentence are input,
generating question information including the item word group based on the item word group and the new question sentence;
calculating a similarity between the question information and each of the past question sentences stored in the question and answer database;
extracting past question sentences similar to the question information from the question and answer database based on the similarity with the question information;
extracting, from the question and answer database, a past answer sentence associated with the extracted past question sentence similar to the question information, and setting the extracted past answer sentence as a first answer candidate sentence;
Answer candidate suggestion method.
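
The first half of the method, up to the item candidate word group that is presented to the user, can be sketched as follows. The independent claim does not fix the similarity or importance measures, so this sketch substitutes Jaccard token overlap for the similarity (a dependent claim names TF-IDF cosine similarity instead) and uses a smoothed TF-IDF score for the importance. Whitespace tokenization, the cutoffs, and the function names are likewise assumptions.

```python
import math
from collections import Counter


def tokenize(text: str) -> list[str]:
    # Whitespace splitting stands in for morphological analysis (assumption).
    return text.lower().split()


def jaccard(a: list[str], b: list[str]) -> float:
    """Token-set overlap, used here as a stand-in similarity measure."""
    sa, sb = set(a), set(b)
    return len(sa & sb) / len(sa | sb) if sa | sb else 0.0


def item_candidate_words(new_question: str,
                         past_questions: list[str],
                         top_similar: int = 2,
                         top_words: int = 3) -> list[str]:
    corpus = [tokenize(q) for q in past_questions]
    query = tokenize(new_question)
    # Extract the past question sentences most similar to the new question.
    similar = sorted(corpus, key=lambda d: jaccard(query, d), reverse=True)[:top_similar]
    # Importance of each morpheme in the similar questions: term frequency
    # within them, weighted by IDF over the whole database.
    n = len(corpus)
    df = Counter(t for d in corpus for t in set(d))
    tf = Counter(t for d in similar for t in d)
    importance = {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0)
                  for t, c in tf.items()}
    # Highest importance first; ties are broken alphabetically.
    ranked = sorted(importance, key=lambda t: (-importance[t], t))
    return ranked[:top_words]


past_questions = [
    "printer firmware update fails",
    "printer firmware error code",
    "change account email",
    "reset account password",
]
print(item_candidate_words("my printer firmware is broken", past_questions))
```

The words returned here would be offered to the user, and the subset the user selects becomes the item word group that is combined with the new question sentence in the later steps.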