JP7700862B2 - Summary learning support device, summary learning support method and program

Info

Publication number: JP7700862B2
Application number: JP2023543588A
Authority: JP
Inventors: いつみ斉藤; 京介西田; 仙吉田
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2021-08-26
Filing date: 2021-08-26
Publication date: 2025-07-01
Anticipated expiration: 2041-08-26
Also published as: JPWO2023026444A1; WO2023026444A1

Description

The present invention relates to a summary learning support device, a summary learning support method, and a program.

Training data for a model that generates summaries using a neural network typically consists of pairs of a source text to be summarized and summary data that is the correct summarization result.

On the other hand, some models require an input parameter other than the source text (hereinafter referred to as a "query") (for example, Non-Patent Document 1). Such a model can generate a summary that conforms to the query. For such a model, a set of parameters such as the source text, the query, and the summary data is used as training data.

Meanwhile, summaries can be produced by an extractive method or a generative method. In the extractive method, a portion of the source text is extracted as is. In the generative method, summary data is generated based on the words and other elements contained in the source text. Hereinafter, a model that requires a query as input and generates summary data by the generative method is referred to as a "query-dependent generative model."

Gonçalo M. Correia and André F. T. Martins, "A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning," Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3050-3056, July 28-August 2, 2019.

Although a large amount of training data consisting of pairs of source text and summary data exists, training data that also includes an additional input parameter other than the source text, as needed to train a query-dependent generative model, is scarce.

The present invention has been made in view of the above, and aims to make training for summarization that requires an additional input parameter more efficient.

To solve the above problem, a summary learning support device includes: a calculation unit that calculates, for each of a plurality of character strings and based on a predetermined model, a score indicating suitability as an input parameter to be added when summarizing a first document; and a selection unit that selects, based on the scores, a group of character strings from among the plurality of character strings as the input parameter constituting training data for a summary generation model that generates a summary of a document. The score is the score that a model, which has learned the correspondence between the body of a document and the group of character strings constituting the title of that document, calculates for each output candidate character string in order to select the character string to be output from among the output candidate character strings when a second document, which is a summary of the first document, is input to the model.

Training for summarization that requires an additional input parameter can be made more efficient.

FIG. 1 is a diagram showing an example of the hardware configuration of the summary generation device 10 in the first embodiment.
FIG. 2 is a diagram showing an example of the functional configuration of the summary generation device 10 in the first embodiment.
FIG. 3 is a diagram showing an example of the configuration of the query-containing learning data generation unit 11 in the first embodiment.
FIG. 4 is a flowchart illustrating an example of the processing procedure for generating query-containing learning data in the first embodiment.
FIG. 5 is a diagram showing an example of the configuration of the query-containing learning data generation unit 11 in the second embodiment.
FIG. 6 is a flowchart illustrating an example of the processing procedure for generating query-containing learning data in the second embodiment.
FIG. 7 is a diagram for explaining training of the summary generation model and generation of a summary in the third embodiment.
FIG. 8 is a diagram for explaining training of the summary generation model and generation of a summary in the fourth embodiment.

An embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a diagram showing an example of the hardware configuration of the summary generation device 10 in the first embodiment. The summary generation device 10 in FIG. 1 includes a drive device 100, an auxiliary storage device 102, a memory device 103, a processor 104, an interface device 105, and the like, which are interconnected by a bus B.

The program that implements the processing in the summary generation device 10 is provided on a recording medium 101 such as a CD-ROM. When the recording medium 101 storing the program is set in the drive device 100, the program is installed from the recording medium 101 into the auxiliary storage device 102 via the drive device 100. However, the program does not necessarily have to be installed from the recording medium 101 and may instead be downloaded from another computer via a network. The auxiliary storage device 102 stores the installed program as well as necessary files, data, and the like.

When an instruction to start the program is received, the memory device 103 reads the program from the auxiliary storage device 102 and stores it. The processor 104 is a CPU, a GPU (Graphics Processing Unit), or both a CPU and a GPU, and executes the functions of the summary generation device 10 in accordance with the program stored in the memory device 103. The interface device 105 is used as an interface for connecting to a network.

FIG. 2 is a diagram showing an example of the functional configuration of the summary generation device 10 in the first embodiment. In FIG. 2, the summary generation device 10 includes a query-containing learning data generation unit 11, a summary learning unit 12, and a summarization unit 13. Each of these units is implemented by processing that one or more programs installed in the summary generation device 10 cause the processor 104 to execute.

The query-containing learning data generation unit 11 generates query-containing learning data based on each piece of query-free learning data included in a query-free learning data group given as input. One piece of query-containing learning data is generated for each piece of query-free learning data. Thus, for a query-free learning data group, which is a set of multiple pieces of query-free learning data, a query-containing learning data group, which is a set of multiple pieces of query-containing learning data, is generated. Both query-free learning data and query-containing learning data are data used as training data for a model, such as a neural network, that generates a summary of a document (hereinafter referred to as a "summary generation model"). Query-free learning data differs from query-containing learning data in that it does not include a query as a component. A query is text (a character string) that is input to the summary generation model together with the document to be summarized, as additional information about the summary. For example, the focus of the summary may be used as the query.

Query-free learning data is training data consisting of a set of two pieces of text data: {source text, summary text}. The source text is the text data of the document to be summarized. The summary text is the text data representing the correct result of summarizing the source text.

In contrast, query-containing learning data is training data consisting of a set of three pieces of text data: {source text, query, summary text}.
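To make the difference between the two record types concrete, the following is a minimal sketch of how they might be represented. The class names `QueryFreeExample` and `QueryContainingExample` and the helper `add_query` are hypothetical and do not appear in the publication; the query is modeled here as a tuple of words, which is one possible reading of "text (a character string)".

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class QueryFreeExample:
    """{source text, summary text}: one piece of query-free learning data."""
    source_text: str
    summary_text: str


@dataclass(frozen=True)
class QueryContainingExample:
    """{source text, query, summary text}: one piece of query-containing learning data."""
    source_text: str
    query: Tuple[str, ...]  # assumption: the query is held as a sequence of words
    summary_text: str


def add_query(example: QueryFreeExample, query: Sequence[str]) -> QueryContainingExample:
    """Attach a query to a query-free example; the two original texts are unchanged."""
    return QueryContainingExample(example.source_text, tuple(query), example.summary_text)
```

Under this representation, one query-free record always yields exactly one query-containing record, matching the one-to-one relationship described above.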

The summary learning unit 12 trains the summary generation model using the query-containing learning data.

When the summarization unit 13 receives input such as a source text to be summarized and a query for that source text, it inputs the source text and the query into the trained summary generation model, thereby causing the summary generation model to generate a summary of the source text that corresponds to the query.

The query-containing learning data generation unit 11 is now described in more detail. FIG. 3 is a diagram showing an example of the configuration of the query-containing learning data generation unit 11 in the first embodiment. In FIG. 3, the query-containing learning data generation unit 11 includes an importance calculation unit 111, a query selection unit 112, and a query addition unit 113. The functions of these units are described in detail with reference to FIG. 4.

FIG. 4 is a flowchart illustrating an example of the processing procedure for generating query-containing learning data in the first embodiment.

In step S101, for each piece of query-free learning data (a pair of source text and summary text) included in the query-free learning data group, the importance calculation unit 111 generates a document from which character strings that are query candidates are to be extracted (hereinafter referred to as an "extraction source document"). Therefore, N extraction source documents are generated from N pieces of query-free learning data.

For example, the importance calculation unit 111 generates any one of the following (a) to (d) from a piece of query-free learning data as the extraction source document based on that query-free learning data.
(a) A document that combines the source text and the summary text (a document that contains both the source text and the summary text)
(b) The summary text only
(c) The source text only
(d) A document that combines any one of (a) to (c) with other auxiliary information text (for example, the title of the source text)

Next, based on a predetermined model, the importance calculation unit 111 calculates, for each character string of a predetermined unit (for example, each word) constituting each extraction source document, its importance within the group of extraction source documents, as an example of a score indicating suitability as a query (an additional input parameter) used when summarizing a document (S102). For example, the importance calculation unit 111 uses a TF-IDF calculation model as the predetermined model. In this case, the importance calculation unit 111 calculates the TF-IDF value of each word as its importance. The TF-IDF value of each word in a document group can be calculated by a known method. Note that, in this embodiment, the "parameter" in "input parameter" is clearly distinguished from, for example, a learning parameter of a model such as a neural network. An input parameter is data given as input to the model, whereas a learning parameter is data whose value changes as the model is trained. Typically, input parameters are given as text data or the like, whereas learning parameters are expressed as a set of numerical values or the like.
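Steps S101 and S102 can be sketched as follows. This is only an illustration under stated assumptions: the extraction source document uses option (a), tokenization is a lowercase whitespace split, and TF-IDF uses term frequency normalized by document length multiplied by log(N / document frequency). The publication only says that a known method may be used, so this particular variant, and the function names `build_extraction_document` and `tf_idf`, are hypothetical.

```python
import math
from collections import Counter
from typing import Dict, List


def build_extraction_document(source_text: str, summary_text: str) -> List[str]:
    """S101, option (a): combine source and summary text, then split into words."""
    return (source_text + " " + summary_text).lower().split()


def tf_idf(documents: List[List[str]]) -> List[Dict[str, float]]:
    """S102: return, for each document, a mapping from word to TF-IDF importance."""
    n_docs = len(documents)
    # Document frequency: the number of documents in which each word occurs.
    doc_freq: Counter = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))
    scores = []
    for tokens in documents:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            word: (count / total) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return scores
```

With this variant, a word that appears in every extraction source document receives an importance of zero, so common function words are unlikely to be chosen as queries.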

Next, for each extraction source document, the query selection unit 112 selects K character strings, in descending order of importance, from among the character strings (words) of the predetermined unit constituting that extraction source document, as the query for the query-free learning data corresponding to that extraction source document (S103). The value of K (K >= 0) may be chosen randomly for each extraction source document or may be the same for all extraction source documents. In addition, when selecting a query from each extraction source document, the query selection unit 112 may select only words that are included in the summary text of that extraction source document. Doing so makes it easier to train the summary generation model to include the specified query in the summary.

Next, for each piece of query-free learning data, the query addition unit 113 generates query-containing learning data by adding the K words selected from the extraction source document based on that query-free learning data to the query-free learning data as a query (S104). Therefore, the generated query-containing learning data includes the source text and summary text that were contained in the query-free learning data, together with the K queries (the query sequence) extracted from that query-free learning data.
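Steps S103 and S104 can be sketched as below, assuming the per-word importance values are already available as a dictionary. The function names, the dictionary-based record layout, and the alphabetical tie-break between equally important words are assumptions made for illustration; the publication does not specify how ties are resolved.

```python
from typing import Dict, List, Optional, Set


def select_query(importance: Dict[str, float], k: int,
                 allowed: Optional[Set[str]] = None) -> List[str]:
    """S103: pick the K most important words, optionally restricted to `allowed`."""
    candidates = [w for w in importance if allowed is None or w in allowed]
    # Sort by descending importance; ties are broken alphabetically (an assumption).
    candidates.sort(key=lambda w: (-importance[w], w))
    return candidates[:k]


def make_query_containing_example(source_text: str, summary_text: str,
                                  importance: Dict[str, float], k: int,
                                  summary_only: bool = False) -> dict:
    """S104: add the selected query sequence to the original pair of texts."""
    allowed = set(summary_text.lower().split()) if summary_only else None
    return {
        "source_text": source_text,
        "query": select_query(importance, k, allowed),
        "summary_text": summary_text,
    }
```

Setting `summary_only=True` corresponds to the variation in which only words that occur in the summary text are eligible, and `k=0` yields an empty query sequence.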

As described above, according to the first embodiment, pseudo queries can be generated from training data that does not include queries. Therefore, training for summarization that requires an additional input parameter can be made more efficient.

Next, a second embodiment is described. The description of the second embodiment covers the points that differ from the first embodiment. Points not specifically mentioned in the second embodiment may be the same as in the first embodiment.

In the second embodiment, the configuration of the query-containing learning data generation unit 11 and the processing procedure it executes differ from those in the first embodiment.

FIG. 5 is a diagram showing an example of the configuration of the query-containing learning data generation unit 11 in the second embodiment. In FIG. 5, parts that are the same as or correspond to those in FIG. 3 are given the same reference numerals. In the second embodiment, the query-containing learning data generation unit 11 includes a query generation model learning unit 114 and a query candidate generation unit 115 instead of the importance calculation unit 111. The query generation model learning unit 114 trains a model that generates one or more queries from query-free learning data (hereinafter referred to as a "query generation model"). The query generation model is composed of, for example, a neural network. The query generation model learning unit 114 takes as input a learning document group from which the training data for the query generation model is derived. The learning document group is a set of multiple learning documents. A learning document is text-format document data that includes a title (headline) and a body, such as an encyclopedia published on the Internet (for example, Wikipedia) or a newspaper.

The query candidate generation unit 115 generates (outputs) query candidates based on the trained query generation model.

FIG. 6 is a flowchart illustrating an example of the processing procedure for generating query-containing learning data in the second embodiment.

In step S201, the query generation model learning unit 114 generates training data for the query generation model for each learning document included in the learning document group. Specifically, the query generation model learning unit 114 splits the title of each learning document into character strings of a predetermined unit (for example, words). Thus, for example, a sequence of the words constituting the title (hereinafter simply referred to as a "word sequence") is generated for each learning document. At this time, the query generation model learning unit 114 may generate a word sequence from which stop words have been removed. For each learning document, the query generation model learning unit 114 generates, as training data, a pair consisting of the body (paragraph text) of that learning document and the word sequence generated from the title corresponding to that body.
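The construction of training pairs in step S201 might look like the sketch below. The whitespace tokenizer, the tiny `STOP_WORDS` set, and the function names are hypothetical placeholders; a practical system would likely use a language-appropriate tokenizer and stop-word list.

```python
from typing import Iterable, List, Tuple

# Hypothetical stop-word list used only for this illustration.
STOP_WORDS = frozenset({"a", "an", "the", "of", "in", "and"})


def title_to_word_sequence(title: str, remove_stop_words: bool = True) -> List[str]:
    """Split a title into words, optionally dropping stop words."""
    words = title.lower().split()
    if remove_stop_words:
        words = [w for w in words if w not in STOP_WORDS]
    return words


def build_query_model_training_data(
    documents: Iterable[Tuple[str, str]],
) -> List[Tuple[str, List[str]]]:
    """For each (title, body) document, emit a (body, title word sequence) pair."""
    return [(body, title_to_word_sequence(title)) for title, body in documents]
```

Each resulting pair has the body as the model input and the title word sequence as the target output, which is the correspondence the query generation model is then trained on.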

Next, the query generation model learning unit 114 trains the query generation model using the training data group generated in step S201 (S202). Specifically, the query generation model learning unit 114 causes the query generation model to learn the correspondence between the body and the word sequence, with the body of each piece of training data as the input and the word sequence of the title as the output. Thus, the query generation model is trained so that, when the body of a document is input, it outputs a word sequence related to the title of that document. Note that the query generation model may be configured as, for example, a known encoder-decoder model or another known sentence generation model.

Next, for each piece of query-free learning data, the query candidate generation unit 115 inputs the summary text of that query-free learning data into the trained query generation model and generates the group of character strings (word sequence) output by the query generation model as the query candidate sequence corresponding to that query-free learning data (S203).

At this time, if the query generation model is an encoder-decoder model, the query generation model sequentially outputs each word constituting the word sequence in response to the input of the query-free learning data. In each step of this sequential output, the query generation model calculates, for each of the D words constituting its vocabulary (the set of words that are output candidates of the query generation model), a score for selecting the word to be output from among the output candidates, and outputs the word with the highest score. In the second embodiment, this score corresponds to an example of the score indicating suitability as a query (an additional input parameter) used when summarizing a document.
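The sequential output described above corresponds to greedy decoding, which can be sketched as follows. The trained query generation model is replaced here by a hypothetical `score_fn` that returns one raw score per vocabulary word given the words output so far; the softmax normalization, the end token `</s>`, and the `max_words` cap are assumptions of this sketch, not details stated in the publication.

```python
import math
from typing import Callable, List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Normalize raw scores into a probability distribution over the vocabulary."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_decode(score_fn: Callable[[List[str]], Sequence[float]],
                  vocabulary: Sequence[str], max_words: int,
                  end_token: str = "</s>") -> List[str]:
    """At each step, score all D vocabulary words and output the highest-scoring one."""
    output: List[str] = []
    for _ in range(max_words):
        scores = softmax(score_fn(output))
        best = max(range(len(vocabulary)), key=scores.__getitem__)
        if vocabulary[best] == end_token:
            break
        output.append(vocabulary[best])
    return output


# Toy stand-in for a trained model: fixed scores that depend only on the step index.
VOCABULARY = ["moon", "orbit", "</s>"]
_TOY_TABLE = [[2.0, 1.0, 0.0], [0.0, 2.0, 1.0], [0.0, 0.0, 3.0]]


def toy_scores(prefix: List[str]) -> Sequence[float]:
    return _TOY_TABLE[min(len(prefix), 2)]
```

Calling `greedy_decode(toy_scores, VOCABULARY, max_words=5)` walks through the three steps of the toy table and stops when the end token scores highest.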

Next, for each piece of query-free learning data, the query selection unit 112 selects one or more words (a query sequence) to be used as the query from the query candidate sequence that the query candidate generation unit 115 generated for that query-free learning data (S204). At this time, the query selection unit 112 may select the entire query candidate sequence as the query sequence or may select part of it. When selecting part of the query candidate sequence, the query selection unit 112 may select the words from the first to the K-th word of the query candidate sequence as the query sequence. That is, of the words that the query generation model outputs sequentially, the words up to the K-th may be selected as the query. Alternatively, in step S203, the number of sequential word outputs from the query generation model may be limited to K. In this case, the query candidate sequence consists of K words, so in step S204 the entire query candidate sequence may simply be selected as the query sequence.

Next, for each piece of query-free learning data, the query addition unit 113 generates query-containing learning data by adding the query sequence selected for that query-free learning data to the query-free learning data (S205).

As described above, the second embodiment provides the same effects as the first embodiment.

Next, as a third embodiment, a first example of training the summary generation model using query-containing learning data and of generating a summary using the trained summary generation model is described. Note that the third embodiment is applicable to both the first embodiment and the second embodiment.

FIG. 7 is a diagram for explaining training of the summary generation model and generation of a summary in the third embodiment. In FIG. 7, the summarization unit 13 includes a content selection unit 131, an encoder 132, and a decoder 133. These units constitute the summary generation model.

When training the summary generation model, the summary learning unit 12 inputs to the summarization unit 13, for each piece of training data (source text, query sequence, summary text) included in the query-containing learning data group, the source text and the query sequence of that training data.

The content selection unit 131 is a model (for example, a neural network) that calculates an importance for each character string (for example, each word) constituting the text obtained by combining the source text and the query sequence (hereinafter referred to as the "combined text"). The content selection unit 131 may be configured by fine-tuning a pre-trained model such as BERT or MASS. For details on BERT, see, for example, "https://arxiv.org/abs/1810.04805". For details on MASS, see, for example, "https://arxiv.org/abs/1905.02450".

The content selection unit 131 extracts a sequence of N words (a keyword sequence) from the combined text in descending order of importance, and inputs the keyword sequence, the source text given as input, and the query sequence to the encoder 132. At this time, the content selection unit 131 joins the query sequence, the keyword sequence, and the source text with a special token such as [SEP], as in "query sequence [SEP] keyword sequence [SEP] source text". Note that the value of N may be given as input to the content selection unit 131 together with the query and the like.
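The way the encoder input is assembled in the third embodiment can be sketched as follows. The importance values are assumed to come from the content selection model and are passed in as a plain dictionary; the function names and the alphabetical tie-break are hypothetical.

```python
from typing import Dict, List, Sequence


def extract_keywords(importance: Dict[str, float], n: int) -> List[str]:
    """Return the N words with the highest importance, in descending order."""
    ranked = sorted(importance, key=lambda w: (-importance[w], w))
    return ranked[:n]


def build_encoder_input(query: Sequence[str], keywords: Sequence[str],
                        source_text: str, sep: str = "[SEP]") -> str:
    """Join as 'query sequence [SEP] keyword sequence [SEP] source text'."""
    return f" {sep} ".join([" ".join(query), " ".join(keywords), source_text])
```

For example, with the query `["moon"]`, the keywords `["moon", "earth"]`, and the source text "The moon orbits the earth.", the encoder input becomes "moon [SEP] moon earth [SEP] The moon orbits the earth.".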

The encoder 132 and the decoder 133 are, for example, a known encoder-decoder model (neural network) such as BERT or MASS.

The encoder 132 encodes the input text. The decoder 133 generates and outputs a summary text based on the encoding result.

The summary learning unit 12 updates the learning parameters of the encoder 132 and the decoder 133 based on a comparison between the summary text included in the training data and the summary text output by the decoder 133. The comparison and the update of the learning parameters may be performed using known techniques.
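The publication leaves the comparison to known techniques. One common choice for sequence generation models, assumed here purely for illustration, is the average token-level cross-entropy between the decoder's predicted word distributions and the reference summary. The function name and the representation of predictions as lists of probabilities are hypothetical.

```python
import math
from typing import Sequence


def cross_entropy(predicted: Sequence[Sequence[float]],
                  reference_ids: Sequence[int]) -> float:
    """Average negative log-probability assigned to each reference token.

    predicted[t][i] is the probability the decoder gives to vocabulary word i
    at step t; reference_ids[t] is the index of the correct word at step t.
    """
    if len(predicted) != len(reference_ids):
        raise ValueError("one predicted distribution is required per reference token")
    total = -sum(math.log(dist[i]) for dist, i in zip(predicted, reference_ids))
    return total / len(reference_ids)
```

A lower value means the decoder output is closer to the reference summary; in a typical training loop, the gradient of this loss would drive the update of the learning parameters.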

Once training is complete, the summarization unit 13 functions as a trained summary generation model that takes a query sequence and an input text as input and outputs a summary text.

Note that the summarization unit 13 in FIG. 7 may be configured using the technology disclosed in International Publication No. 2021/064907.

Next, as a fourth embodiment, a second example of training the summary generation model using query-containing learning data and of generating a summary using the trained summary generation model is described. Note that the fourth embodiment is applicable to both the first embodiment and the second embodiment.

FIG. 8 is a diagram for explaining training of the summary generation model and generation of a summary in the fourth embodiment. In FIG. 8, parts that are the same as or correspond to those in FIG. 7 are given the same reference numerals. In FIG. 8, the summarization unit 13 includes an encoder 132 and a decoder 133. These units constitute the summary generation model. In other words, the summary generation model of the fourth embodiment does not have the content selection unit 131.

When training the summary generation model, the summary learning unit 12 inputs to the summarization unit 13, for each piece of training data (source text, query sequence, summary text) included in the query-containing learning data group, the source text and the query sequence of that training data. At this time, the summary learning unit 12 joins the query sequence and the source text with a special token such as [SEP], as in "query sequence [SEP] source text".
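In the fourth embodiment the model input is simply the query sequence and the source text joined by the special token. A minimal sketch, assuming a dictionary-based record with the hypothetical keys `query`, `source_text`, and `summary_text`:

```python
from typing import Tuple


def build_training_pair(example: dict, sep: str = "[SEP]") -> Tuple[str, str]:
    """Return (model input, target summary) for one query-containing example."""
    query_text = " ".join(example["query"])
    # strip() removes the leading space left when the query is empty (K = 0);
    # how an empty query should be encoded is an assumption of this sketch.
    model_input = f"{query_text} {sep} {example['source_text']}".strip()
    return model_input, example["summary_text"]
```

The first element is what would be passed to the encoder 132, and the second is the reference against which the output of the decoder 133 is compared.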

Note that, in the fourth and fifth embodiments, the summarization unit 13 may be configured based on a sentence generation model other than an encoder-decoder model.

The following additional notes are further disclosed with respect to the above embodiments.

(Additional note 1)
A summary learning support device comprising:
a memory; and
at least one processor coupled to the memory,
wherein the processor:
calculates, for each of a plurality of character strings and based on a predetermined model, a score indicating suitability as an input parameter to be added when summarizing a first document; and
selects, based on the scores, a group of character strings from among the plurality of character strings as the input parameter constituting training data for a summary generation model that generates a summary of a document.

(Supplementary note 2)
A recording medium storing a program that causes a computer to execute a process comprising:
calculating, for each of a plurality of character strings, a score representing its suitability as an input parameter to be added when summarizing a first document, based on a predetermined model; and
selecting, based on the scores, a group of character strings from among the plurality of character strings as the input parameters constituting training data for a summary generation model that generates a summary of a document.
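The two operations recited in the supplementary notes above, scoring candidate character strings and then selecting a subset by score, can be sketched as follows. This is an illustration under assumptions, not the scoring model of the embodiments: relative word frequency in a reference document stands in for the "predetermined model", and the names `frequency_scores`, `select_queries`, and the `top_k` parameter are hypothetical.

```python
from collections import Counter
from typing import Dict, List


def frequency_scores(candidates: List[str], document: str) -> Dict[str, float]:
    """Score each candidate by its relative frequency in `document`.

    Assumption: whitespace tokenization and case-insensitive matching
    stand in for the predetermined scoring model.
    """
    counts = Counter(document.lower().split())
    total = sum(counts.values()) or 1  # avoid division by zero on empty text
    return {c: counts[c.lower()] / total for c in candidates}


def select_queries(scores: Dict[str, float], top_k: int = 3) -> List[str]:
    """Return the `top_k` highest-scoring strings (ties broken alphabetically)."""
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [string for string, _ in ranked[:top_k]]
```

The selected strings would then be attached to the source text and summary as the query part of a training example; any other scoring function returning one value per candidate could be substituted for `frequency_scores`.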

In each of the above embodiments, the summary generation device 10 is an example of a summary learning support device. The importance calculation unit 111 or the query candidate generation unit 115 (query generation model) is an example of a calculation unit. The query selection unit 112 is an example of a selection unit. The summary learning unit 12 is an example of a learning unit.

Although embodiments of the present invention have been described in detail above, the present invention is not limited to these specific embodiments, and various modifications and changes are possible within the scope of the gist of the present invention as set forth in the claims.

REFERENCE SIGNS LIST
10 Summary generation device
11 Query-containing learning data generation unit
12 Summary learning unit
13 Summarization unit
100 Drive device
101 Recording medium
102 Auxiliary storage device
103 Memory device
104 Processor
105 Interface device
111 Importance calculation unit
112 Query selection unit
113 Query addition unit
114 Query generation model learning unit
115 Query candidate generation unit
131 Content selection unit
132 Encoder
133 Decoder
B Bus

Claims

A summary learning support device comprising:
a calculation unit that calculates, for each of a plurality of character strings, a score representing its suitability as an input parameter to be added when summarizing a first document, based on a predetermined model; and
a selection unit that selects, based on the scores, a group of character strings from among the plurality of character strings as the input parameters constituting training data for a summary generation model that generates a summary of a document,
wherein the score is a score calculated for each output-candidate character string so that the model can select the character string to be output from among the output-candidate character strings when a second document, which is a summary of the first document, is input to a model that has learned a correspondence between the body of a document and a group of character strings constituting the title of the document.

2. The summary learning support device according to claim 1, wherein the score is a degree of importance, within a third document, of each of a plurality of character strings constituting the third document, the third document including either or both of the first document and a second document that is a summary of the first document.

3. The summary learning support device according to claim 1, further comprising a learning unit that trains the summary generation model using training data including the first document, the group of character strings, and a second document that is a summary of the first document.

4. The summary learning support device according to claim 3, further comprising a summarization unit that inputs a given document and a character string related to a summary of the given document to the summary generation model trained by the learning unit, and generates a summary of the given document.

A summary learning support method in which a computer executes:
a calculation step of calculating, for each of a plurality of character strings, a score representing its suitability as an input parameter to be added when summarizing a first document, based on a predetermined model; and
a selection step of selecting, based on the scores, a group of character strings from among the plurality of character strings as the input parameters constituting training data for a summary generation model that generates a summary of a document,
wherein the score is a score calculated for each output-candidate character string so that the model can select the character string to be output from among the output-candidate character strings when a second document, which is a summary of the first document, is input to a model that has learned a correspondence between the body of a document and a group of character strings constituting the title of the document.

5. A program for causing a computer to function as the summary learning support device according to claim 1.