JP5900486B2

JP5900486B2 - Related specification mapping system, related specification mapping method and program

Info

Publication number: JP5900486B2
Application number: JP2013504557A
Authority: JP
Inventors: 由希子黒岩
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2011-03-16
Filing date: 2012-03-09
Publication date: 2016-04-06
Anticipated expiration: 2032-03-09
Also published as: JPWO2012124301A1; WO2012124301A1

Description

本発明は、関連する仕様同士を対応付ける関連仕様対応付けシステム、関連仕様対応付け方法および関連仕様対応付けプログラムに関し、特に、異なるテキストに含まれている仕様同士を対応付ける関連仕様対応付けシステム、関連仕様対応付け方法および関連仕様対応付けプログラムに関する。 The present invention relates to a related specification associating system, a related specification associating method, and a related specification associating program for associating related specifications with each other, and in particular, a related specification associating system and related specifications associating specifications included in different texts. The present invention relates to a correlation method and a related specification correlation program.

システムやソフトウェアの開発において、要求、設計、テスト等の各工程内や各工程間の仕様が定められる。 In the development of systems and software, specifications within each process and between processes such as requirements, design, and testing are determined.

ソフトウェアの一部を変更したり、仕様を変更したりした場合における影響を管理するための技術が種々提案されている（例えば、特許文献１，２参照）。 Various techniques for managing the influence of changing part of software or changing specifications have been proposed (see, for example, Patent Documents 1 and 2).

特許文献１には、ソフトウェアの一部を修正した場合にその修正の影響が広い範囲に及ぶことを防ぎ、その修正が他に影響を及ぼす範囲を細かく抽出するためのソフトウェア開発システムが記載されている。特許文献１に記載のソフトウェア開発システムでは、各工程のドキュメントに第１のキーワードと第２のキーワードが付され、第１のキーワードに基づいて自工程内のドキュメントが記憶され、第２のキーワードに基づいて自工程内のドキュメントに対する修正の影響がおよぶ上位工程のドキュメントが記憶される。また、特許文献１に記載のソフトウェア開発システムは、各工程内のドキュメントを工程別に記憶し、各工程内のドキュメントとその工程の下位工程内のドキュメントの関係を定義する。そして、いずれかの工程内のドキュメントが修正されたときに、そのドキュメントに付与された第２のキーワードに基づいて上位工程内のドキュメントを検索する。 Patent Literature 1 describes a software development system for preventing the influence of a modification from extending over a wide range when a part of the software is modified, and extracting the range in which the modification affects the other. Yes. In the software development system described in Patent Document 1, the first keyword and the second keyword are attached to the document of each process, the document in the own process is stored based on the first keyword, and the second keyword is stored in the second keyword. Based on the document, the document of the higher-order process that is affected by the modification to the document in the own process is stored. In addition, the software development system described in Patent Document 1 stores documents in each process for each process, and defines a relationship between a document in each process and a document in a lower process of the process. Then, when a document in any process is modified, the document in the upper process is searched based on the second keyword assigned to the document.

特許文献２には、ソフトウェア開発において仕様変更が発生した際に仕様変更がもたらすチェックリストおよびソースコードへの影響範囲を抽出する影響範囲抽出システムが記載されている。特許文献２に記載されたシステムでは、データ項目と対応した各プログラム言語用の項目名が登録されているデータ項目辞書を用いられる。 Patent Document 2 describes an influence range extraction system that extracts an influence range on a check list and source code caused by a specification change when the specification change occurs in software development. In the system described in Patent Document 2, a data item dictionary in which item names for each program language corresponding to data items are registered is used.

また、特許文献３には、曖昧な記載を排除したドキュメントを利用できるようにするためのデータ処理装置が記載されている。特許文献３に記載されたシステムでは、ドキュメントにおいて、他のドキュメントでも共通の意味を持つ定義語として用いられる状態定義語が記載される。そして、キーワード検索部が状態定義語をキーワードにして階層キーワード検索を行う。また、特許文献３には、上位層のキーワードと下位層のキーワードを結ぶことでリンクを定める。 Further, Patent Document 3 describes a data processing device for making it possible to use a document from which ambiguous description is excluded. In the system described in Patent Document 3, a state definition word used as a definition word having a common meaning in other documents is described in a document. Then, the keyword search unit performs a hierarchical keyword search using the state definition word as a keyword. In Patent Document 3, a link is defined by connecting an upper layer keyword and a lower layer keyword.

また、特許文献４には、文字データとドットマップイメージデータの混在するデータを、所与のサイズ以下の圧縮されたドットマップデータに変換することが記載されている。 Patent Document 4 describes that data in which character data and dot map image data are mixed is converted into compressed dot map data having a predetermined size or less.

また、特許文献５には、実行回数がしきい値を超えた命令列の関数呼び出し命令をインライン展開することが記載されている。 Japanese Patent Application Laid-Open No. H10-228667 describes inline expansion of a function call instruction of an instruction sequence whose execution count exceeds a threshold value.

特開２０１０−３９７５１号公報（段落００１０，００１１）JP 2010-39551 (paragraphs 0010, 0011) 特開２００７−１２８１２３号公報（段落０００６）JP 2007-128123 A (paragraph 0006) 特開２０１０−１０８３４６号公報（段落００１１，００２１，００２５，００３１）JP 2010-108346 A (paragraphs 0011, 0021, 0025, 0031) 特開２００６−１２０１４５号公報（段落００１３，００３３，００３９）JP 2006-120145 A (paragraphs 0013, 0033, 0039) 特開２００７−１８２５４号公報（段落０００２，０００３，００１６，００４４）JP 2007-18254 (paragraphs 0002, 0003, 0016, 0044)

システムやソフトウェアの開発における工程内や工程間の開発成果物を関連付けておけば、仕様の後工程への反映漏れ検出や整合性確認等に活用することができる。また、仕様の変更時の影響範囲の把握に活用することができる。より具体的には、複数の仕様を含むテキスト間において、関連する仕様同士を対応付けておくことにより、上記のように、仕様の後工程への反映漏れ検出、整合性確認、仕様変更時の影響範囲の把握等に活用することができる。 By associating development products in and between processes in the development of systems and software, it can be used for detection of reflection omission in the subsequent process of specifications and confirmation of consistency. It can also be used to understand the scope of influence when changing specifications. More specifically, by associating related specifications between texts containing multiple specifications, as described above, detection of reflection reflection to the subsequent process of specifications, consistency check, and specification change It can be used to understand the scope of influence.

特許文献１に記載された発明は、ソフトウェアの一部を修正した場合にその修正の影響が広い範囲に及ぶことを防ぎ、その修正が他に影響を及ぼす範囲を細かく抽出することができる。しかし、予めドキュメントにキーワードを付す作業を行わなくてはならなかった。また、各工程内のドキュメントとその工程の下位工程内のドキュメントの関係を定義する必要があった。これらの作業を人手で行うことになるので、これらの作業に時間がかかってしまっていた。 The invention described in Patent Document 1 can prevent the influence of the modification from extending over a wide range when a part of the software is modified, and can extract the range in which the modification affects the other. However, the task of attaching keywords to the document had to be performed in advance. In addition, it is necessary to define a relationship between a document in each process and a document in a lower process of the process. Since these operations are performed manually, these operations take time.

特許文献２に記載された発明でも、ソフトウェア開発において仕様変更が発生した際に仕様変更がもたらすソースコード等への影響範囲を抽出することができる。しかし、データ項目と対応した各プログラム言語用の項目名を登録したデータ項目辞書を用意しなければならなかった。この作業も人手で行うことになるので、この作業に時間がかかってしまっていた。 Even in the invention described in Patent Document 2, it is possible to extract a range of influence on a source code or the like caused by a specification change when the specification change occurs in software development. However, it has been necessary to prepare a data item dictionary in which item names for each programming language corresponding to the data items are registered. Since this work is also done manually, this work took time.

また、特許文献３に記載された発明においても、ドキュメント内に状態定義語を定めておく必要があるが、この作業も人手で行うことになるので、この作業に時間がかかってしまっていた。 In the invention described in Patent Document 3, it is necessary to define a state definition word in a document. However, since this operation is also performed manually, this operation takes time.

そこで、人手による作業をできるだけ少なくして、複数の仕様を含むテキスト間において、関連する仕様同士を対応付けることができる関連仕様対応付けシステム、関連仕様対応付け方法および関連仕様対応付けプログラムを提供することを目的とする。 To provide a related specification associating system, a related specification associating method, and a related specification associating program capable of associating related specifications between texts including a plurality of specifications with a minimum of manual work. With the goal.

本発明による関連仕様対応付けシステムは、２つのテキストから２つの仕様の組を選択し、２つの仕様に共通に出現する共通文字列に基づいて、その２つの仕様の関連性の有無を判定する関連性有無判定手段と、２つの仕様に関連性があると判定される場合には、２つの仕様それぞれの可逆圧縮サイズと、２つの仕様の文字列を連結した文字列の可逆圧縮サイズとに基づいて、関連性値を算出する関連性計算手段と、２つの仕様を対応付けるか否かを判定する仕様対応付け手段とを備えることを特徴とする。 Related specification associating system according to the present invention selects two text or et two specifications set, based on common string that appears in common to the two specifications, the presence or absence of association of the two specifications and relevant presence determining means for determining, if it is determined to be relevant to the two specifications, the two specifications of each lossless compressed size, lossless compression of concatenated string of text of two specifications based on the size, the relevance calculating means for calculating a relevance value, characterized in that it comprises a determining specifications correlating means whether associating two specifications.

また、本発明による関連仕様対応付け方法は、２つのテキストから２つの仕様の組を選択し、２つの仕様に共通に出現する共通文字列に基づいて、その２つの仕様の関連性の有無を判定し、２つの仕様に関連性があると判定される場合には、２つの仕様それぞれの可逆圧縮サイズと、２つの仕様の文字列を連結した文字列の可逆圧縮サイズとに基づいて、関連性値を算出し、２つの仕様を対応付けるか否かを判定することを特徴とする。 The related specifications associating method according to the present invention selects two sets of text or et two specifications, based on common string that appears in common to the two specifications, the relevance of the two specifications determines the presence of, if it is determined to be relevant to the two specifications, based on each of the lossless compression size two specifications, a lossless compression size of the string of string concatenation of two specifications Te, it calculates a relevance value, and wherein the determining whether associating two specifications.

また、本発明による関連仕様対応付けプログラムは、コンピュータに、２つのテキストから２つの仕様の組を選択し、２つの仕様に共通に出現する共通文字列に基づいて、その２つの仕様の関連性の有無を判定する関連性有無判定処理、２つの仕様に関連性があると判定される場合には、２つの仕様それぞれの可逆圧縮サイズと、２つの仕様の文字列を連結した文字列の可逆圧縮サイズとに基づいて、関連性値を算出する関連性計算処理、および、２つの仕様を対応付けるか否かを判定する仕様対応付け処理を実行させることを特徴とする。 The related specifications correspondence program according to the present invention, the computer selects two sets of text or et two specifications, based on common string that appears in common to the two specifications, two specifications that character relevance relevance existence determining process determines the presence or absence of, if it is determined that there is a relationship between the two specifications, the two specifications of each lossless compressed size, the concatenation of the strings of the two specifications based on the lossless compression column size, relevance calculating process of calculating a relevance value, and, characterized in that to execute a determining specifications associating process whether associating two specifications.

本発明によれば、人手による作業をできるだけ少なくして、複数の仕様を含むテキスト間において、関連する仕様同士を対応付けることができる。 According to the present invention, manual specifications can be reduced as much as possible, and related specifications can be associated with each other between texts including a plurality of specifications.

本発明の関連仕様対応付けシステムの例を示すブロック図である。It is a block diagram which shows the example of the related specification matching system of this invention. テキストで記述された仕様の例を示す説明図である。It is explanatory drawing which shows the example of the specification described in the text. 図で記述された仕様の例を示す説明図である。It is explanatory drawing which shows the example of the specification described by the figure. 初期仕様記憶部が記憶する初期仕様の例を示す説明図である。It is explanatory drawing which shows the example of the initial specification which an initial specification memory | storage part memorize | stores. 変換語句記憶部が記憶する情報の例を示す説明図である。It is explanatory drawing which shows the example of the information which a conversion word memory | storage part memorize | stores. 変換仕様記憶部が記憶する変換仕様の例を示す説明図である。It is explanatory drawing which shows the example of the conversion specification which a conversion specification memory | storage part memorize | stores. リンク記憶部に記憶されたリンクの例を示す説明図である。It is explanatory drawing which shows the example of the link memorize | stored in the link memory | storage part. 本実施形態の処理経過の例を示すフローチャートである。It is a flowchart which shows the example of the process progress of this embodiment. リンク生成部によるリンク生成結果の例を模式的に示す説明図である。It is explanatory drawing which shows the example of the link production | generation result by a link production | generation part typically. ステップＡ２，Ａ３の詳細を示すフローチャートである。It is a flowchart which shows the detail of step A2, A3. リンク補完部による補完動作の例を示すフローチャートである。It is a flowchart which shows the example of the complementation operation | movement by a link complementation part. リンク記憶部に記憶されたリンクの例を示す説明図である。It is explanatory drawing which shows the example of the link memorize | stored in the link memory | storage part. 補完後のリンクの例を模式的に示す説明図である。It is explanatory drawing which shows the example of the link after a complementation typically. 処理の終了時にリンク記憶部に記憶されているリンクの例を示す説明図である。It is explanatory drawing which shows the example of the link memorize | stored in the link memory | storage part at the time of the completion | finish of a process. 本発明の関連仕様対応付けシステムの最小構成の例を示す説明図である。It is explanatory drawing which shows the example of the minimum structure of the related specification matching system of this invention.

以下、本発明の実施形態を図面を参照して説明する。
本発明の関連仕様対応付けシステムは、複数の仕様を含むテキスト間において、関連する仕様同士を対応付ける。このテキストは、同一文書内のテキストであっても、異なる文書のテキストであってもよい。以下の説明では、複数の仕様を含む文書間において、関連する仕様同士を対応付ける場合を例に説明し、テキストを識別するためのＩＤとして、文書ＩＤを用いる。同一文書内の異なるテキスト間での対応付けを行う場合には、各テキストを識別するテキストＩＤを用いればよい。また、仕様としては、例えば、開発対象に要求される要件や、設計要素に関する仕様、テスト項目に関する仕様等が挙げられるが、他の事項に関する仕様であってもよい。Hereinafter, embodiments of the present invention will be described with reference to the drawings.
The related specification associating system of the present invention associates related specifications between texts including a plurality of specifications. This text may be in the same document or in a different document. In the following description, a case in which related specifications are associated with each other between documents including a plurality of specifications will be described as an example, and a document ID is used as an ID for identifying text. When matching between different texts in the same document, a text ID for identifying each text may be used. The specifications include, for example, requirements required for the development target, specifications regarding design elements, specifications regarding test items, and the like, but specifications regarding other matters may also be used.

以下、複数の仕様を含むテキスト間における、関連する仕様同士の対応付けを、トレーサビリティリンク（あるいは、単にリンク）と記す場合がある。また、そのような対応付けを行うことを、「トレーサビリティリンク（あるいは、単にリンク）を生成する」と記す場合がある。 Hereinafter, association between related specifications between texts including a plurality of specifications may be referred to as a traceability link (or simply a link). In addition, performing such association may be described as “generating a traceability link (or simply link)”.

図１は、本発明の関連仕様対応付けシステムの例を示すブロック図である。本発明の関連仕様対応付けシステムは、情報を記憶する記憶部１１と、プログラム制御により動作する計算部１２とを備える。 FIG. 1 is a block diagram showing an example of a related specification association system of the present invention. The related specification associating system of the present invention includes a storage unit 11 that stores information and a calculation unit 12 that operates under program control.

記憶部１１は、初期仕様記憶部１００と、変換語句記憶部１０１と、変換仕様記憶部１０２と、リンク記憶部１０３とを含む。 The storage unit 11 includes an initial specification storage unit 100, a conversion phrase storage unit 101, a conversion specification storage unit 102, and a link storage unit 103.

初期仕様記憶部１００は、仕様の内容を示すテキストを初期仕様として予め記憶する。 The initial specification storage unit 100 stores in advance text indicating the content of the specification as an initial specification.

一般に、仕様は、テキストで記述されたり、あるいは、業務フロー図等の図で記述されたりする。図２は、テキストで記述された仕様の例を示す説明図である。図２に示す例では、１つの仕様が１行で示されている。図３は、図で記述された仕様の例を示す説明図である。図３では、業務フロー図で記述された仕様が示されており、１つの仕様が、１つの箱や、複数の箱とその箱間の矢印で表されている。 In general, specifications are described in text or in a diagram such as a business flow diagram. FIG. 2 is an explanatory diagram showing an example of specifications described in text. In the example shown in FIG. 2, one specification is shown in one line. FIG. 3 is an explanatory diagram illustrating an example of specifications described in the figure. In FIG. 3, the specifications described in the business flow diagram are shown, and one specification is represented by one box or a plurality of boxes and arrows between the boxes.

初期仕様記憶部１００は、仕様の内容を表すテキストを記憶するので、図２に例示する仕様については、テキスト形式への書き換えを行う必要はなく、初期仕様記憶部１００に記憶させればよい。一方、図３に例示する仕様については、テキスト形式への書き換えを行った上で、初期仕様記憶部１００に記憶させればよい。 Since the initial specification storage unit 100 stores text representing the contents of the specification, the specification illustrated in FIG. 2 need not be rewritten into the text format and may be stored in the initial specification storage unit 100. On the other hand, the specification illustrated in FIG. 3 may be stored in the initial specification storage unit 100 after being rewritten into a text format.

図４は、初期仕様記憶部１００が記憶する初期仕様の例を示す説明図である。個々の仕様には、その仕様が属するテキスト（本例では文書とする。）を識別するテキストＩＤ（本例では、文書ＩＤ）が割り当てられる。また、個々の仕様には、文書内で登場する順に、仕様ＩＤとなる番号が割り当てられる。なお、仕様を表す図（例えば、図３参照）において、個々の仕様は、左上から右下の順に順番付けられて記述されることが一般的である。従って、このような図で表された個々の仕様をテキスト形式に変換した場合、元の図において左上に表されていた仕様ほど、仕様ＩＤとして小さな番号を割り当て、元の図において右下に表されていた仕様ほど、仕様ＩＤとして大きな番号を割り当てればよい。 FIG. 4 is an explanatory diagram illustrating an example of initial specifications stored in the initial specification storage unit 100. Each specification is assigned a text ID (document ID in this example) that identifies the text (in this example, a document) to which the specification belongs. Also, each specification is assigned a number that is a specification ID in the order in which it appears in the document. It should be noted that in the diagram showing the specifications (for example, see FIG. 3), the individual specifications are generally described in order from the upper left to the lower right. Therefore, when individual specifications shown in such a diagram are converted to text format, the specification shown in the upper left in the original diagram is assigned a smaller number as the specification ID, and is displayed in the lower right in the original diagram. A larger number may be assigned as a specification ID for a specification that has been made.

以下、仕様ＩＤがｘである仕様を、単に、仕様ｘと記す。 Hereinafter, a specification whose specification ID is x is simply referred to as a specification x.

図４において、文書ＩＤが０である文書（以下、文書０と記す。）の初期仕様は、図３に示す業務フロー図からアクタと仕様内容を示す文字列を抜き出して仕様として記述している。具体的には、業務フロー図内のアクタと、業務フロー図内で仕様内容を表す文字列とは、所定の記号（本例では「：」）で区切って記述されている。また、分岐条件は「→」で記述されている。 In FIG. 4, the initial specification of a document whose document ID is 0 (hereinafter referred to as document 0) is described as a specification by extracting a character string indicating an actor and specification contents from the business flow diagram shown in FIG. . Specifically, the actor in the workflow diagram and the character string representing the specification content in the workflow diagram are described by being separated by a predetermined symbol (“:” in this example). The branch condition is described by “→”.

図４において、文書ＩＤが１である文書（以下、文書１と記す。）の初期仕様は、図２に示す仕様と同一である。 In FIG. 4, the initial specification of a document whose document ID is 1 (hereinafter referred to as document 1) is the same as the specification shown in FIG.

以下、元々図で記述された仕様（文書０の仕様）と、テキスト形式で表された仕様（文書１の仕様）との間でリンクを生成する場合を例に説明するが、元々図で記述された仕様同士の間でリンクを生成してもよく、また、テキスト形式で表された仕様同士の間でリンクを生成してもよい。 In the following, a case where a link is generated between the specification originally described in the drawing (specification of document 0) and the specification expressed in the text format (specification of document 1) will be described as an example. A link may be generated between the specified specifications, or a link may be generated between the specifications expressed in the text format.

変換語句記憶部１０１は、初期仕様（図４参照）として記述された文字列を変換するための変換語句を予め記憶する。具体的には、変換語句記憶部１０１は、変換前の語句と、変換後の語句とを対応付けて記憶する。例えば、初期仕様に記載された同義の語句が、同一の語句で統一されるように、変換前の語句と、変換後の語句とが対応付けられている。なお、語句には、句読点や、初期仕様で用いられる記号（本例では、「：」や「→」）も含まれる。変換後の語句が定められていない場合には、変換前の語句を削除することを意味するものとする。 The conversion phrase storage unit 101 stores in advance a conversion phrase for converting a character string described as an initial specification (see FIG. 4). Specifically, the converted phrase storage unit 101 stores the phrase before conversion and the converted phrase in association with each other. For example, a word before conversion and a word after conversion are associated with each other so that synonymous words described in the initial specification are unified with the same word. Note that the phrase also includes punctuation marks and symbols used in the initial specification (in this example, “:” and “→”). If the word after conversion is not defined, it means that the word before conversion is deleted.

図５は、変換語句記憶部１０１が記憶する情報の例を示す。図５において、１行分の情報は、１つ以上の変換前の語句と、１つの変換後の語句との対応関係を表している。図５に示す例では、「発注」を「注文」に変換すること等が表されている。また、図５に示す例では、同一の語句に変換される、複数の変換前の語句の区切りに記号「，」が用いられている。例えば、変換前の語句として、「，」を挟んで記述された「行う。」、「される。」、「する。」、「ものとする。」という各語句は、いずれも、句点「。」に変換されることを表している。 FIG. 5 shows an example of information stored in the conversion word storage unit 101. In FIG. 5, information for one line represents a correspondence relationship between one or more pre-conversion words and one post-conversion word / phrase. In the example shown in FIG. 5, “ordering” is converted to “ordering” and the like. In the example shown in FIG. 5, the symbol “,” is used to delimit a plurality of pre-conversion words that are converted to the same word. For example, each of the words “do”, “be done”, “do”, and “to be” written with “,” as a word before conversion is a punctuation mark “.”. "Is converted to" ".

変換語句記憶部１０１が記憶する情報は、初期仕様内の語句を変換する変換ルールであるということができる。 It can be said that the information stored in the conversion phrase storage unit 101 is a conversion rule for converting a phrase in the initial specification.

汎用の同義語辞書を利用して、変換前後の語句の組合せを追加してもよい。例えば、汎用の同義語辞書であるＷｏｒｄＮｅｔでは、「注文」、「発注」等の語句が同義セットになっているので、同義セットに属するいずれかの語句を変換後の語句とし、その同義セットに属する他の語句を変換前の語句として定めればよい。 A combination of words before and after conversion may be added using a general-purpose synonym dictionary. For example, in WordNet, which is a general-purpose synonym dictionary, terms such as “order” and “ordering” are synonymous sets. Therefore, any of the terms belonging to the synonym set is used as a post-conversion term, and the synonym set is included in the synonym set. What is necessary is just to define the other words to which it belongs as a word before conversion.

また、図５に示す例では、「ここで、」に対応する変換後の語句が空欄になっていて、定められていない。このことは、初期仕様内の語句「ここで、」を削除することを意味する。このように、語句を削除する場合には、その語句を変換前の語句として定め、変換後の語句を定めておかなければよい。 In the example shown in FIG. 5, the converted word / phrase corresponding to “Here” is blank and is not defined. This means that the phrase “here” in the initial specification is deleted. Thus, when deleting a phrase, it is sufficient that the phrase is defined as a phrase before conversion and the phrase after conversion is not defined.

変換語句記憶部１０１には、どのような初期仕様を用いるかに関わらない汎用の同義語辞書を用いて、変換前後の語句の対応関係を記憶させたり、表現の簡略化のための変換前後の語句の対応関係を記憶させたりすればよい。そして、業務用語集等の業界・業務に特有の同義語句を追加していけばよい。そのように、変換語句記憶部１０１に記憶させる語句を定めていけば、人手や時間をかけずに、変換語句記憶部１０１の記憶内容を用意することができる。 The conversion phrase storage unit 101 uses a general-purpose synonym dictionary regardless of what initial specification is used, stores the correspondence between words before and after conversion, and before and after conversion for simplification of expression. What is necessary is just to memorize | store the correspondence of a phrase. Then, synonyms specific to the industry / business such as a business glossary may be added. In this way, if the phrases to be stored in the conversion phrase storage unit 101 are determined, the storage contents of the conversion phrase storage unit 101 can be prepared without manpower and time.

変換仕様記憶部１０２は、初期仕様記憶部１００に記憶された初期仕様を変換語句記憶部１０１に記憶された変換ルールを用いて変換した結果（以下、変換仕様と記す。）を記憶する。図６は、変換仕様記憶部１０２が記憶する変換仕様の例を示す説明図である。図６では、図４に示す初期仕様を、図５に示す変換ルールに従って変換して得られた変換仕様が例示されている。 The conversion specification storage unit 102 stores a result of conversion of the initial specification stored in the initial specification storage unit 100 using the conversion rule stored in the conversion word storage unit 101 (hereinafter referred to as conversion specification). FIG. 6 is an explanatory diagram illustrating an example of conversion specifications stored in the conversion specification storage unit 102. FIG. 6 illustrates a conversion specification obtained by converting the initial specification shown in FIG. 4 according to the conversion rule shown in FIG.

リンク記憶部１０３は、仕様間のリンクを記憶する。すなわち、リンク記憶部１０３は、複数の仕様を含むテキスト間（本例では、文書間）で、関連する仕様同士の対応関係を記憶する。図７は、リンク記憶部１０３に記憶されたリンクの例を示す説明図である。図７では、例えば、「文書０」の「仕様０」と「文書１」の「仕様０」との間や、「文書０」の「仕様２」と「文書１」の「仕様３」との間等にリンクが生成されたことが表されている。また、図７に示す例では、リンク毎に、リンク信頼度が記憶されている。リンク信頼度は、リンク（仕様同士の対応付け）の適切さに関する信頼度を表している。例えば、リンクがリンク生成部１０７に生成された場合には、リンク信頼度が高いことを意味する情報（図７では「○」）を記憶させ、リンク補完部１０８によってリンク候補と推定された場合には、リンク生成部１０７に生成されたリンクよりもリンク信頼度が低いことを意味する情報（例えば、「△」とする。図７において図示せず。）を記憶させればよい。 The link storage unit 103 stores a link between specifications. That is, the link storage unit 103 stores a correspondence relationship between related specifications between texts (in this example, between documents) including a plurality of specifications. FIG. 7 is an explanatory diagram illustrating an example of a link stored in the link storage unit 103. In FIG. 7, for example, between “specification 0” of “document 0” and “specification 0” of “document 1”, “specification 2” of “document 0”, and “specification 3” of “document 1”. It is shown that a link is generated between the two. In the example shown in FIG. 7, the link reliability is stored for each link. The link reliability represents the reliability related to the appropriateness of the link (association between specifications). For example, when a link is generated in the link generation unit 107, information indicating that the link reliability is high (“◯” in FIG. 7) is stored, and the link complementing unit 108 estimates the link candidate The link generation unit 107 may store information (for example, “Δ”, not shown in FIG. 7) indicating that the link reliability is lower than that of the link generated.

計算部１２は、語句変換部１０４と、関連性有無判定部１０５と、関連性計算部１０６と、リンク生成部１０７と、リンク補完部１０８とを含む。 The calculation unit 12 includes a phrase conversion unit 104, an association presence / absence determination unit 105, an association calculation unit 106, a link generation unit 107, and a link complementation unit 108.

語句変換部１０４は、初期仕様記憶部１００に記憶された初期仕様を変換語句記憶部１０１に記憶された変換ルールに従って変換する。すなわち、語句変換部１０４は、初期仕様の中に、変換語句記憶部１０１に記憶された変換前の語句に該当する文字列があれば、その文字列を、その変換前の語句に対応する変換後の語句に変換する。語句変換部１０４は、初期仕様を変換した結果（変換仕様）を変換仕様記憶部１０２に記憶させる。図５に示す変換ルールに従って、語句変換部１０４が、図４に示す初期仕様を変換すると、図６に示す変換仕様が得られる。 The phrase conversion unit 104 converts the initial specification stored in the initial specification storage unit 100 according to the conversion rule stored in the conversion word storage unit 101. That is, if there is a character string corresponding to the word before conversion stored in the conversion word storage unit 101 in the initial specification, the word conversion unit 104 converts the character string into the conversion corresponding to the word before conversion. Convert to a later phrase. The phrase conversion unit 104 causes the conversion specification storage unit 102 to store the result of converting the initial specification (conversion specification). When the phrase conversion unit 104 converts the initial specification shown in FIG. 4 according to the conversion rule shown in FIG. 5, the conversion specification shown in FIG. 6 is obtained.

関連性有無判定部１０５は、変換仕様記憶部１０２に記憶された変換仕様に対して、文書ＩＤが互いに異なる２つの変換仕様毎に、共通部分列を抽出し、その共通部分列に基づいて、２つの変換仕様間の関連性の有無を判定する。ここで、共通部分列は、予め定められた長さの文字列であって、２つの仕様に共通に出現している文字列である。ひらがなは共通部分列に含めない等の条件が定められていてもよい。この条件は例示であり、他の条件が定められていてもよい。また、条件とともに例外が指定可能であってもよい。例えば、ひらがなは共通部分列に含めないという条件とともに、「ない」等の特定のひらがなに関しては「共通部分列」として抽出するという例外を指定可能であってもよい。あるいは、共通部分列に関する条件が定められていなくてもよい。 The relevance determination unit 105 extracts a common partial sequence for each of two conversion specifications having different document IDs from the conversion specification stored in the conversion specification storage unit 102, and based on the common partial sequence, The presence / absence of relevance between the two conversion specifications is determined. Here, the common partial sequence is a character string having a predetermined length, and is a character string that appears in common in the two specifications. Conditions such as not including hiragana in the common subsequence may be defined. This condition is an example, and other conditions may be set. An exception may be specified together with a condition. For example, in addition to the condition that hiragana characters are not included in the common subsequence, it may be possible to specify an exception that a specific hiragana such as “none” is extracted as “common subsequence”. Or the conditions regarding a common partial row | line | column do not need to be defined.

関連性有無判定部１０５は、変換ルールによる変換が行われた２つの文書（図６における文書０および文書１）から、互いに異なる文書に属する２つの変換仕様の組をそれぞれ選択すればよい。そして、関連性有無判定部１０５は、選択した組毎に、共通部分列を抽出し、その共通部分列に基づいて、２つの変換仕様間の関連性の有無を判定すればよい。 The relevance determination unit 105 may select two conversion specification groups belonging to different documents from the two documents (document 0 and document 1 in FIG. 6) that have been converted according to the conversion rule. Then, the relevance determination unit 105 may extract a common partial sequence for each selected set, and determine whether there is relevance between the two conversion specifications based on the common partial sequence.

関連性計算部１０６は、文書ＩＤが互いに異なる２つの変換仕様毎に、仕様間の関連性の強さを示す値（以下、関連性値と記す。）を計算する。 The relevancy calculation unit 106 calculates a value indicating the strength of relevance between specifications (hereinafter referred to as relevance value) for each of two conversion specifications having different document IDs.

関連性有無判定部１０５による関連性の有無の判定処理や、関連性計算部１０６による関連性値の計算処理については、後述する。 The determination process of the presence / absence of relevance by the relevance presence / absence determination unit 105 and the relevance value calculation process by the relevance calculation unit 106 will be described later.

リンク生成部１０７は、変換仕様記憶部１０２に記憶された変換仕様に対して計算された変換仕様間の関連性値に基づいて、文書ＩＤが互いに異なる変換仕様の間のリンクを生成する。すなわち、リンク生成部１０７は、文書ＩＤが互いに異なる変換仕様同士を対応付ける。例えば、２つの変換仕様間の関連性値が予め定められた第一の閾値以上であれば、リンク生成部１０７は、その２つの変換仕様にリンクを生成すべきと推定し、リンクを生成する。第一の閾値は、例えば、関連仕様対応付けシステムのユーザがキーボード等の入力手段（図示略）を介して入力することによって設定されてもよい。また、例えば、予め関連仕様対応付けシステムに第一の閾値のデフォルト値を記憶させておいてもよい。これらは例示であり、第一の閾値の設定方法は、これらの方法に限定されない。第一の閾値は、例えば、０．６等の値に設定すればよいが、この値も例示であり、第一の閾値は０．６以外の値であってもよい。 The link generation unit 107 generates a link between conversion specifications having different document IDs based on the relevance value between the conversion specifications calculated for the conversion specifications stored in the conversion specification storage unit 102. That is, the link generation unit 107 associates conversion specifications with different document IDs. For example, if the relevance value between two conversion specifications is equal to or greater than a predetermined first threshold, the link generation unit 107 estimates that a link should be generated for the two conversion specifications, and generates a link. . The first threshold may be set, for example, by a user of the related specification association system inputting via an input means (not shown) such as a keyboard. For example, the default value of the first threshold value may be stored in advance in the related specification association system. These are examples, and the first threshold setting method is not limited to these methods. The first threshold value may be set to a value such as 0.6, for example, but this value is also an example, and the first threshold value may be a value other than 0.6.

リンク補完部１０８は、リンク生成部１０７でリンクが生成されなかった変換仕様に関して、文書内におけるその変換仕様の順番に着目し、順番がその変換仕様の順番よりも先になっている変換仕様のリンク生成状況、および、順番がその変換仕様の順番よりも後になっている変換仕様のリンク生成状況に基づいて、リンクを補完するか否かを判定する。そして、リンク補完部１０８は、リンクを補完すると判定した変換仕様間にリンクを補完する。ここで、リンクを補完するとは、リンク生成部１０７によるリンク生成処理後において未だリンクが生成されていない変換仕様にリンクを生成することである。換言すれば、他の変換仕様と対応付けられなかった変換仕様に関して、他の変換仕様との対応付けを行うことである。 The link complementing unit 108 pays attention to the order of the conversion specifications in the document with respect to the conversion specifications for which no link has been generated by the link generation unit 107, and the conversion specifications whose order precedes the order of the conversion specifications. Whether or not to complement the link is determined based on the link generation status and the link generation status of the conversion specifications whose order is later than the order of the conversion specifications. Then, the link complementation unit 108 complements the link between the conversion specifications determined to complement the link. Here, complementing a link means generating a link in a conversion specification for which a link has not yet been generated after the link generation processing by the link generation unit 107. In other words, the conversion specifications that are not associated with other conversion specifications are associated with other conversion specifications.

文書内における仕様の順番は、仕様ＩＤによって表される。例えば、図６に例示する変換仕様において、文書０の仕様１に着目した場合、順番が仕様１よりも先になっている変換仕様は、文書０の仕様０であり、順番が仕様１よりも後になっている変換仕様は、文書０の仕様２、仕様３、仕様４である。なお、文書ＩＤが異なる仕様間では、仕様の順番は定められない。よって、文書０の仕様１に着目した場合、リンク補完部１０８は、文書０の仕様０のリンク生成状況や、文書０の仕様２、仕様３、仕様４のリンク生成状況等に基づいて、文書０の仕様１と、文書１の仕様との間でリンクを補完するか否かを判断する。 The order of specifications in the document is represented by a specification ID. For example, in the conversion specification illustrated in FIG. 6, when focusing on the specification 1 of the document 0, the conversion specification whose order precedes the specification 1 is the specification 0 of the document 0 and the order is higher than the specification 1. The conversion specifications that follow are specification 2, specification 3, and specification 4 of document 0. Note that the specification order is not determined between specifications with different document IDs. Therefore, when attention is focused on the specification 1 of the document 0, the link complementation unit 108 determines whether the document 0 is based on the link generation status of the specification 0 of the document 0, the link generation status of the specification 2, the specification 3, and the specification 4 of the document 0. It is determined whether or not the link is complemented between the specification 1 of 0 and the specification of the document 1.

次に、動作について説明する。
図８は、本実施形態の処理経過の例を示すフローチャートである。まず、語句変換部１０４は、初期仕様記憶部１００に記憶された初期仕様を１つずつ読み込み、変換語句記憶部１０１に記憶された変換ルールに従って、読み込んだ初期仕様を変換する。すなわち、読み込んだ初期仕様内に、変換語句記憶部１０１に記憶された変換前の語句に該当する文字列があれば、その文字列を、その変換前の語句に対応する変換後の語句に変換する。例えば、読み込んだ初期仕様内に「発注」という文字列があれば、その文字列を「注文」に変換したり、読み込んだ初期仕様内に「受け付ける」という文字列があれば、その文字列を「受容」に変換したりする。語句変換部１０４は、この変換によって得られた変換仕様を変換仕様記憶部１０２に記憶させる。語句変換部１０４は、初期仕様記憶部１００に記憶された全ての初期仕様にこの処理を行う（ステップＡ１）。Next, the operation will be described.
FIG. 8 is a flowchart illustrating an example of processing progress of the present embodiment. First, the phrase conversion unit 104 reads the initial specifications stored in the initial specification storage unit 100 one by one, and converts the read initial specifications according to the conversion rules stored in the conversion word storage unit 101. That is, if there is a character string corresponding to the word before conversion stored in the conversion word storage unit 101 in the read initial specification, the character string is converted into a word after conversion corresponding to the word before conversion. To do. For example, if there is a character string “Purchase” in the read initial specification, the character string is converted to “Order”, or if there is a character string “Accept” in the read initial specification, the character string is Or convert it to “acceptance”. The phrase conversion unit 104 stores the conversion specification obtained by this conversion in the conversion specification storage unit 102. The phrase conversion unit 104 performs this process on all initial specifications stored in the initial specification storage unit 100 (step A1).

次に、関連性有無判定部１０５は、変換仕様記憶部１０２に記憶された変換仕様に対して、文書ＩＤが互いに異なる２つの変換仕様毎に、２つの変換仕様間の関連性の有無を判定する（ステップＡ２）。次に、関連性計算部１０６が、文書ＩＤが互いに異なる２つの変換仕様毎に関連性値を計算する（ステップＡ３）。ステップＡ２，Ａ３の処理の詳細については、後述する。 Next, the relevance presence / absence determination unit 105 determines the presence / absence of relevance between the two conversion specifications for each of the two conversion specifications having different document IDs with respect to the conversion specification stored in the conversion specification storage unit 102. (Step A2). Next, the relevancy calculation unit 106 calculates a relevance value for each of two conversion specifications having different document IDs (step A3). Details of the processes of steps A2 and A3 will be described later.

次に、リンク生成部１０７は、文書ＩＤが互いに異なる２つの変換仕様毎に計算された全ての関連性値に関して、第一の閾値との比較（後述のステップＡ５）が完了したか否かを判定する（ステップＡ４）。全ての関連性値に関して第一の閾値との比較が完了していなければ（ステップＡ４におけるＮｏ）、リンク生成部１０７は、文書ＩＤが互いに異なる２つの変換仕様毎に計算された各関連性値のうち、未選択の関連性値を一つ選択し、その関連性値が、第一の閾値以上であるか否かを判定する（ステップＡ５）。選択した関連性値が第一の閾値以上であるならば（ステップＡ５におけるＹｅｓ）、リンク生成部１０７は、その関連性値の算出元となった２つの変換仕様の間にリンクを生成し、そのリンクの情報をリンク記憶部１０３に記憶させる（ステップＡ６）。ステップＡ６では、リンク生成部１０７は、生成したリンクのリンク信頼度として「○」を記憶させる。一方、選択した関連性値が第一の閾値未満であるならば（ステップＡ５におけるＮｏ）、リンク生成部１０７は、ステップＡ４以降の処理を繰り返す。 Next, the link generation unit 107 determines whether or not all the relevance values calculated for each of two conversion specifications having different document IDs are compared with the first threshold (step A5 described later). Determine (step A4). If the comparison with the first threshold is not completed for all relevance values (No in step A4), the link generation unit 107 calculates each relevance value calculated for each of two conversion specifications having different document IDs. Among them, one unselected relevance value is selected, and it is determined whether or not the relevance value is equal to or greater than a first threshold value (step A5). If the selected relevance value is greater than or equal to the first threshold value (Yes in step A5), the link generation unit 107 generates a link between the two conversion specifications from which the relevance value is calculated, The link information is stored in the link storage unit 103 (step A6). In step A6, the link generation unit 107 stores “◯” as the link reliability of the generated link. On the other hand, if the selected relevance value is less than the first threshold value (No in step A5), the link generation unit 107 repeats the processing after step A4.

図９は、リンク生成部１０７によるリンク生成結果（仕様の対応付け結果）の例を模式的に示す説明図である。文書０の仕様０と文書１の仕様０とは、文字列がほぼ同一であり関連性が高いため、リンク生成部１０７は、図９に示すように、文書０の仕様０と文書１の仕様０との間にリンクを生成する。また、文書１の仕様１に関しては、関連性の高い文字列を含む仕様が文書０内に存在しないため、リンク生成部１０７は、図９に示すように、文書１の仕様１に関するリンクを生成しない。 FIG. 9 is an explanatory diagram schematically illustrating an example of a link generation result (specification matching result) by the link generation unit 107. Since the specification 0 of the document 0 and the specification 0 of the document 1 have almost the same character strings and are highly related, the link generation unit 107 has the specifications 0 of the document 0 and the specifications of the document 1 as shown in FIG. Create a link with 0. In addition, regarding the specification 1 of the document 1, since the specification including the highly relevant character string does not exist in the document 0, the link generation unit 107 generates a link related to the specification 1 of the document 1 as shown in FIG. do not do.

なお、図９に示すようにリンクを生成した場合、リンク生成部１０７は、図７に例示する情報をリンク記憶部１０３に記憶させる。例えば、リンク生成部１０７は、文書０の仕様０と文書１の仕様０との間にリンクを生成したので、その旨（文書０の仕様０と文書１の仕様０とを対応付けたこと）を表す情報（図７参照）をリンク記憶部１０３に記憶させる。 When the link is generated as illustrated in FIG. 9, the link generation unit 107 stores the information illustrated in FIG. 7 in the link storage unit 103. For example, the link generation unit 107 has generated a link between the specification 0 of the document 0 and the specification 0 of the document 1, so that (the specification 0 of the document 0 and the specification 0 of the document 1 are associated) Is stored in the link storage unit 103 (see FIG. 7).

ステップＡ６の後、リンク生成部１０７は、ステップＡ４以降の処理を繰り返す。ステップＡ４で、全ての関連性値に関して第一の閾値との比較が完了したと判定したならば（ステップＡ４におけるＹｅｓ）、リンク補完部１０８は、文書０内の変換仕様のうち、リンクが生成されていない変換仕様に対して、文書１内の変換仕様との間でリンクを補完するか否かを判定し、リンクを補完すると判定した変換仕様間にリンクを補完する（ステップＡ７）。同様に、リンク補完部１０８は、文書１内の変換仕様のうち、リンクが生成されていない変換仕様に対して、文書０内の変換仕様との間でリンクを補完するか否かを判定し、リンクを補完すると判定した変換仕様間にリンクを補完する（ステップＡ８）。 After step A6, the link generation unit 107 repeats the processing after step A4. If it is determined in step A4 that the comparison with the first threshold has been completed for all relevance values (Yes in step A4), the link complementing unit 108 generates a link among the conversion specifications in the document 0. It is determined whether or not a link specification is to be complemented with a conversion specification in document 1 for a conversion specification that has not been completed, and the link is supplemented between the conversion specifications that have been determined to be supplemented (step A7). Similarly, the link complementation unit 108 determines whether or not to complement a link with a conversion specification in the document 0 for a conversion specification in which no link is generated among the conversion specifications in the document 1. The link is complemented between the conversion specifications determined to complement the link (step A8).

ステップＡ８では、ステップＡ７と同様の処理が、着目する文書ＩＤを切り替えて実行されればよい。 In step A8, the same processing as in step A7 may be executed by switching the document ID of interest.

次に、関連性有無判定部１０５によるステップＡ２、および関連性計算部１０６によるステップＡＳ３の処理について説明する。図１０は、ステップＡ２，Ａ３の詳細を示すフローチャートである。図１０の説明において、文書ＩＤが互いに異なる２つの変換仕様を、仕様Ａ，Ｂと記す。図１０において、ステップＡ１１，Ａ１２が、図８に示すステップＡ２に相当し、ステップＡ１２よりも後の処理が、図８に示すステップＡ３に相当する。 Next, the process of step A2 by the relevance presence / absence determination unit 105 and step AS3 by the relevance calculation unit 106 will be described. FIG. 10 is a flowchart showing details of steps A2 and A3. In the description of FIG. 10, two conversion specifications having different document IDs are referred to as specifications A and B. In FIG. 10, steps A11 and A12 correspond to step A2 shown in FIG. 8, and processing subsequent to step A12 corresponds to step A3 shown in FIG.

関連性有無判定部１０５は、仕様Ａ，Ｂから共通部分列を抽出する（ステップＡ１１）。すなわち、仕様Ａを表す文字列と、仕様Ｂを表す文字列とから、仕様Ａ，Ｂに共通に出現していて、かつ、予め定められた長さの文字列を抽出する。 The relevance determination unit 105 extracts a common partial sequence from the specifications A and B (step A11). That is, from the character string representing the specification A and the character string representing the specification B, a character string that appears in common in the specifications A and B and has a predetermined length is extracted.

ここでは、共通部分列の長さが予め「２」に定められている場合を例にして説明する。図６に示す文書０の仕様０と、文書１の仕様０から共通部分列を抽出する場合を例にして説明する。この場合、文書０の仕様０と文書１の仕様０に共通に出現していて、かつ、長さが「２」である文字列を、共通部分列として抽出すればよい。この例では、「顧客」、「客が」、「が営」、「営業」、「業へ」、「へ注」、「注文」、「文表」、「表で」、「で注」、「注文」がそれぞれ、共通部分列に該当するので、これらの共通部分列を抽出すればよい。 Here, a case where the length of the common subsequence is set to “2” in advance will be described as an example. An example in which a common partial sequence is extracted from specification 0 of document 0 and specification 0 of document 1 shown in FIG. 6 will be described. In this case, a character string that appears in common in specification 0 of document 0 and specification 0 of document 1 and has a length of “2” may be extracted as a common partial sequence. In this example, “customer”, “customer”, “gai”, “sales”, “to business”, “to note”, “order”, “sentence table”, “in table”, “in note” , “Order” corresponds to the common partial sequence, and these common partial sequences may be extracted.

また、既に説明したように、共通部分列に対して条件が予め定められていてもよい。その場合、関連性有無判定部１０５は、その条件を満たす共通部分列を抽出すればよい。例えば、通部分列の長さが「２」と定められていて、ひらがなは共通部分列に含めないという条件が定められているとする。このとき、図６に示す文書０の仕様０と、文書１の仕様０から共通部分列を抽出すると、関連性有無判定部１０５は、「顧客」、「営業」、「注文」、「文表」、「注文」を抽出することになる。 Further, as already described, a condition may be predetermined for the common subsequence. In that case, the relevance determination unit 105 may extract a common partial sequence that satisfies the condition. For example, it is assumed that the length of the partial subsequence is defined as “2” and the condition that hiragana is not included in the common subsequence is defined. At this time, when the common partial sequence is extracted from the specification 0 of the document 0 and the specification 0 of the document 1 shown in FIG. 6, the relevance determination unit 105 displays “customer”, “sales”, “order”, ”And“ order ”are extracted.

また、共通部分列に関する条件は、「ひらがなは含めない」という条件に限定されず、他の条件であってもよい。例えば、「同じ共通部分列を重複して抽出しない」という条件が定められていてもよい。共通部分列にひらがなを含めず、かつ、共通部分列が重複しないようにするという条件が定められているとする。このとき、図６に示す文書０の仕様０と、文書１の仕様０から共通部分列を抽出すると、関連性有無判定部１０５は、「顧客」、「営業」、「注文」、「文表」を抽出することになる。「ひらがなは含めない」という条件のもとでは、「注文」が重複して抽出されることになるが、「同じ共通部分列を重複して抽出しない」という条件も加えると、「注文」は１つだけ抽出されることになる。 Further, the condition relating to the common partial sequence is not limited to the condition “not including hiragana”, and may be other conditions. For example, a condition that “the same common partial sequence is not extracted twice” may be defined. Suppose that the common partial sequence does not include hiragana and that the common partial sequence is not duplicated. At this time, when the common partial sequence is extracted from the specification 0 of the document 0 and the specification 0 of the document 1 shown in FIG. 6, the relevance determination unit 105 reads “customer”, “sales”, “order”, “sentence table”. "Will be extracted. Under the condition of “Do not include hiragana”, “Order” will be extracted twice, but if you add the condition of “Do not extract the same common subsequence”, “Order” will be Only one will be extracted.

また、既に説明したように、条件とともに例外が指定可能であってもよい。例えば、ひらがなは共通部分列に含めないという条件とともに、「ない」等の特定のひらがなに関しては「共通部分列」として抽出するという例外を指定可能であってもよい。ここで例示した「ない」という文字列は、仕様内で否定を表すために必要な文字列であり、例外を指定することで、「ない」等の文字列を共通部分列として抽出することができる。 Further, as already described, an exception may be specified together with a condition. For example, in addition to the condition that hiragana characters are not included in the common subsequence, it may be possible to specify an exception that a specific hiragana such as “none” is extracted as “common subsequence”. The character string “None” illustrated here is a character string necessary for expressing negation in the specification. By specifying an exception, a character string such as “None” can be extracted as a common substring. it can.

共通部分列に関する条件やその例外は、関連仕様対応付けシステムのユーザがキーボード等の入力手段（図示略）を介して入力することによって設定してもよい。また、例えば、予め関連仕様対応付けシステムに条件やその例外を記憶させておいてもよい。これらは例示であり、条件やその例外の設定方法は、これらの方法に限定されない。 The conditions related to the common partial sequence and the exceptions may be set by the user of the related specification association system by inputting via an input means (not shown) such as a keyboard. Further, for example, conditions and exceptions may be stored in the related specification association system in advance. These are merely examples, and the method of setting conditions and exceptions is not limited to these methods.

以下の説明では、共通部分列にひらがなを含めず、かつ、共通部分列が重複しないようにするという条件が定められていて、「ない」は共通部分列として許容するという例外が定められているとする。 In the following description, a condition is set that the common substring does not include hiragana and that the common substring does not overlap, and an exception is given that “no” is allowed as a common substring. And

ステップＡ１１で２つの仕様Ａ，Ｂの共通部分列を抽出した後、関連性有無判定部１０５は、抽出した共通部分列が、２つの仕様に関連性があると判断するための条件を満たすか否かを判定する（ステップＡ１２）。以下、この条件を、関連性条件と記す。 After extracting the common partial sequence of the two specifications A and B in step A11, the relevance determination unit 105 satisfies the condition for determining that the extracted common partial sequence is related to the two specifications. It is determined whether or not (step A12). Hereinafter, this condition is referred to as a relevance condition.

関連性条件の例として、「抽出した共通部分列の数が予め定めた数以上であること」等が挙げられる。この場合、関連性有無判定部１０５は、抽出した共通部分列の数が予め定めた数以上であれば、２つの仕様に関連性があると判定し、抽出した共通部分列の数が予め定めた数未満であれば、２つの仕様に関連性がないと判定する。 As an example of the relevance condition, “the number of extracted common subsequences is equal to or greater than a predetermined number” or the like can be cited. In this case, the relevance determination unit 105 determines that the two specifications are related if the number of extracted common partial sequences is equal to or greater than a predetermined number, and the number of extracted common partial sequences is predetermined. If it is less than the number, it is determined that the two specifications are not related.

あるいは、「２つの仕様Ａ，Ｂの文字列の長さの和に対する、抽出した共通部分列の数の割合が、予め定めた割合以上であること」という条件を関連性条件として用いてもよい。この場合、関連性有無判定部１０５は、２つの仕様Ａ，Ｂの文字列の長さの和に対する、抽出した共通部分列の数の割合が、予め定めた割合以上であれば、２つの仕様に関連性があると判定する。一方、関連性有無判定部１０５は、２つの仕様Ａ，Ｂの文字列の長さの和に対する、抽出した共通部分列の数の割合が、予め定めた割合未満であれば、２つの仕様に関連性がないと判定する。 Alternatively, a condition that “the ratio of the number of extracted common partial strings to the sum of the lengths of the character strings of the two specifications A and B is equal to or greater than a predetermined ratio” may be used as the relevance condition. . In this case, if the ratio of the number of extracted common subsequences to the sum of the lengths of the character strings of the two specifications A and B is greater than or equal to a predetermined ratio, Is determined to be related. On the other hand, if the ratio of the number of extracted common subsequences to the sum of the lengths of the character strings of the two specifications A and B is less than a predetermined ratio, the relevance determination unit 105 determines that the two specifications It is determined that there is no relevance.

あるいは、「仕様Ａの文字列の長さをｐとし、仕様Ｂの文字列の長さをｑとし、共通部分列の文字列の長さをｒとしたときに、（抽出した共通部分列の数）／｛（ｐ−ｒ）×（ｑ−ｒ）｝が、予め定めた値以上であること」という条件を関連性条件として用いてもよい。 Or, “When the length of the character string of the specification A is p, the length of the character string of the specification B is q, and the length of the character string of the common partial sequence is r, (the extracted common partial sequence The condition that “number) / {(p−r) × (q−r)} is equal to or greater than a predetermined value” may be used as the relevance condition.

関連性条件は、関連仕様対応付けシステムのユーザがキーボード等の入力手段（図示略）を介して入力することによって設定されてもよい。また、例えば、予め関連仕様対応付けシステムに関連性条件を記憶させておいてもよい。これらは例示であり、関連性条件の設定方法は、これらの方法に限定されない。 The relevance condition may be set by the user of the related specification associating system inputting via an input means (not shown) such as a keyboard. For example, the relevance condition may be stored in advance in the related specification association system. These are examples, and the setting method of the relevance condition is not limited to these methods.

関連性有無判定部１０５は、関連性条件が満たされている場合、仕様Ａ，Ｂの間に関連性があると判定し、関連性条件が満たされていない場合、仕様Ａ，Ｂの間に関連性がないと判定する。 When the relevance condition is satisfied, the relevance presence / absence determination unit 105 determines that there is a relevance between the specifications A and B, and when the relevance condition is not satisfied, the relevance is determined between the specifications A and B. It is determined that there is no relevance.

ステップＡ３において２つの仕様Ａ，Ｂ間の関連性値を計算する場合、その仕様Ａ，Ｂ間に関連性があると判定されているならば（ステップＡ１２で関連性条件を満たすと判定されているならば）、関連性計算部１０６は、仕様Ａの文字列の可逆圧縮サイズを計算する（ステップＡ１３）。ここで、可逆圧縮サイズとは、可逆圧縮アルゴリズムを用いて仕様の文字列を圧縮したときのサイズである。可逆圧縮アルゴリズムとは、圧縮前の文字列と、圧縮し、さらに、展開する処理を行った後の文字列とが完全に一致する圧縮アルゴリズムである。可逆圧縮アルゴリズムの例として、例えば、ｚｉｐ、ｇｚｉｐ等がある。どのような可逆圧縮アルゴリズムを用いるかは、例えば、関連仕様対応付けシステムのユーザがキーボード等の入力手段（図示略）を介して指定してもよい。また、例えば、予め関連仕様対応付けシステムに、どのような可逆圧縮アルゴリズムを用いるかを記憶させておいてもよい。どのような可逆圧縮アルゴリズムを用いるかを指定する方法は、これらに限定されず、他の方法であってもよい。本実施形態では、可逆圧縮アルゴリズムとして、ｇｚｉｐを用いる場合を例にする。 When calculating the relationship value between the two specifications A and B in step A3, if it is determined that there is a relationship between the specifications A and B (in step A12, it is determined that the relationship is satisfied) If so, the relevancy calculation unit 106 calculates the lossless compression size of the character string of the specification A (step A13). Here, the lossless compression size is a size when a character string of a specification is compressed using a lossless compression algorithm. The lossless compression algorithm is a compression algorithm in which a character string before compression completely matches a character string after being compressed and further expanded. Examples of lossless compression algorithms include zip and gzip. What type of lossless compression algorithm is used may be specified by the user of the related specification associating system via input means (not shown) such as a keyboard. Further, for example, what type of lossless compression algorithm is used may be stored in advance in the related specification association system. The method for designating which lossless compression algorithm is used is not limited to these, and other methods may be used. In the present embodiment, a case where gzip is used as the lossless compression algorithm is taken as an example.

ステップＡ１３の後、関連性計算部１０６は、仕様Ｂの文字列の可逆圧縮サイズを計算する（ステップＡ１４）。さらに、関連性計算部１０６は、仕様Ａの文字列と仕様Ｂの文字列とを連結した文字列の可逆圧縮サイズを計算する（ステップＡ１５）。以下、この値を、仕様Ａ，Ｂの連結可逆圧縮サイズと記す。関連性計算部１０６は、ステップＡ１３，Ａ１４，Ａ１５で、同じ可逆圧縮アルゴリズムを用いる。 After step A13, the relevance calculation unit 106 calculates the lossless compression size of the character string of the specification B (step A14). Further, the relevancy calculation unit 106 calculates the lossless compression size of the character string obtained by concatenating the character string of the specification A and the character string of the specification B (step A15). Hereinafter, this value is referred to as a concatenated lossless compression size of specifications A and B. The relevancy calculation unit 106 uses the same lossless compression algorithm in steps A13, A14, and A15.

また、２つの仕様の文字列を連結した文字列とは、一方の仕様の文字列の後にもう一方の仕様の文字列を連ねた文字列である。例えば、図６に示す文書０の仕様０の文字列と、文書１の仕様０の文字列とを連結した文字列は、「顧客が営業へ注文表で注文顧客が営業へ注文表で注文。」となる。 A character string obtained by concatenating two specification character strings is a character string in which one specification character string is connected to the other specification character string. For example, the character string in which the character string of specification 0 of document 0 and the character string of specification 0 of document 1 shown in FIG. 6 are concatenated is “customer orders to sales and order customer orders to sales. "

ステップＡ１５の後、関連性計算部１０６は、ステップＡ１３，Ａ１４，Ａ１５で計算した各可逆圧縮サイズを用いて、仕様Ａ，Ｂ間の関連性値を計算する（ステップＡ１６）。ステップＡ１６では、関連性計算部１０６は、例えば、以下の計算方法によって、仕様Ａ，Ｂ間の関連性値を算出すればよい。ステップＡ１３で計算した仕様Ａの可逆圧縮サイズをＣ（Ａ）とする。また、ステップＡ１４で計算した仕様Ｂの可逆圧縮サイズをＣ（Ｂ）とする。また、ステップＡ１５で計算した仕様Ａ，Ｂの連結可逆圧縮サイズをＣ（ＡＢ）とする。関連性計算部１０６は、例えば、以下に示す式（１）の計算を行うことで、仕様Ａ，Ｂ間の関連性値を算出すればよい。なお、仕様Ａ，Ｂ間の関連性値をｓで表す。 After step A15, the relevance calculation unit 106 calculates a relevance value between specifications A and B using each lossless compression size calculated in steps A13, A14, and A15 (step A16). In step A16, the relevance calculation unit 106 may calculate the relevance value between the specifications A and B by the following calculation method, for example. Let C (A) be the lossless compression size of the specification A calculated in step A13. In addition, the reversible compression size of the specification B calculated in step A14 is C (B). Further, the concatenated lossless compression size of the specifications A and B calculated in step A15 is defined as C (AB). The relevancy calculation unit 106 may calculate the relevance value between the specifications A and B, for example, by calculating the following equation (1). The relevance value between specifications A and B is represented by s.

ｓ＝１−Ｃ（ＡＢ）／｛Ｃ（Ａ）＋Ｃ（Ｂ）｝式（１） s = 1-C (AB) / {C (A) + C (B)} Formula (1)

仕様Ａ，Ｂ間の関連性値の計算方法は、式（１）を用いた方法に限定されない。例えば、ｍｉｎ｛α，β｝は、α，βのうち小さい方の値を表すものとする。ｍａｘ｛α，β｝は、α，βのうち大きい方の値を表すものとする。このとき、関連性計算部１０６は、以下に示す式（２）の計算を行うことで、仕様Ａ，Ｂ間の関連性値ｓを算出してもよい。 The calculation method of the relationship value between the specifications A and B is not limited to the method using the formula (1). For example, min {α, β} represents the smaller value of α and β. max {α, β} represents the larger value of α and β. At this time, the relevance calculation unit 106 may calculate the relevance value s between the specifications A and B by performing the calculation of Expression (2) shown below.

ｓ＝
１−［Ｃ（ＡＢ）−ｍｉｎ｛Ｃ（Ａ），Ｃ（Ｂ）｝］／ｍａｘ｛Ｃ（Ａ），Ｃ（Ｂ）｝式（２）s =
1- [C (AB) -min {C (A), C (B)}] / max {C (A), C (B)} Equation (2)

関連性値は、概ね０から１までの数値となる。関連性値が１に近いほど、仕様Ａと仕様Ｂとが関連していることを表す。 The relevance value is generally a numerical value from 0 to 1. The closer the relevance value is to 1, the more the specification A and the specification B are related.

ここで、冗長性の高い文字列ほど圧縮率が高くなり、圧縮サイズは相対的に小さくなる。そのため、仕様Ａと仕様Ｂとが同一文字列を多く含み冗長性が高ければ、連結可逆圧縮サイズＣ（Ａ，Ｂ）が小さくなり、関連性値は大きな値として計算される。以下、本実施形態では、式（２）を用いて仕様Ａ，Ｂ間の関連性値を算出する場合を例にして説明する。 Here, the higher the redundancy, the higher the compression rate and the smaller the compression size. Therefore, if the specifications A and B contain many identical character strings and have high redundancy, the concatenated lossless compression size C (A, B) becomes small and the relevance value is calculated as a large value. Hereinafter, in the present embodiment, a case where the relevance value between the specifications A and B is calculated using Expression (2) will be described as an example.

なお、記憶部１１は、２つの仕様の組合せ毎に、関連性値を記憶する関連性値記憶部（図示略）を含んでいてもよい。そして、関連性計算部１０６は、その関連性値記憶部に、計算した関連性値と、その算出元となった２つの仕様とを組にして、関連性値記憶部に記憶させてもよい。あるいは、関連性計算部１０６は、計算した関連性値をリンク記憶部１０３に記憶させてもよい。このとき、関連性計算部１０６は、文書ＩＤおよび仕様ＩＤによって、互いに異なる文書の仕様の組合せを表し、その仕様の組合せに関する関連性値（図７において図示せず。）を記憶させてもよい。また、次に説明するステップＡ１７では、仕様間に関連性がないと判断されているので、その２つの仕様の組合せに対するリンク信頼度として、例えば、「×」を記憶させればよい。「×」は、仕様間の関連性がないことを意味する。 The storage unit 11 may include a relevance value storage unit (not shown) that stores a relevance value for each combination of two specifications. Then, the relevancy calculation unit 106 may store the relevance value storage unit in the relevance value storage unit by combining the calculated relevance value and the two specifications from which the calculation is made. . Alternatively, the relevance calculation unit 106 may store the calculated relevance value in the link storage unit 103. At this time, the relevancy calculation unit 106 may represent a combination of different document specifications based on the document ID and the specification ID, and may store a relevance value (not shown in FIG. 7) regarding the combination of the specifications. . In Step A17 described below, since it is determined that there is no relationship between the specifications, for example, “x” may be stored as the link reliability for the combination of the two specifications. “X” means that there is no relationship between specifications.

また、ステップＡ３において、関連性計算部１０６は、２つの仕様Ａ，Ｂ間の関連性値を計算する場合、その仕様Ａ，Ｂ間に関連性がないと判定されているならば（ステップＡ１２で関連性条件を満たさないと判定されているならば）、その仕様Ａ，Ｂ間の関連性値を０とする（ステップＡ１７）。 In step A3, when calculating the relevance value between the two specifications A and B, the relevance calculating unit 106 determines that there is no relevance between the specifications A and B (step A12). If it is determined that the relevance condition is not satisfied), the relevance value between the specifications A and B is set to 0 (step A17).

仕様Ａ，Ｂに関して関連性条件を満たさないと判定されている場合に、仕様Ａ，Ｂ間の関連性値を０とするのは、この場合に圧縮サイズを用いて関連性値を計算すると、次のような問題が生じるからである。可逆圧縮アルゴリズムでは、仕様を表す文字列を２進記号列に変換してから圧縮される。このとき、例えば、文字「あ」が「１０１００１００１０１０００１０」に変換され、文字「い」が「１０１００１００１０１００１００」に変換されるとすると、「あ」と「い」は、元の文字列では全く異なる文字であるにも関わらず、２進記号列では先頭から１３文字目までが同一であるため、圧縮サイズを利用して関連性値を計算すると、「あ」と「い」はある程度関連があることになってしまう（すなわち、ある程度高い関連性値が計算されてしまう）。このように、関連性がないと判断されている文字列同士の関連性値の計算に圧縮サイズを利用すると、関連性値が高く計算されてしまう場合が生じるという問題がある。このような問題を防止するため、仕様Ａ，Ｂに関して関連性条件を満たさないと判定されている場合には、圧縮サイズを利用せずに、仕様Ａ，Ｂの関係性値を０とする。 When it is determined that the relevance condition is not satisfied for the specifications A and B, the relevance value between the specifications A and B is set to 0. In this case, when the relevance value is calculated using the compression size, This is because the following problems arise. In the lossless compression algorithm, a character string representing a specification is converted into a binary symbol string and then compressed. At this time, for example, if the character “A” is converted to “1010010010100010” and the character “I” is converted to “1010010010100100”, “A” and “I” are completely different characters in the original character string. In spite of this, since the first 13 characters in the binary symbol string are the same, when calculating the relevance value using the compressed size, “a” and “i” are related to some extent. (That is, a relevance value that is somewhat high is calculated). As described above, when the compression size is used for calculating the relevance value between character strings that are determined to have no relevance, there is a problem that the relevance value may be calculated to be high. In order to prevent such a problem, when it is determined that the relevance condition is not satisfied for the specifications A and B, the relationship value of the specifications A and B is set to 0 without using the compression size.

また、２つの仕様の文字列に、同一の文字列が含まれていたとしても、その同一の文字列がひらがな等、多くのテキスト（文書）に含まれている文字からなる文字列であれば、その２つの仕様の関連性値を高い値にするのは好ましくない。しかし、その一方で、単にひらがなを除いて関連性値を計算すると、例えば、「顧客が営業へ」と、「顧客へ営業が」とを区別することができない。 Moreover, even if the character string of two specifications includes the same character string, the same character string is a character string consisting of characters included in many texts (documents) such as hiragana. Therefore, it is not preferable to make the relevance value of the two specifications high. On the other hand, if the relevance value is simply calculated except for hiragana, for example, “customer to sales” and “customer to sales” cannot be distinguished.

本実施形態では、関連性有無判定部１０５がステップＡ１２で共通部分列に基づいて２つの仕様の関連性の有無を判定する。そして、共通部分列が少なく、２つの仕様に関連性がないと判定した場合には、関連性計算部１０６が、圧縮サイズを利用せずに、その２つの仕様の関連性値を０とする。また、関連性有無判定部１０５がステップＡ１１，Ａ１２を行うことで、ひらがなを除いても、「顧客が営業へ」と「顧客へ営業が」のような組合せにも関連性があると判定することができる。そして、関連性計算部１０６が、ステップＡ１３〜Ａ１６の処理を行うことで、ひらがなも含めた関連性値を計算することができる。本実施形態では、このようにステップＡ２，Ａ３（具体的には、ステップＡ１１〜Ａ１７）の処理によって、上記のような問題の発生を防止することができる。 In the present embodiment, the relevance presence / absence determination unit 105 determines the relevance of the two specifications based on the common partial sequence in step A12. When it is determined that there are few common partial sequences and the two specifications are not related, the relationship calculation unit 106 sets the relationship value of the two specifications to 0 without using the compressed size. . Further, the relevance determination unit 105 performs steps A11 and A12, so that it is determined that a combination such as “customer to sales” and “customer to sales” is also relevant even if hiragana is excluded. be able to. And the relevance calculation part 106 can calculate the relevance value including hiragana by performing the processing of steps A13 to A16. In the present embodiment, the occurrence of the above problems can be prevented by the processing of steps A2 and A3 (specifically, steps A11 to A17).

次に、リンク補完部１０８による補完動作について説明する。図１１は、リンク補完部１０８による補完動作の例を示すフローチャートである。ここでは、図９に例示する文書０に対して補完を行う場合を例にして説明する。より具体的には、図９に例示するリンクが生成された後に、ステップＡ７（図８参照）に移行した場合を例にして説明する。 Next, the complementing operation by the link complementing unit 108 will be described. FIG. 11 is a flowchart illustrating an example of a complement operation performed by the link complement unit 108. Here, a case where the document 0 illustrated in FIG. 9 is complemented will be described as an example. More specifically, an example will be described in which the process proceeds to step A7 (see FIG. 8) after the link illustrated in FIG. 9 is generated.

また、以下の説明では、着目している文書（本例では文書０）の仕様ＩＤを記号“ｉ”で表し、もう一方の文書（本例では文書１）の仕様ＩＤを記号“ｊ”で表す。さらに、着目している文書に属する仕様の数をＮ_０とし、もう一方の文書に属する仕様の数をＮ_１とする。さらに、正の整数である定数ｋが予め定められているものとする。定数ｋは、例えば、関連仕様対応付けシステムのユーザがキーボード等の入力手段（図示略）を介して入力することによって設定されてもよい。また、例えば、予め関連仕様対応付けシステムに定数ｋを記憶させておいてもよい。これらは例示であり、定数ｋの定め方は、これらの方法に限定されない。ｋは、ある仕様を基準として、その仕様よりも前の仕様の範囲や、その仕様よりも後の仕様の範囲を定めるために用いられる。In the following description, the specification ID of the document of interest (document 0 in this example) is represented by the symbol “i”, and the specification ID of the other document (document 1 in this example) is represented by the symbol “j”. Represent. Further, the number of specifications belonging to the document of interest is N _0, and the number of specifications belonging to the other document is N ₁ . Furthermore, it is assumed that a constant k which is a positive integer is predetermined. The constant k may be set, for example, by a user of the related specification associating system inputting via an input unit (not shown) such as a keyboard. Further, for example, the constant k may be stored in advance in the related specification association system. These are examples, and the method of determining the constant k is not limited to these methods. k is used to determine a range of specifications before the specification and a range of specifications after the specification with a certain specification as a reference.

まず、リンク補完部１０８は、着目している文書（本例では文書０）の仕様ＩＤを表す番号ｉを０に初期化する（ステップＡ３１）。 First, the link complementation unit 108 initializes a number i representing the specification ID of the document of interest (document 0 in this example) to 0 (step A31).

次に、リンク補完部１０８は、ｉが、文書０に属する仕様の数であるＮ_０未満であるか否かを判定する（ステップＡ３２）。ｉ＜Ｎ_０であれば（ステップＡ３２におけるＹｅｓ）、ステップＡ３３に移行し、ｉ≧Ｎ_０であれば（ステップＡ３２におけるＮｏ）、処理を終了する。Next, the link complementation unit 108, i is equal to or less than _{N 0} is the number of specifications that belong to the document 0 (step A32). If i <N ₀ (Yes in step A32), the process proceeds to step A33, and if i ≧ N ₀ (No in step A32), the process ends.

ステップＡ３３において、リンク補完部１０８は、文書１の仕様ＩＤを表す番号ｊを０に初期化する（ステップＡ３３）。 In step A33, the link complementing unit 108 initializes a number j representing the specification ID of the document 1 to 0 (step A33).

次に、リンク補完部１０８は、ｊが、文書１に属する仕様の数であるＮ_１未満であるか否かを判定する（ステップＡ３４）。ｊ＜Ｎ_１であれば（ステップＡ３４におけるＹｅｓ）、ステップＡ３５に移行し、ｊ≧Ｎ_１であれば（ステップＡ３４におけるＮｏ）、ステップＡ３９に移行する。Next, the link complementation unit 108, j is determines whether the _N less than ₁ is the number of specifications that belong to the document 1 (step A34). If j _{<N 1} (Yes at Step A34), the process proceeds to step A35, if j ≧ _{N 1} (No in step A34), the process proceeds to step A39.

ステップＡ３５において、リンク補完部１０８は、リンク記憶部１０３に記憶された情報を参照して、文書０の仕様ｉに既にリンクが生成されているか否かを判定する（ステップＡ３５）。換言すれば、リンク補完部１０８は、文書０の仕様ｉに、文書１のいずれかの仕様が対応付けられているか否かを判定する。文書０の仕様ｉに既にリンクが生成されていれば（ステップＡ３５におけるＹｅｓ）、ステップＡ３９に移行する。文書０の仕様ｉにまだリンクが生成されていなければ（ステップＡ３５におけるＮｏ）、ステップＡ３６に移行する。 In step A35, the link complementation unit 108 refers to the information stored in the link storage unit 103, and determines whether or not a link has already been generated in the specification i of the document 0 (step A35). In other words, the link complementation unit 108 determines whether any specification of the document 1 is associated with the specification i of the document 0. If a link has already been generated in the specification i of the document 0 (Yes in step A35), the process proceeds to step A39. If a link has not yet been generated in the specification i of document 0 (No in step A35), the process proceeds to step A36.

ステップＡ３６において、リンク補完部１０８は、文書０の仕様ｉと、文書１の仕様ｊとの関連性値を参照して、その関連性値が第二の閾値以上であるか否かを判定する（ステップＡ３６）。 In step A36, the link complementing unit 108 refers to the relevance value between the specification i of the document 0 and the specification j of the document 1, and determines whether or not the relevance value is equal to or greater than a second threshold value. (Step A36).

すでに説明したように、記憶部１１が関連性値記憶部（図示略）を含み、関連性記憶部１０６は、ステップＡ３で求めた仕様間の関連性値を、その関連性値記憶部に記憶させておけばよい。あるいは、関連性記憶部１０６は、ステップＡ３で求めた仕様間の関連性値をリンク記憶部１０３に記憶させてもよい。ステップＡ３６において、リンク補完部１０８は、ステップＡ３で求められ記憶された各関連性値のうち、仕様ｉ，ｊの関連性値を参照し、その関連性値が第二の閾値以上であるか否かを判定すればよい。 As already described, the storage unit 11 includes a relevance value storage unit (not shown), and the relevance storage unit 106 stores the relevance value between specifications obtained in step A3 in the relevance value storage unit. You can let it go. Alternatively, the relationship storage unit 106 may store the relationship value between the specifications obtained in step A3 in the link storage unit 103. In step A36, the link complementing unit 108 refers to the relevance value of the specifications i and j among the relevance values obtained and stored in step A3, and whether the relevance value is greater than or equal to the second threshold value. What is necessary is just to determine.

なお、第二の閾値は、例えば、関連仕様対応付けシステムのユーザがキーボード等の入力手段（図示略）を介して入力することによって設定してもよい。また、例えば、予め関連仕様対応付けシステムに第二の閾値のデフォルト値を記憶させておいてもよい。これらは例示であり、第二の閾値の設定方法は、これらの方法に限定されない。第二の閾値は、例えば、０．２５等の値に設定すればよいが、この値も例示であり、第二の閾値は０．２５以外の値であってもよい。ただし、第二の閾値は、第一の閾値より小さな値に定めておく。 Note that the second threshold value may be set, for example, by a user of the related specification associating system inputting via an input means (not shown) such as a keyboard. For example, the default value of the second threshold value may be stored in advance in the related specification association system. These are examples, and the second threshold setting method is not limited to these methods. The second threshold value may be set to a value such as 0.25, for example, but this value is also an example, and the second threshold value may be a value other than 0.25. However, the second threshold value is set to a value smaller than the first threshold value.

仕様ｉ，ｊの関連性値が第二の閾値未満であれば（ステップＡ３６におけるＮｏ）、ステップＡ４０に移行する。また、仕様ｉ，ｊの関連性値が第二の閾値以上であれば（ステップＡ３６におけるＹｅｓ）、リンク補完部１０８は、前条件が満たされているか否かを判定する（ステップＡ３７）。 If the relevance value of the specifications i and j is less than the second threshold value (No in step A36), the process proceeds to step A40. If the relevance value of the specifications i and j is greater than or equal to the second threshold value (Yes in step A36), the link complementation unit 108 determines whether the precondition is satisfied (step A37).

前条件は、ｉ＝０であること、あるいは、文書０において仕様ｉよりもｋ個前の仕様ｉ−ｋから仕様ｉの１つ前の仕様ｉ−１までの間に、文書１におけるｊ番目以前の仕様と対応付けられている仕様が少なくとも１つ存在していることである。すなわち、「ｉ＝０であること」と、「文書０において仕様ｉよりもｋ個前の仕様ｉ−ｋから仕様ｉの１つ前の仕様ｉ−１までの間に、文書１におけるｊ番目以前の仕様と対応付けられている仕様が少なくとも１つ存在していること」の２つのうち、少なくともいずれか一方が満たされていれば、前条件は満たされていることになる。「ｉ＝０であること」と、「文書０において仕様ｉよりもｋ個前の仕様ｉ−ｋから仕様ｉの１つ前の仕様ｉ−１までの間に、文書１におけるｊ番目以前の仕様と対応付けられている仕様が少なくとも１つ存在していること」がいずれも満たされていなければ、前条件は満たされていないことになる。なお、ｉ＝０であることは、仕様ｉが文書０における最初の仕様であることを意味する。 The precondition is that i = 0, or in the document 0, between the specification i-k before the specification i and the specification i-1 immediately before the specification i, the j-th in the document 1 That is, there is at least one specification associated with the previous specification. That is, “i = 0” and “jth in the document 1 from the specification i−k before the specification i to the specification i−1 immediately before the specification i in the document 0” If at least one of the two of “there is at least one specification associated with the previous specification” is satisfied, the precondition is satisfied. “I = 0” and “in the document 0, between the specification i−k k before the specification i and the specification i−1 immediately before the specification i, If none of “there is at least one specification associated with the specification” is satisfied, the precondition is not satisfied. Note that i = 0 means that the specification i is the first specification in the document 0.

前条件が満たされていなければ（ステップＡ３７におけるＮｏ）、ステップＡ４０に移行する。また、前条件が満たされていれば（ステップＡ３７におけるＹｅｓ）、リンク補完部１０８は、後条件が満たされているか否かを判定する（ステップＡ３８）。 If the precondition is not satisfied (No in step A37), the process proceeds to step A40. If the precondition is satisfied (Yes in step A37), the link complementation unit 108 determines whether the postcondition is satisfied (step A38).

後条件は、ｉ＝Ｎ_０−１であること、あるいは、文書０において仕様ｉの次の仕様ｉ＋１から、仕様ｉよりもｋ個後の仕様ｉ＋ｋまでの間に、文書１におけるｊ番目以後の仕様と対応づけられている仕様が少なくとも１つ存在していることである。すなわち、「ｉ＝Ｎ_０−１であること」と、「文書０において仕様ｉの次の仕様ｉ＋１から、仕様ｉよりもｋ個後の仕様ｉ＋ｋまでの間に、文書１におけるｊ番目以後の仕様と対応づけられている仕様が少なくとも１つ存在していること」の２つのうち、少なくともいずれか一方が満たされていれば、後条件は満たされていることになる。「ｉ＝Ｎ_０−１であること」と、「文書０において仕様ｉの次の仕様ｉ＋１から、仕様ｉよりもｋ個後の仕様ｉ＋ｋまでの間に、文書１におけるｊ番目以後の仕様と対応づけられている仕様が少なくとも１つ存在していること」がいずれも満たされていなければ、後条件は満たされていないことになる。なお、ｉ＝Ｎ_０−１であることは、仕様ｉが文書０における最後の仕様であることを意味する。The post-condition is i = N ₀ −1, or in the document 0, the specification i + 1 after the specification i and the specification i + k after the specification i to the specification i + k after the specification i. That is, there is at least one specification associated with the specification. That is, “i = N ₀ −1” and “from the specification i + 1 next to the specification i in the document 0 to the specification i + k after the specification i, the jth and later in the document 1 If at least one of two of “there is at least one specification associated with the specification” is satisfied, the post-condition is satisfied. “I = N ₀ −1” and “from the specification i + 1 next to the specification i in the document 0 to the specification i + k after the specification i, the jth and subsequent specifications in the document 1 If none of the “corresponding specifications are present” is satisfied, the post-condition is not satisfied. Note that i = N ₀ −1 means that the specification i is the last specification in the document 0.

後条件が満たされていなければ（ステップＡ３８におけるＮｏ）、ステップＡ４０に移行する。また、後条件が満たされていれば（ステップＡ３８におけるＹｅｓ）、リンク補完部１０８は、文書０の仕様ｉに文書１の仕様ｊを補完する（ステップＡ３９）。ステップＡ３９において、リンク補完部１０８は、文書０の仕様ｉと文書１の仕様ｊとのリンクを、図７に例示するようにリンク記憶部１０３に記憶させればよい。ただし、リンク信頼度は「△」として記憶させる。 If the post-condition is not satisfied (No in step A38), the process proceeds to step A40. If the post-condition is satisfied (Yes in step A38), the link complementation unit 108 supplements the specification j of the document 1 with the specification i of the document 0 (step A39). In step A39, the link complementation unit 108 may store the link between the specification i of the document 0 and the specification j of the document 1 in the link storage unit 103 as illustrated in FIG. However, the link reliability is stored as “Δ”.

ステップＡ３９の後、リンク補完部１０８は、ｊの値を１インクリメントする（ステップＡ４０）。また、ステップＡ３６，Ａ３７，Ａ３８においてＮｏと判定された場合にも、ｊの値を１インクリメントする（ステップＡ４０）。ステップＡ４０の後、ステップＡ３４以降の処理を繰り返す。 After step A39, the link complementation unit 108 increments the value of j by 1 (step A40). Also, when it is determined No in Steps A36, A37, and A38, the value of j is incremented by 1 (Step A40). After step A40, the processing after step A34 is repeated.

また、ステップＡ３４において、ｊ≧Ｎ_１であった場合（ステップＡ３４におけるＮｏ）や、ステップＡ３５において、文書０の仕様ｉに既にリンクが生成されている場合（ステップＡ３５におけるＹｅｓ）、リンク補完部１０８は、ｉの値を１インクリメントする（ステップＡ３９）。ステップＡ３９の後、ステップＡ３２以降の処理を繰り返す。Further, in step A34, if it was j ≧ _{N 1} (No in step A34) and, in step A35, if already linked to specifications i of the document 0 is generated (Yes in step A35), the link complementary portion 108 increments the value of i by 1 (step A39). After step A39, the processes after step A32 are repeated.

一般に、システムやソフトウェア等の開発において、前の工程の文書をもとに後の工程の文書を作成する。従って、文書内における仕様の記述順は、同じになることが多い。また、仕様が業務の順に書かれている場合にも、各文書において、仕様の記述順は同様になる。従って、上記のように前条件および後条件がいずれも満たされている場合に、仕様ｉ，ｊの組合せは、リンク生成対象の候補と考えられる。よって、仕様ｉ，ｊ間にリンクを補完すること（ステップＡ３９を行うこと）が適切であるということができる。ただし、リンク補完部１０８によって行われた仕様同士の対応付けは、リンク生成部１０７によって行われた仕様同士の対応付けよりも、リンク信頼度（対応付けの適切さに関する信頼度）が低いものとして定める。 In general, in the development of a system, software, etc., a document for a subsequent process is created based on a document for the previous process. Therefore, the description order of specifications in a document is often the same. In addition, when the specifications are written in the order of work, the description order of the specifications is the same in each document. Therefore, when both the precondition and the postcondition are satisfied as described above, the combination of the specifications i and j is considered as a candidate for link generation. Therefore, it can be said that complementing the link between the specifications i and j (performing step A39) is appropriate. However, the association between specifications performed by the link complementation unit 108 is assumed to have a lower link reliability (reliability related to the appropriateness of association) than the association between specifications performed by the link generation unit 107. Determine.

ここでは、ステップＡ７（図８参照）に移行した場合を例にして説明したが、ステップＡ８では、着目する文書を文書１とし、もう一方の文書を文書０として、同様の処理を行えばよい。 Here, the case where the process has been shifted to step A7 (see FIG. 8) has been described as an example. However, in step A8, the document to be focused on is document 1 and the other document is document 0, and the same processing may be performed. .

以下、ステップＡ７（図８参照）に移行した場合の動作の具体例を示す。ステップＡ７に移行した時点で、文書０，１間の仕様は、図９に例示するように対応付けられ、リンク記憶部１０３には図７に例示する情報が記憶されているものとする。ここでは、文書０のリンクを補完するのでＮ_０＝５であり、Ｎ_１＝７である。また、第二の閾値は０．２５であるとする。Hereinafter, a specific example of the operation when the process proceeds to step A7 (see FIG. 8) will be described. At the time of shifting to step A7, it is assumed that the specifications between the documents 0 and 1 are associated as illustrated in FIG. 9, and the link storage unit 103 stores information illustrated in FIG. Here, since the link of document 0 is complemented, N ₀ = 5 and N ₁ = 7. The second threshold is assumed to be 0.25.

リンク補完部１０８は、文書０の仕様ＩＤを表すｉを０に初期化し（ステップＡ３１）、ｉ＜Ｎ_０であるので、文書１の仕様ＩＤを表すｊを０に初期化する（ステップＡ３３）。そして、ｊ＜Ｎ_１であるので、ステップＡ３５に移行する。このとき、ｉ＝０，ｊ＝０である。The link complementing unit 108 initializes i representing the specification ID of the document 0 to 0 (step A31), and initializes j representing the specification ID of the document 1 to 0 because i <N ₀ (step A33). . And, since it is j _{<N 1,} the process proceeds to step A35. At this time, i = 0 and j = 0.

ステップＡ３５において、文書０の仕様０にリンクが生成されているか否かを確認すると、文書０の仕様０には、文書１の仕様０との間にリンクが生成されている（図９、図７参照。）。従って、ステップＡ３９に移行し、リンク補完部１０８は、ｉをインクリメントし、ステップＡ３２，Ａ３３，Ａ３４の処理を行う。この結果、ｉ＝１，ｊ＝０となっている。 When it is confirmed in step A35 whether or not a link is generated in the specification 0 of the document 0, a link is generated in the specification 0 of the document 0 with the specification 0 of the document 1 (FIG. 9, FIG. 9). 7). Accordingly, the process proceeds to step A39, where the link complementing unit 108 increments i and performs the processes of steps A32, A33, A34. As a result, i = 1 and j = 0.

次に、リンク補完部１０８は、ステップＡ３５において、文書０の仕様１にリンクが生成されているか否かを確認する。文書０の仕様１にはリンクが生成されていないため（図９、図７参照。）、ステップＡ３６に移行する。ここで、文書０の仕様１と文書１の仕様０との間の共通部分列は「営業」しかなく、この２つの仕様間に関連性がないと判定され、関連性値は０に定められていたとする。すると、文書０の仕様１と文書１の仕様０との間の関連性値は第二の閾値未満であるので（ステップＡ３６におけるＮｏ）、リンク補完部１０８は、ｊをインクリメントする（ステップＡ４０）。この結果、ｉ＝１，ｊ＝１となる。なお、既に説明したように、各仕様間の関連性値は、記憶部１１内の関連性値記憶部（図示略）に記憶していてもよく、あるいは、リンク記憶部１０３に記憶しておいてもよい。ステップＡ３６では、関連性値記憶部またはリンク記憶部１０３に記憶されている関連性値を参照すればよい。 Next, the link complementation part 108 confirms whether the link is produced | generated by the specification 1 of the document 0 in step A35. Since no link is generated in the specification 1 of the document 0 (see FIGS. 9 and 7), the process proceeds to step A36. Here, the common subsequence between the specification 1 of the document 0 and the specification 0 of the document 1 is only “sales”, and it is determined that there is no relationship between the two specifications, and the relevance value is set to 0. Suppose that Then, since the relevance value between the specification 1 of the document 0 and the specification 0 of the document 1 is less than the second threshold (No in Step A36), the link complementation unit 108 increments j (Step A40). . As a result, i = 1 and j = 1. As already described, the relevance values between the specifications may be stored in a relevance value storage unit (not shown) in the storage unit 11 or stored in the link storage unit 103. May be. In step A36, the relevance value stored in the relevance value storage unit or the link storage unit 103 may be referred to.

続いて、ステップＡ３４，Ａ３５の判定を行い、ステップＡ３６に移行する。ここで、文書０の仕様１と文書１の仕様１とに関しては、共通部分列が存在しないため、この２つの仕様間に関連性がないと判定され、関連性値は０に定められている。すると、文書０の仕様１と文書１の仕様１との間の関連性値は第二の閾値未満であるので（ステップＡ３６におけるＮｏ）、リンク補完部１０８は、ｊをインクリメントする（ステップＡ４０）。この結果、ｉ＝１，ｊ＝２となる。 Subsequently, determinations in steps A34 and A35 are made, and the process proceeds to step A36. Here, regarding the specification 1 of the document 0 and the specification 1 of the document 1, since there is no common partial sequence, it is determined that there is no relationship between the two specifications, and the relationship value is set to 0. . Then, since the relevance value between the specification 1 of the document 0 and the specification 1 of the document 1 is less than the second threshold (No in Step A36), the link complementation unit 108 increments j (Step A40). . As a result, i = 1 and j = 2.

続いて、ステップＡ３４，Ａ３５の判定を行い、ステップＡ３６に移行する。ここで、文書０の仕様１と文書１の仕様２とに関しては、２つの共通部分列「在庫」、「確認」が存在していて、この２つの仕様間に関連性があると判定され、関連性計算部１０６が関連性値を０．５５と算出しているものとする。関連性値０．５５は第二の閾値（０．２５）以上であるので、リンク補完部１０８は、前条件が満たされているか否かを判定する（ステップＡ３７）。 Subsequently, determinations in steps A34 and A35 are made, and the process proceeds to step A36. Here, regarding the specification 1 of the document 0 and the specification 2 of the document 1, it is determined that there are two common subsequences “inventory” and “confirmation”, and the two specifications are related to each other. It is assumed that the relevancy calculation unit 106 has calculated the relevance value as 0.55. Since the relevance value 0.55 is equal to or greater than the second threshold value (0.25), the link complementation unit 108 determines whether or not the precondition is satisfied (step A37).

ここで、文書０の仕様ｉ（ｉ＝１）の１つ前の仕様０は、文書１の仕様０との間にリンクが生成されている（図９、図７参照）。そして、文書１の仕様０は、文書１において仕様ｊ（ｊ＝２）以前の仕様である。従って、リンク補完部１０８は、前条件が満たされていると判定し（ステップＡ３７におけるＹｅｓ）、次に、後条件が満たされているか否かを判定する（ステップＡ３８）。 Here, a link is generated between the specification 0 immediately before the specification i (i = 1) of the document 0 and the specification 0 of the document 1 (see FIGS. 9 and 7). The specification 0 of the document 1 is a specification before the specification j (j = 2) in the document 1. Accordingly, the link complementation unit 108 determines that the precondition is satisfied (Yes in Step A37), and then determines whether the postcondition is satisfied (Step A38).

ここで、文書０の仕様ｉ（ｉ＝１）の１つ後の仕様２は、文書１の仕様３との間にリンクが生成されている（図９、図７参照）。そして、文書１の仕様３は、文書１において仕様ｊ（ｊ＝２）以後の仕様である。従って、リンク補完部１０８は、後条件が満たされていると判定し（ステップＡ３８におけるＹｅｓ）、ステップＡ３９に移行する。 Here, a link is generated between the specification 2 immediately after the specification i (i = 1) of the document 0 and the specification 3 of the document 1 (see FIGS. 9 and 7). The specification 3 of the document 1 is a specification after the specification j (j = 2) in the document 1. Accordingly, the link complementation unit 108 determines that the post-condition is satisfied (Yes in Step A38), and proceeds to Step A39.

ステップＡ３９において、リンク補完部１０８は、文書０の仕様ｉ（ｉ＝１）と、文書１の仕様ｊ（ｊ＝２）とを対応付け、その対応関係（リンク）をリンク記憶部１０３に記憶させる。このとき、そのリンクのリンク信頼度として、「△」をリンク記憶部１０３に記憶させる。この結果、リンク記憶部１０３に記憶される情報は、図１２に示すようになる。図１２に示す情報では、図７に示す情報と比較すると、文書０の仕様１と文書１の仕様２との対応関係（リンク）が追加され、そのリンクのリンク信頼度「△」も追加されている。 In step A39, the link complementation unit 108 associates the specification i (i = 1) of the document 0 with the specification j (j = 2) of the document 1, and stores the correspondence (link) in the link storage unit 103. Let At this time, “Δ” is stored in the link storage unit 103 as the link reliability of the link. As a result, the information stored in the link storage unit 103 is as shown in FIG. In the information shown in FIG. 12, compared with the information shown in FIG. 7, the correspondence (link) between the specification 1 of the document 0 and the specification 2 of the document 1 is added, and the link reliability “Δ” of the link is also added. ing.

次に、リンク補完部１０８は、ｊをインクリメントして（ステップＡ４０）、ステップＡ３４以降の処理を繰り返す。そして、ｉをインクリメントした場合には（ステップＡ３９）、ステップＡ３２以降の処理を繰り返す。そして、ｉ≧Ｎ_０となったならば（ステップＡ３２におけるＮｏ）、文書０の仕様に対するリンクの補完（ステップＡ７、図８参照）を終了する。Next, the link complementation part 108 increments j (step A40), and repeats the process after step A34. If i is incremented (step A39), the processing after step A32 is repeated. Then, if i ≧ N ₀ (No in step A32), link complementation for the specification of document 0 (step A7, see FIG. 8) is terminated.

上記の動作の後、ステップＡ８（図８参照）に移行した場合の動作の具体例を示す。ステップＡ８では、文書０と文書１とを置き換えて、ステップＡ７と同様の処理を行えばよい。すなわち、文書１の仕様ＩＤをｉとし、文書０の仕様ＩＤをｊとして、図１１に示す処理を行えばよい。このとき、Ｎ_０＝７、Ｎ_１＝５である。A specific example of the operation when the process proceeds to step A8 (see FIG. 8) after the above operation will be described. In step A8, document 0 and document 1 are replaced and the same processing as in step A7 may be performed. That is, the process shown in FIG. 11 may be performed with the specification ID of document 1 as i and the specification ID of document 0 as j. At this time, N ₀ = 7 and N ₁ = 5.

リンク補完部１０８は、文書１の仕様ＩＤを表すｉを０に初期化し（ステップＡ３１）、ｉ＜Ｎ_０であるので、文書０の仕様ＩＤを表すｊを０に初期化する（ステップＡ３３）。そして、ｊ＜Ｎ_１であるので、ステップＡ３５に移行する。このとき、ｉ＝０，ｊ＝０である。The link complementing unit 108 initializes i representing the specification ID of the document 1 to 0 (step A31), and since i <N ₀ , j representing the specification ID of the document 0 is initialized to 0 (step A33). . And, since it is j _{<N 1,} the process proceeds to step A35. At this time, i = 0 and j = 0.

ステップＡ３５において、文書１の仕様０にリンクが生成されているか否かを確認すると、文書１の仕様０には、文書０の仕様０との間にリンクが生成されている。従って、ステップＡ３９に移行し、リンク補完部１０８は、ｉをインクリメントし、ステップＡ３２，Ａ３３，Ａ３４の処理を行う。この結果、ｉ＝１，ｊ＝０となっている。 In step A 35, when it is confirmed whether or not a link is generated in the specification 0 of the document 1, a link is generated in the specification 0 of the document 1 and the specification 0 of the document 0. Accordingly, the process proceeds to step A39, where the link complementing unit 108 increments i and performs the processes of steps A32, A33, A34. As a result, i = 1 and j = 0.

次に、リンク補完部１０８は、ステップＡ３５において、文書１の仕様１にリンクが生成されているか否かを確認する。文書１の仕様１にはリンクが生成されていないため、ステップＡ３６に移行する。ここで、文書１の仕様１と文書０の仕様０とに関しては、２つの共通部分列「注文」、「文表」が存在していて、この２つの仕様間に関連性があると判定され、関連性計算部１０６が関連性値を０．２９と算出しているものとする。関連性値０．２５は第二の閾値（０．２５）以上であるので、リンク補完部１０８は、前条件が満たされているか否かを判定する（ステップＡ３７）。 Next, the link complementation part 108 confirms whether the link is produced | generated by the specification 1 of the document 1 in step A35. Since no link is generated in the specification 1 of the document 1, the process proceeds to step A36. Here, regarding the specification 1 of the document 1 and the specification 0 of the document 0, it is determined that there are two common subsequences “order” and “sentence table”, and the two specifications are related to each other. Assume that the relevance calculation unit 106 calculates the relevance value as 0.29. Since the relevance value 0.25 is greater than or equal to the second threshold value (0.25), the link complementation unit 108 determines whether or not the precondition is satisfied (step A37).

ここで、文書１の仕様ｉ（ｉ＝１）の１つ前の仕様０は、文書０の仕様０との間にリンクが生成されている。そして、文書０の仕様０は、文書０において仕様ｊ（ｊ＝０）以前の仕様である。従って、リンク補完部１０８は、前条件が満たされていると判定し（ステップＡ３７におけるＹｅｓ）、次に、後条件が満たされているか否かを判定する（ステップＡ３８）。 Here, a link is generated between the specification 0 immediately before the specification i (i = 1) of the document 1 and the specification 0 of the document 0. The specification 0 of the document 0 is a specification before the specification j (j = 0) in the document 0. Accordingly, the link complementation unit 108 determines that the precondition is satisfied (Yes in Step A37), and then determines whether the postcondition is satisfied (Step A38).

ここで、文書１の仕様ｉ（ｉ＝１）の１つ後の仕様２は、文書０の仕様１との間にリンクが生成されている。そして、文書０の仕様１は、文書０において仕様ｊ（ｊ＝０）以後の仕様である（図１２参照）。従って、リンク補完部１０８は、後条件が満たされていると判定し（ステップＡ３８におけるＹｅｓ）、ステップＡ３９に移行する。 Here, a link is generated between the specification 2 immediately after the specification i (i = 1) of the document 1 and the specification 1 of the document 0. The specification 1 of the document 0 is a specification after the specification j (j = 0) in the document 0 (see FIG. 12). Accordingly, the link complementation unit 108 determines that the post-condition is satisfied (Yes in Step A38), and proceeds to Step A39.

ステップＡ３９において、リンク補完部１０８は、文書１の仕様ｉ（ｉ＝１）と、文書０の仕様ｊ（ｊ＝０）とを対応付け、その対応関係（リンク）をリンク記憶部１０３に記憶させる。このとき、そのリンクのリンク信頼度として、「△」をリンク記憶部１０３に記憶させる。 In step A <b> 39, the link complementation unit 108 associates the specification i (i = 1) of the document 1 with the specification j (j = 0) of the document 0 and stores the correspondence (link) in the link storage unit 103. Let At this time, “Δ” is stored in the link storage unit 103 as the link reliability of the link.

次に、リンク補完部１０８は、ｊをインクリメントして（ステップＡ４０）、ステップＡ３４以降の処理を繰り返す。そして、ｉをインクリメントした場合には（ステップＡ３９）、ステップＡ３２以降の処理を繰り返す。そして、ｉ≧Ｎ_０となったならば（ステップＡ３２におけるＮｏ）、文書１の仕様に対するリンクの補完（図８に示すステップＡ８）を終了する。Next, the link complementation part 108 increments j (step A40), and repeats the process after step A34. If i is incremented (step A39), the processing after step A32 is repeated. If i ≧ N ₀ (No in step A32), the link complement to the specification of document 1 (step A8 shown in FIG. 8) is terminated.

以上のようにステップＡ８までの処理を終了したときの文書０と文書１における仕様同士のリンクは、図１３に示すようになる。図１３において、実線で示したリンクは、ステップＡ７よりも前にリンク生成部１０７によって生成されたリンクを表している。破線で示したリンクは、ステップＡ７，Ａ８においてリンク補完部１０８によって補完されたリンクを表している。 As described above, the links between the specifications in document 0 and document 1 when the processing up to step A8 is completed are as shown in FIG. In FIG. 13, a link indicated by a solid line represents a link generated by the link generation unit 107 before step A7. A link indicated by a broken line represents a link supplemented by the link complementation unit 108 in steps A7 and A8.

また、以上のように、ステップＡ８までの処理を終了したときには、リンク記憶部１０３には、図１４に示す情報が記憶されている。図１４が示す仕様同士の対応関係は、図１３で模式的に示した仕様同士の対応関係に相当する。 Further, as described above, when the processing up to step A8 is completed, the information shown in FIG. The correspondence between specifications shown in FIG. 14 corresponds to the correspondence between specifications schematically shown in FIG.

また、本発明の関連仕様対応付けシステムは、リンク記憶部１０３に記憶された仕様同士の対応関係を出力する出力装置を備えていてもよい。出力装置は、例えば、ディスプレイ装置や印刷装置であるが、他の出力装置であってもよい。そして、リンク記憶部１０３に記憶された仕様同士の対応関係を出力装置から出力する場合には、リンク生成部１０７によって生成されたリンク（すなわち、リンク信頼度が「○」となっているリンク）と、リンク補完部１０８によって補完されたリンク（すなわち、リンク信頼度が「△」となっているリンク）とを区別して表してもよい。例えば、図１３に示すように、リンク生成部１０７によって生成されたリンクを実線で表し、リンク補完部１０８によって補完されたリンクを破線で表すようにして、リンクを区別してもよい。 In addition, the related specification associating system of the present invention may include an output device that outputs a correspondence relationship between specifications stored in the link storage unit 103. The output device is, for example, a display device or a printing device, but may be another output device. And when outputting the correspondence of the specifications memorize | stored in the link memory | storage part 103 from an output device, the link produced | generated by the link production | generation part 107 (namely, link whose link reliability is "(circle)"). And a link supplemented by the link complementing unit 108 (that is, a link having a link reliability of “Δ”) may be distinguished from each other. For example, as illustrated in FIG. 13, the links generated by the link generation unit 107 may be represented by solid lines, and the links supplemented by the link complementation unit 108 may be represented by broken lines.

以上の説明では、文書間における仕様の対応付けについて説明したが、同一文書内のテキスト間における仕様の対応付けにも本発明を適用することができる。 In the above description, specification association between documents has been described. However, the present invention can also be applied to specification association between texts in the same document.

本発明では、仕様同士の関連性に関する指定を人手で行わなくても、複数の仕様を含むテキスト間において、リンク（トレーサビリティリンク）を生成することができる。例えば、仕様同士の関連性に関する語句を定める作業等を人手で行わなくても、リンクを生成できる。従って、人手による作業をできるだけ少なくして、複数の仕様を含むテキスト間において、関連する仕様同士を対応付けることができる。 In the present invention, a link (traceability link) can be generated between texts including a plurality of specifications without manually specifying the relationship between specifications. For example, a link can be generated without manually performing an operation for determining a phrase related to the relationship between specifications. Therefore, it is possible to associate related specifications with each other between texts including a plurality of specifications by minimizing manual work.

また、関連性値が第一の閾値以上でないことにより、リンク生成部１０７によってリンクが生成されなかった仕様に関しても、その仕様の前後のリンクの生成状況に応じて、リンクを補完することができる。すなわち、ある仕様に関して、前条件および後条件を満たしていれば、関連性値が低くても、他のテキストの仕様と対応付けることが適切であると推定して、リンクを補完することができる。 In addition, because the relevance value is not equal to or greater than the first threshold, the link can be supplemented according to the generation status of the links before and after the specification even if the link is not generated by the link generation unit 107. . That is, if a pre-condition and a post-condition are satisfied for a certain specification, the link can be supplemented by assuming that it is appropriate to associate with the specification of another text even if the relevance value is low.

語句変換部１０４、関連性有無判定部１０５、関連性計算部１０６、リンク生成部１０７およびリンク補完部１０８は、関連仕様対応付けプログラムに従って動作するコンピュータ（計算部１２）によって実現される。すなわち、コンピュータが、関連仕様対応付けプログラムを読み込み、そのプログラムに従って、語句変換部１０４、関連性有無判定部１０５、関連性計算部１０６、リンク生成部１０７およびリンク補完部１０８として動作すればよい。 The phrase conversion unit 104, the relevance determination unit 105, the relevance calculation unit 106, the link generation unit 107, and the link complementation unit 108 are realized by a computer (calculation unit 12) that operates according to a related specification association program. That is, the computer may read the related specification association program and operate as the phrase conversion unit 104, the relevance presence / absence determination unit 105, the relevance calculation unit 106, the link generation unit 107, and the link complementation unit 108 according to the program.

この関連仕様対応付けプログラムは、フロッピーディスク（登録商標）、ハードディスク等の磁気ディスク、ＣＤ−ＲＯＭ、ＤＶＤ（Digital Versatile Disk）等の光ディスク、ＭＯ（Magneto-Optical Disk：光磁気ディスク）、あるいは半導体メモリ等の記憶媒体に記憶させて頒布してもよい。また、これらの記憶媒体は例示であり、関連仕様対応付けプログラムを記憶させる媒体は、コンピュータがプログラムを読み取り可能な記憶媒体であればよい。 This related specification association program is a floppy disk (registered trademark), a magnetic disk such as a hard disk, an optical disk such as a CD-ROM or DVD (Digital Versatile Disk), an MO (Magneto-Optical Disk), or a semiconductor memory. It may be stored in such a storage medium and distributed. These storage media are examples, and the medium for storing the related specification association program may be any storage medium that can be read by the computer.

また、記憶媒体からコンピュータにインストールされたプログラムに従って、コンピュータ上で稼働しているオペレーティングシステムや、データベース管理ソフトウェア、ネットワークソフトウェア等のミドルウェアによって、各処理の一部を実行してもよい。 Further, a part of each process may be executed by middleware such as an operating system running on the computer, database management software, network software, or the like according to a program installed in the computer from the storage medium.

また、コンピュータは、ＬＡＮやインターネットを介して関連仕様対応付けプログラムをダウンロードし、そのプログラムを記憶、または一時記憶して、そのプログラムに従って動作してもよい。 Further, the computer may download the related specification association program via the LAN or the Internet, store the program or temporarily store the program, and operate according to the program.

また、関連仕様対応付けプログラムが複数の記憶媒体に分けて記憶され、コンピュータが複数の記憶媒体から関連仕様対応付けプログラムを読み込んでもよい。 Further, the related specification association program may be stored separately in a plurality of storage media, and the computer may read the related specification association program from the plurality of storage media.

また、コンピュータは、例えば、パーソナルコンピュータであっても、複数の装置がネットワークを介して接続されたシステムであってもよい。 The computer may be a personal computer or a system in which a plurality of devices are connected via a network.

また、コンピュータは、パーソナルコンピュータに限られず、情報処理機器に含まれる演算処理装置を含み、プログラムによって本発明の機能を実現可能な機器や装置であってもよい。 Further, the computer is not limited to a personal computer, and may be a device or device that includes an arithmetic processing device included in an information processing device and can realize the functions of the present invention by a program.

また、語句変換部１０４、関連性有無判定部１０５、関連性計算部１０６、リンク生成部１０７およびリンク補完部１０８がそれぞれ別のハードウェアで実現されていてもよい。 Further, the phrase conversion unit 104, the relevance presence / absence determination unit 105, the relevance calculation unit 106, the link generation unit 107, and the link complementation unit 108 may be realized by different hardware.

次に、本発明の最小構成について説明する。図１５は、本発明の関連仕様対応付けシステムの最小構成の例を示す説明図である。本発明の関連仕様対応付けシステムは、関連性有無判定手段７１と、関連性計算手段７２と、仕様対応付け手段７３とを備える。 Next, the minimum configuration of the present invention will be described. FIG. 15 is an explanatory diagram showing an example of the minimum configuration of the related specification association system of the present invention. The related specification associating system of the present invention includes relevance presence / absence determining means 71, relevance calculating means 72, and specification associating means 73.

関連性有無判定手段７１（例えば、関連性有無判定部１０５）は、順序付けられた複数の仕様を含む２つのテキスト（例えば、文書０および文書１）から、互いに異なるテキストに属する２つの仕様の組を選択し、選択した仕様の組毎に、２つの仕様に共通に出現する所定の長さの文字列である共通文字列（例えば、共通部分列）を抽出し、当該共通文字列の抽出結果に基づいて、その２つの仕様の関連性の有無を判定する。 Relevance presence / absence determination means 71 (for example, relevance presence / absence determination unit 105) is a set of two specifications belonging to different texts from two texts (for example, document 0 and document 1) including a plurality of ordered specifications. And for each selected set of specifications, extract a common character string (for example, a common substring) that is a character string of a predetermined length that appears in common in the two specifications, and extract the common character string Based on the above, it is determined whether or not the two specifications are related.

関連性計算手段７２（例えば、関連性計算部１０６）は、互いに異なるテキストに属する２つの仕様の組毎に、２つの仕様に関連性がないと判定されている場合には、２つの仕様の関連性の強さを示す関連性値を所定の値（例えば、０）に定め、２つの仕様に関連性があると判定されている場合には、２つの仕様それぞれの可逆圧縮サイズと、２つの仕様の文字列を連結した文字列の可逆圧縮サイズとに基づいて、関連性値を算出する。 When it is determined that the two specifications are not related for each set of two specifications belonging to different texts, the relevance calculation unit 72 (for example, the relevance calculation unit 106) When the relevance value indicating the strength of relevance is set to a predetermined value (for example, 0) and it is determined that the two specifications are related, the reversible compression size of each of the two specifications and 2 A relevance value is calculated based on a lossless compression size of a character string obtained by concatenating character strings of two specifications.

仕様対応付け手段７３（例えば、リンク生成部１０７）は、関連性値が所定の条件を満たす場合（例えば、第一の閾値以上である場合）に、関連性値の計算対象となった２つの仕様を対応付ける。 The specification associating unit 73 (for example, the link generation unit 107), when the relevance value satisfies a predetermined condition (for example, when the relevance value is equal to or greater than the first threshold value), Associate the specifications.

そのような構成により、人手による作業をできるだけ少なくして、複数の仕様を含むテキスト間において、関連する仕様同士を対応付けることができる。 With such a configuration, it is possible to associate related specifications between texts including a plurality of specifications while minimizing manual work as much as possible.

上記の実施形態の一部または全部は、以下の付記のようにも記載され得るが、以下には限られない。 A part or all of the above-described embodiment can be described as in the following supplementary notes, but is not limited thereto.

（付記１）順序付けられた複数の仕様を含む２つのテキストから、互いに異なるテキストに属する２つの仕様の組を選択し、選択した仕様の組毎に、２つの仕様に共通に出現する所定の長さの文字列である共通文字列を抽出し、当該共通文字列の抽出結果に基づいて、前記２つの仕様の関連性の有無を判定する関連性有無判定手段と、互いに異なるテキストに属する２つの仕様の組毎に、２つの仕様に関連性がないと判定されている場合には、２つの仕様の関連性の強さを示す関連性値を所定の値に定め、２つの仕様に関連性があると判定されている場合には、２つの仕様それぞれの可逆圧縮サイズと、２つの仕様の文字列を連結した文字列の可逆圧縮サイズとに基づいて、関連性値を算出する関連性計算手段と、前記関連性値が所定の条件を満たす場合に、前記関連性値の計算対象となった２つの仕様を対応付ける仕様対応付け手段とを備えることを特徴とする関連仕様対応付けシステム。 (Supplementary note 1) From two texts including a plurality of ordered specifications, a set of two specifications belonging to different texts is selected, and a predetermined length that appears in common in the two specifications for each selected specification set A common character string that is the same character string, and a relevance presence / absence determining unit that determines the relevance of the two specifications based on the extraction result of the common character string; If it is determined that there is no relevance between two specifications for each set of specifications, a relevance value indicating the strength of relevance between the two specifications is defined as a predetermined value, and the relevance between the two specifications If it is determined that there is a relevance value, the relevance value is calculated based on the lossless compression size of each of the two specifications and the lossless compression size of the character string obtained by concatenating the character strings of the two specifications. Means and the relevance value is a predetermined condition If the relevant specification associating system, characterized in that it comprises a specification associating means for associating the two specifications was the calculation target of the relevance value satisfying.

（付記２）関連性有無判定手段は、２つの仕様から抽出した共通文字列の数が予め定めた数以上である場合に、前記２つの仕様に関連性があると判定し、前記共通文字列の数が予め定めた数未満である場合に、前記２つの仕様に関連性がないと判定する付記１に記載の関連仕様対応付けシステム。 (Supplementary Note 2) When the number of common character strings extracted from two specifications is greater than or equal to a predetermined number, the relevance presence / absence determining means determines that the two specifications are related, and the common character string The related specification associating system according to supplementary note 1, wherein the two specifications are determined not to be related when the number is less than a predetermined number.

（付記３）関連性有無判定手段は、２つの仕様の文字列の長さの和に対する、抽出した共通文字列の数の割合が、予め定めた割合以上である場合に、前記２つの仕様に関連性があると判定し、２つの仕様の文字列の長さの和に対する、抽出した共通文字列の数の割合が、予め定めた割合未満である場合に、前記２つの仕様に関連性がないと判定する付記１に記載の関連仕様対応付けシステム。 (Supplementary Note 3) When the ratio of the number of extracted common character strings to the sum of the lengths of the character strings of the two specifications is equal to or greater than a predetermined ratio, When the ratio of the number of extracted common character strings to the sum of the lengths of the character strings of the two specifications is less than a predetermined ratio, the two specifications are related. The related specification matching system according to Supplementary Note 1, wherein it is determined that there is no.

（付記４）関連性有無判定手段は、共通文字列に対して予め定められた条件を満たす共通文字列を抽出する付記１から付記３のうちのいずれかに記載の関連仕様対応付けシステム。 (Supplementary note 4) The related specification associating system according to any one of supplementary notes 1 to 3, wherein the relevance presence / absence determining unit extracts a common character string that satisfies a predetermined condition with respect to the common character string.

（付記５）仕様対応付け手段は、関連性値が予め定められた対応付け判定用閾値以上であることを条件に、関連性値の計算対象となった２つの仕様を対応付ける付記１から付記４のうちのいずれかに記載の関連仕様対応付けシステム。 (Supplementary Note 5) The specification associating means associates the two specifications that are the calculation targets of the relevance value on the condition that the relevance value is equal to or greater than a predetermined association determination threshold. The related specification matching system according to any one of the above.

（付記６）未だ対応付けられていない仕様より前の順番の仕様に関する対応付け結果と、前記未だ対応付けられていない仕様より後の順番の仕様に関する対応付け結果とに基づいて、前記未だ対応付けられていない仕様と、当該仕様とは異なるテキストに属する仕様とを対応付ける仕様対応付け補完手段とを備える付記１から付記５のうちのいずれかに記載の関連仕様対応付けシステム。 (Additional remark 6) Based on the matching result regarding the specification of the order before the specification which is not yet matched, and the matching result regarding the specification of the order after the specification which is not yet matched, it is still the correspondence The related specification associating system according to any one of supplementary notes 1 to 5, further comprising specification correspondence complementing means for associating a specification that is not made and a specification belonging to a text different from the specification.

（付記７）仕様対応付け補完手段は、未だ対応付けられていない仕様であって一方のテキストに属する第一の仕様と、当該第一の仕様とは異なるテキストに属する第二の仕様との間に、前記第一の仕様と前記第二の仕様の関連性値が補完用閾値以上であり、かつ、前記第一の仕様が属するテキスト内で前記第一の仕様より前の順番の仕様と、前記第二の仕様が属するテキスト内で前記第二の仕様以前の順番の仕様とが対応付けられていて、前記第一の仕様が属するテキスト内で前記第一の仕様より後の順番の仕様と、前記第二の仕様が属するテキスト内で前記第二の仕様以後の順番の仕様とが対応付けられているという関係がある場合に、前記第一の仕様と前記第二の仕様とを対応付ける付記６に記載の関連仕様対応付けシステム。 (Supplementary note 7) The specification correspondence complementing unit is a specification that has not yet been matched and is between a first specification belonging to one text and a second specification belonging to a text different from the first specification. In addition, the relevance value between the first specification and the second specification is not less than a complementary threshold, and the specification in the order before the first specification in the text to which the first specification belongs, A specification in the order before the second specification is associated with the specification in the text to which the second specification belongs, and the specification in the order after the first specification in the text to which the first specification belongs. And an additional note that associates the first specification with the second specification when there is a relationship that the second specification and the subsequent specification are associated in the text to which the second specification belongs. 6. The related specification matching system according to 6.

（付記８）各仕様に含まれる語句を所定の変換ルールに従って変換する語句変換手段を備え、関連性有無判定手段は、変換後の仕様の組毎に、２つの仕様の関連性の有無を判定する付記１から付記７のうちのいずれかに記載の関連仕様対応付けシステム。 (Additional remark 8) It is provided with the phrase conversion means which converts the phrase contained in each specification according to a predetermined conversion rule, and a relevance presence determination means determines the presence or absence of relevance of two specifications for every set of the specifications after conversion. The related specification matching system according to any one of Appendix 1 to Appendix 7.

（付記９）順序付けられた複数の仕様を含む２つのテキストから、互いに異なるテキストに属する２つの仕様の組を選択し、選択した仕様の組毎に、２つの仕様に共通に出現する所定の長さの文字列である共通文字列を抽出し、当該共通文字列の抽出結果に基づいて、前記２つの仕様の関連性の有無を判定し、互いに異なるテキストに属する２つの仕様の組毎に、２つの仕様に関連性がないと判定されている場合には、２つの仕様の関連性の強さを示す関連性値を所定の値に定め、２つの仕様に関連性があると判定されている場合には、２つの仕様それぞれの可逆圧縮サイズと、２つの仕様の文字列を連結した文字列の可逆圧縮サイズとに基づいて、関連性値を算出し、前記関連性値が所定の条件を満たす場合に、前記関連性値の計算対象となった２つの仕様を対応付けることを特徴とする関連仕様対応付け方法。 (Supplementary note 9) A pair of two specifications belonging to different texts is selected from two texts including a plurality of ordered specifications, and a predetermined length that appears in common in the two specifications for each selected specification set The common character string that is the character string is extracted, and based on the extraction result of the common character string, the presence or absence of relevance between the two specifications is determined, and for each set of two specifications belonging to different texts, If it is determined that there is no relevance between the two specifications, a relevance value indicating the strength of relevance between the two specifications is set to a predetermined value, and it is determined that the two specifications are related. In the case where there are two specifications, the relevance value is calculated based on the reversible compression size of each of the two specifications and the reversible compression size of the character string obtained by concatenating the character strings of the two specifications, If the relationship value is satisfied, Related Specifications mapping method characterized by associating two specifications Tsu.

（付記１０）２つの仕様から抽出した共通文字列の数が予め定めた数以上である場合に、前記２つの仕様に関連性があると判定し、前記共通文字列の数が予め定めた数未満である場合に、前記２つの仕様に関連性がないと判定する付記９に記載の関連仕様対応付け方法。 (Supplementary Note 10) When the number of common character strings extracted from two specifications is equal to or greater than a predetermined number, it is determined that the two specifications are related, and the number of common character strings is a predetermined number. The related specification associating method according to appendix 9, wherein the two specifications are determined to be unrelated when the number is less than

（付記１１）２つの仕様の文字列の長さの和に対する、抽出した共通文字列の数の割合が、予め定めた割合以上である場合に、前記２つの仕様に関連性があると判定し、２つの仕様の文字列の長さの和に対する、抽出した共通文字列の数の割合が、予め定めた割合未満である場合に、前記２つの仕様に関連性がないと判定する付記９に記載の関連仕様対応付け方法。 (Supplementary Note 11) When the ratio of the number of extracted common character strings to the sum of the lengths of the character strings of the two specifications is equal to or greater than a predetermined ratio, it is determined that the two specifications are related. Supplementary note 9 that determines that the two specifications are not relevant when the ratio of the number of extracted common character strings to the sum of the lengths of the character strings of the two specifications is less than a predetermined ratio The related specification matching method described.

（付記１２）共通文字列に対して予め定められた条件を満たす共通文字列を抽出する付記９から付記１１のうちのいずれかに記載の関連仕様対応付け方法。 (Additional remark 12) The related specification matching method in any one of Additional remark 9 to Additional remark 11 which extracts the common character string which satisfy | fills a predetermined condition with respect to a common character string.

（付記１３）関連性値が予め定められた対応付け判定用閾値以上であることを条件に、関連性値の計算対象となった２つの仕様を対応付ける付記９から付記１２のうちのいずれかに記載の関連仕様対応付け方法。 (Supplementary note 13) Any one of Supplementary note 9 to Supplementary note 12 for associating the two specifications that are the calculation target of the relevance value on condition that the relevance value is equal to or greater than a predetermined threshold value for association determination The related specification matching method described.

（付記１４）未だ対応付けられていない仕様より前の順番の仕様に関する対応付け結果と、前記未だ対応付けられていない仕様より後の順番の仕様に関する対応付け結果とに基づいて、前記未だ対応付けられていない仕様と、当該仕様とは異なるテキストに属する仕様とを対応付ける付記９から付記１３のうちのいずれかに記載の関連仕様対応付け方法。 (Additional remark 14) Based on the matching result regarding the specification of the order before the specification which is not yet matched, and the matching result regarding the specification of the order after the specification which has not been matched yet, The related specification associating method according to any one of supplementary note 9 to supplementary note 13, wherein a specification that has not been assigned is associated with a specification that belongs to a text different from the specification.

（付記１５）未だ対応付けられていない仕様であって一方のテキストに属する第一の仕様と、当該第一の仕様とは異なるテキストに属する第二の仕様との間に、前記第一の仕様と前記第二の仕様の関連性値が補完用閾値以上であり、かつ、前記第一の仕様が属するテキスト内で前記第一の仕様より前の順番の仕様と、前記第二の仕様が属するテキスト内で前記第二の仕様以前の順番の仕様とが対応付けられていて、前記第一の仕様が属するテキスト内で前記第一の仕様より後の順番の仕様と、前記第二の仕様が属するテキスト内で前記第二の仕様以後の順番の仕様とが対応付けられているという関係がある場合に、前記第一の仕様と前記第二の仕様とを対応付ける付記１４に記載の関連仕様対応付け方法。 (Supplementary Note 15) Between the first specification belonging to one text that is not yet associated and the second specification belonging to a text different from the first specification, the first specification And the second specification is greater than or equal to a complementary threshold value, and the specification in the order before the first specification and the second specification belong in the text to which the first specification belongs. The specification in the order before the second specification is associated in the text, the specification in the order after the first specification in the text to which the first specification belongs, and the second specification The related specification correspondence according to appendix 14, wherein the first specification and the second specification are associated with each other when there is a relationship that the specification in the order after the second specification is associated in the text to which it belongs. Attaching method.

（付記１６）各仕様に含まれる語句を所定の変換ルールに従って変換し、変換後の仕様の組毎に、２つの仕様の関連性の有無を判定する付記９から付記１５のうちのいずれかに記載の関連仕様対応付け方法。 (Supplementary note 16) Any one of Supplementary note 9 to Supplementary note 15 for converting a word / phrase included in each specification according to a predetermined conversion rule and determining whether or not the two specifications are related to each other after the conversion is performed. The related specification matching method described.

（付記１７）コンピュータに、順序付けられた複数の仕様を含む２つのテキストから、互いに異なるテキストに属する２つの仕様の組を選択し、選択した仕様の組毎に、２つの仕様に共通に出現する所定の長さの文字列である共通文字列を抽出し、当該共通文字列の抽出結果に基づいて、前記２つの仕様の関連性の有無を判定する関連性有無判定処理、互いに異なるテキストに属する２つの仕様の組毎に、２つの仕様に関連性がないと判定されている場合には、２つの仕様の関連性の強さを示す関連性値を所定の値に定め、２つの仕様に関連性があると判定されている場合には、２つの仕様それぞれの可逆圧縮サイズと、２つの仕様の文字列を連結した文字列の可逆圧縮サイズとに基づいて、関連性値を算出する関連性計算処理、および、前記関連性値が所定の条件を満たす場合に、前記関連性値の計算対象となった２つの仕様を対応付ける仕様対応付け処理を実行させるための関連仕様対応付けプログラム。 (Supplementary Note 17) Two sets of specifications belonging to different texts are selected from two texts including a plurality of ordered specifications on a computer, and appear in common in the two specifications for each selected set of specifications. Relevance presence / absence determination processing for extracting a common character string that is a character string of a predetermined length and determining the relevance of the two specifications based on the extraction result of the common character string, belonging to different texts If it is determined that there is no relationship between the two specifications for each set of two specifications, a relevance value indicating the strength of the relationship between the two specifications is set to a predetermined value, and the two specifications are When it is determined that there is a relationship, a relationship value is calculated based on the lossless compression size of each of the two specifications and the lossless compression size of the character string obtained by concatenating the character strings of the two specifications. Sex calculation processing and before If relevance value satisfies a predetermined condition related specifications correspondence program for executing the specification associating process for associating two specifications it became an object of calculation of the relevance value.

（付記１８）コンピュータに、関連性有無判定処理で、２つの仕様から抽出した共通文字列の数が予め定めた数以上である場合に、前記２つの仕様に関連性があると判定させ、前記共通文字列の数が予め定めた数未満である場合に、前記２つの仕様に関連性がないと判定させる付記１７に記載の関連仕様対応付けプログラム。 (Supplementary Note 18) When the number of common character strings extracted from two specifications is equal to or greater than a predetermined number in the relevance determination process, the computer determines that the two specifications are related, and The related specification association program according to supplementary note 17, wherein when the number of common character strings is less than a predetermined number, the two specifications are determined not to be related.

（付記１９）コンピュータに、関連性有無判定処理で、２つの仕様の文字列の長さの和に対する、抽出した共通文字列の数の割合が、予め定めた割合以上である場合に、前記２つの仕様に関連性があると判定させ、２つの仕様の文字列の長さの和に対する、抽出した共通文字列の数の割合が、予め定めた割合未満である場合に、前記２つの仕様に関連性がないと判定させる付記１７に記載の関連仕様対応付けプログラム。 (Additional remark 19) When the ratio of the number of the extracted common character string with respect to the sum of the length of the character string of two specifications is more than a predetermined ratio by the relevance existence determination processing, it is said 2 If the ratio of the number of extracted common character strings to the sum of the lengths of the character strings of the two specifications is less than a predetermined ratio, it is determined that the two specifications are related. The related specification associating program according to appendix 17, which determines that there is no relevance.

（付記２０）コンピュータに、関連性有無判定処理で、共通文字列に対して予め定められた条件を満たす共通文字列を抽出させる付記１７から付記１９のうちのいずれかに記載の関連仕様対応付けプログラム。 (Supplementary note 20) The related specification associating according to any one of supplementary note 17 to supplementary note 19, which causes the computer to extract a common character string satisfying a predetermined condition for the common character string in the relevance existence determination process. program.

（付記２１）コンピュータに、仕様対応付け処理で、関連性値が予め定められた対応付け判定用閾値以上であることを条件に、関連性値の計算対象となった２つの仕様を対応付けさせる付記１７から付記２０のうちのいずれかに記載の関連仕様対応付けプログラム。 (Supplementary Note 21) In the specification association process, the computer associates the two specifications that are the calculation target of the relevance value on condition that the relevance value is equal to or greater than a predetermined association determination threshold. The related specification association program according to any one of Supplementary Note 17 to Supplementary Note 20.

（付記２２）コンピュータに、未だ対応付けられていない仕様より前の順番の仕様に関する対応付け結果と、前記未だ対応付けられていない仕様より後の順番の仕様に関する対応付け結果とに基づいて、前記未だ対応付けられていない仕様と、当該仕様とは異なるテキストに属する仕様とを対応付け仕様対応付け補完処理を実行させる付記１７から付記２１のうちのいずれかに記載の関連仕様対応付けプログラム。 (Supplementary Note 22) Based on the association result regarding the specification in the order before the specification not yet associated with the computer, and the association result regarding the specification in the order after the specification not yet associated, The related specification associating program according to any one of supplementary note 17 to supplementary note 21, which executes a correspondence specification correspondence complementing process for a specification that has not been associated yet and a specification that belongs to a text different from the specification.

（付記２３）コンピュータに、仕様対応付け補完処理で、未だ対応付けられていない仕様であって一方のテキストに属する第一の仕様と、当該第一の仕様とは異なるテキストに属する第二の仕様との間に、前記第一の仕様と前記第二の仕様の関連性値が補完用閾値以上であり、かつ、前記第一の仕様が属するテキスト内で前記第一の仕様より前の順番の仕様と、前記第二の仕様が属するテキスト内で前記第二の仕様以前の順番の仕様とが対応付けられていて、前記第一の仕様が属するテキスト内で前記第一の仕様より後の順番の仕様と、前記第二の仕様が属するテキスト内で前記第二の仕様以後の順番の仕様とが対応付けられているという関係がある場合に、前記第一の仕様と前記第二の仕様とを対応付けさせる付記２２に記載の関連仕様対応付けプログラム。 (Supplementary Note 23) A first specification belonging to one text and a second specification belonging to a text different from the first specification, which has not yet been associated with the computer in the specification correspondence complement processing. Between the first specification and the second specification, the relevance value is greater than or equal to a complementary threshold value, and the first specification has an order before the first specification in the text to which the first specification belongs. The specification is associated with the specification in the order before the second specification in the text to which the second specification belongs, and the order after the first specification in the text to which the first specification belongs And the second specification are related to each other in the text to which the second specification belongs. Related specification pair described in appendix 22 Give program.

（付記２４）コンピュータに、各仕様に含まれる語句を所定の変換ルールに従って変換する語句変換処理を実行させ、関連性有無判定処理で、語句変換処理後の仕様の組毎に、２つの仕様の関連性の有無を判定する付記１７から付記２３のうちのいずれかに記載の関連仕様対応付けプログラム。 (Supplementary Note 24) Causes a computer to execute a phrase conversion process for converting a phrase included in each specification according to a predetermined conversion rule. In the relevance determination process, for each set of specifications after the phrase conversion process, two specifications The related specification associating program according to any one of supplementary notes 17 to 23 for determining the presence or absence of relevance.

以上、実施形態を参照して本願発明を説明したが、本願発明は上記実施形態に限定されるものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。例えば、実施形態に示した全構成要素から一部の構成要素を削除してもよい。また、他の構成要素を組み合わせてもよい。 While the present invention has been described with reference to the embodiments, the present invention is not limited to the above embodiments. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention. For example, some components may be deleted from all the components shown in the embodiment. Moreover, you may combine another component.

この出願は、２０１１年３月１６日に出願された日本特許出願２０１１−０５８３１４を基礎とする優先権を主張し、その開示の全てをここに取り込む。 This application claims the priority on the basis of the JP Patent application 2011-058314 for which it applied on March 16, 2011, and takes in those the indications of all here.

本発明は、テキスト内に記述されている仕様同士を対応付ける関連仕様対応付けシステムに好適に適用される。そして、仕様同士を対応付けた結果は、仕様の後工程への反映漏れ検出、整合性確認、仕様変更時の影響範囲の把握等に活用可能である。 The present invention is suitably applied to a related specification associating system for associating specifications described in text. The result of associating the specifications with each other can be used for detecting a reflection omission in the subsequent process of the specifications, confirming the consistency, and grasping the range of influence at the time of changing the specifications.

１１記憶部
１２計算部
１００初期仕様記憶部
１０１変換語句記憶部
１０２変換仕様記憶部
１０３リンク記憶部
１０４語句変換部
１０５関連性有無判定部
１０６関連性計算部
１０７リンク生成部
１０８リンク補完部DESCRIPTION OF SYMBOLS 11 Memory | storage part 12 Calculation part 100 Initial specification memory | storage part 101 Conversion word memory | storage part 102 Conversion specification memory | storage part 103 Link memory | storage part 104 Word / phrase conversion part 105 Relevance existence determination part 106 Relevance calculation part 107 Link generation part 108 Link complementation part

Claims

Selecting two sets of text or et two specifications, based on common string that appears in common to the two specifications, the relevance existence determining means determines the presence or absence of association of the two specifications,
If it is determined to be relevant to the two specifications, the two specifications each lossless compressed size, strings of the two specifications based on the lossless compression size of the string linked, the relevance value Relevance calculation means to calculate,
Related specification associating system, characterized in that it comprises a specification correlating means determines whether associating two specifications.

The relevance determination unit determines that the two specifications are related when the number of common character strings appearing in common in the two specifications is equal to or greater than a predetermined number, and the number of the common character strings The related specification associating system according to claim 1, wherein when the number is less than a predetermined number, the two specifications are determined not to be related.

When the ratio of the number of common character strings that appear in common in the two specifications with respect to the sum of the lengths of the character strings in the two specifications is equal to or greater than a predetermined ratio, determined to be relevant to the specification, to the length sum of the strings of the two specifications, the ratio of the number of the common strings, if it is less than the rate a predetermined, relevant to the two specifications The related specification matching system according to claim 1, wherein it is determined that there is not.

The related specification matching system according to any one of claims 1 to 3, wherein the relevance presence / absence determination unit extracts a common character string that satisfies a predetermined condition with respect to the common character string.

The specification associating means associates the two specifications that are subject to calculation of the relevance value on condition that the relevance value is equal to or greater than a predetermined association determination threshold value. The related specification matching system according to any one of the above.

The relevance determination means selects two sets of specifications from two texts including a plurality of ordered specifications,
The specifications that are not yet associated based on the association results for the specifications in the order before the specifications that are not yet associated and the association results for the specifications that are in the order after the specifications that are not yet associated The related specification matching system according to any one of claims 1 to 5, further comprising: specification matching complementing means that matches a specification belonging to a text different from the specification.

The specification correspondence complementing means is a specification that has not yet been associated and is between the first specification belonging to one text and the second specification belonging to a text different from the first specification. A relevance value between one specification and the second specification is equal to or greater than a complementary threshold, and the specification in the order before the first specification in the text to which the first specification belongs; and the second specification The specification in the order before the second specification is associated with the specification in the text to which the specification belongs, the specification in the order after the first specification in the text to which the first specification belongs, and the second 7. The first specification and the second specification are associated with each other when there is a relationship that the specification in the order after the second specification is associated in the text to which the specification belongs. Related specification mapping system.

Comprising word conversion means for converting words included in each specification according to a predetermined conversion rule;
The related specification associating system according to any one of claims 1 to 7, wherein the presence / absence determining unit determines whether or not two specifications are related for each set of converted specifications.

Selecting two sets of text or et two specifications, based on common string that appears in common to the two specifications, to determine the presence or absence of association of the two specifications,
If it is determined to be relevant to the two specifications, the two specifications each lossless compressed size, strings of the two specifications based on the lossless compression size of the string linked, the relevance value Calculate
It is determined whether or not two specifications are associated with each other .

On the computer,
Selecting two sets of text or et two specifications, based on common string that appears in common to the two specifications, relevance existence determining process of determining whether the relatedness of the two specifications,
If it is determined to be relevant to the two specifications, the two specifications each lossless compressed size, strings of the two specifications based on the lossless compression size of the string linked, the relevance value Relevance calculation processing to be calculated, and
A related specification association program for executing a specification association process for determining whether or not two specifications are associated with each other .