JP7748337B2

JP7748337B2 - Document instruction extraction system, document instruction extraction method, and document instruction extraction program

Info

Publication number: JP7748337B2
Application number: JP2022093076A
Authority: JP
Inventors: みゆき川田
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2022-06-08
Filing date: 2022-06-08
Publication date: 2025-10-02
Anticipated expiration: 2042-06-08
Also published as: JP2023180030A

Description

本発明は、例えば、工業製品の設計における仕様書等の文書に記載された指示内容を抽出して処理する技術に関する。 The present invention relates to technology for extracting and processing instructions contained in documents such as specifications for industrial product design.

工業製品の設計・製造に係る品質向上とコスト低減のため、設計現場では、過去の仕様書の内容から、設計変更が繰り返された製品部分や、形状の特徴、変更の種類等の傾向の分析が行われる。この分析では、製品に関する知識のある人が仕様書に記載された指示内容の全てに目を通し、意味を読み取って分類する作業が必要であり、多大な時間を要する。 To improve quality and reduce costs in the design and manufacturing of industrial products, design teams analyze past specifications to identify trends in product parts that have undergone repeated design changes, shape characteristics, and the types of changes. This analysis requires someone with product knowledge to read through all of the instructions in the specifications, interpret their meaning, and classify them, which takes a significant amount of time.

医療分野の文書を対象とした分析に関しては、電子カルテやレポートシステム等の医療所見文書を対象とし、カテゴリとキーワードを定義したルールにより自由記述文からプロブレムを特定する技術が知られている（例えば、特許文献１参照）。 With regard to the analysis of documents in the medical field, there is known technology that targets medical observation documents such as electronic medical records and report systems, and identifies problems from free-form text using rules that define categories and keywords (see, for example, Patent Document 1).

特開２００９－２４５２３２号公報Japanese Patent Application Laid-Open No. 2009-245232

例えば、特許文献１の方法では、医療分野を対象に、プロブレムに対してカテゴリとキーワードからなる判定ルールを構築し、エディタで逐次編集・拡張していく形となっている。医療分野では、医学用語は医療の現場が異なっていても比較的統一されているため、このような方法で普遍的なモデルを拡張していくことができる。 For example, the method described in Patent Document 1 targets the medical field, constructing judgment rules consisting of categories and keywords for problems, and then sequentially editing and expanding them using an editor. In the medical field, medical terminology is relatively standardized across different medical settings, so this method makes it possible to expand a universal model.

一方、工業製品の設計・製造分野では、対象物の形状や部品構成は千差万別であり、製品型式や部品名などの固有名詞が多く存在し、設計工程や製造工程が多くの組織にわたり、製品の変化のサイクルも早い傾向にある。このため、仕様書等の文書に記述された指示内容が何を意味するかは、設計現場ごと、製品ごと等で、使用される用語が異なっていたり、同一の用語が全く違う意味で使用されたりする場合もある。また、多くの組織等が関与するので、工程によって仕様書等の構成が異なり、仕様書における指示内容の記載位置や、指示内容の記載方法も多岐にわたる場合がある。このようなことは、工業製品の設計・製造分野以外の文書においても同様な場合がある。 On the other hand, in the fields of industrial product design and manufacturing, the shapes and component configurations of objects vary enormously, there are many proper nouns such as product models and component names, the design and manufacturing processes span many organizations, and the cycle of product change tends to be rapid. For this reason, the meaning of the instructions written in documents such as specifications may vary depending on the design site or product, and the same terms may be used with completely different meanings. Furthermore, because many organizations are involved, the structure of specifications and other documents may differ depending on the process, and the location and method of writing instructions in specifications may also vary widely. This can also be true for documents outside the fields of industrial product design and manufacturing.

本発明は、上記事情に鑑みなされたものであり、その目的は、文書に含まれる指示内容を適切に抽出でき、指示内容に含まれる語句に対応する代表となる語である代表語を適切に特定することのできる技術を提供することにある。 The present invention was developed in consideration of the above circumstances, and its purpose is to provide technology that can properly extract instructions contained in a document and properly identify representative words that correspond to the words and phrases contained in the instructions.

上記目的を達成するため、一観点に係る文書内指示抽出システムは、指示内容が含まれる文書から指示内容を抽出し、前記指示内容に含まれる語句に対応する、代表となる語である代表語を特定する文書内指示抽出システムであって、前記文書内指示抽出システムは、プロセッサと、記憶部とを有し、前記記憶部は、１以上の前記文書と、前記文書から前記指示内容を特定するための指示内容特定用情報と、前記指示内容から前記代表語を特定するための代表語特定情報と、を格納し、前記プロセッサは、前記指示内容特定用情報に基づいて、前記文書から指示内容を特定し、前記代表語特定情報に基づいて、前記指示内容から代表語を特定する。 To achieve the above-mentioned objective, one aspect of the intra-document instruction extraction system is an intra-document instruction extraction system that extracts instruction content from a document containing the instruction content and identifies a representative word that corresponds to a phrase included in the instruction content. The intra-document instruction extraction system has a processor and a memory unit. The memory unit stores one or more of the documents, instruction content identification information for identifying the instruction content from the document, and representative word identification information for identifying the representative word from the instruction content. The processor identifies the instruction content from the document based on the instruction content identification information, and identifies a representative word from the instruction content based on the representative word identification information.

本発明によれば、文書に含まれる指示内容を適切に抽出でき、指示内容に含まれる語句に対応する代表語を適切に特定することができる。 This invention makes it possible to properly extract instructions contained in a document and properly identify representative words corresponding to the words contained in the instructions.

図１は、第１実施形態に係る製品の一例の構成図である。FIG. 1 is a diagram illustrating an example of a product according to the first embodiment. 図２は、第１実施形態に係る仕様書の一例の構成図である。FIG. 2 is a diagram illustrating an example of a specification according to the first embodiment. 図３は、第１実施形態に係る文書内指示抽出システムの全体構成図である。FIG. 3 is a diagram showing the overall configuration of the document instruction extraction system according to the first embodiment. 図４は、第１実施形態に係る文書内指示抽出システムの機能構成図である。FIG. 4 is a functional configuration diagram of the document instruction extraction system according to the first embodiment. 図５は、第１実施形態に係るメタ情報手掛り語辞書の構成図である。FIG. 5 is a diagram showing the configuration of the meta information cue word dictionary according to the first embodiment. 図６は、第１実施形態に係る指示文手掛り語辞書の構成図である。FIG. 6 is a diagram showing the configuration of the instruction sentence cue word dictionary according to the first embodiment. 図７は、第１実施形態に係る設計対象に関する代表語辞書の構成図である。FIG. 7 is a diagram showing the configuration of a representative word dictionary relating to a design object according to the first embodiment. 図８は、第１実施形態に係るＣＡＤ形状に関する代表語辞書の構成図である。FIG. 8 is a diagram showing the configuration of a representative word dictionary relating to CAD shapes according to the first embodiment. 図９は、第１実施形態に係る変更の種類に関する代表語辞書の構成図である。FIG. 9 is a diagram showing the configuration of a representative word dictionary relating to types of changes according to the first embodiment. 図１０は、第１実施形態に係る指示文抽出・代表語特定処理のフローチャートである。FIG. 10 is a flowchart of the instruction sentence extraction and group word identification process according to the first embodiment. 図１１は、第１実施形態に係るメタ情報・指示文抽出処理のフローチャートである。FIG. 11 is a flowchart of the meta information/instruction sentence extraction process according to the first embodiment. 図１２は、第１実施形態に係る指示文分割処理のフローチャートである。FIG. 12 is a flowchart of the directive statement division process according to the first embodiment. 図１３は、第１実施形態に係る代表語特定処理のフローチャートである。FIG. 13 is a flowchart of the group word specification process according to the first embodiment. 図１４は、第１実施形態に係る判定処理のフローチャートである。FIG. 14 is a flowchart of the determination process according to the first embodiment. 図１５は、第１実施形態に係る判定サブプログラム処理の一例のフローチャートである。FIG. 15 is a flowchart of an example of a determination subprogram process according to the first embodiment. 図１６は、第１実施形態に係る判定サブプログラム処理の他の例のフローチャートである。FIG. 16 is a flowchart of another example of the determination subprogram processing according to the first embodiment. 図１７は、第１実施形態に係る設計対象に関わる代表語の判定結果の一例を示す図である。FIG. 17 is a diagram showing an example of a determination result of a representative word related to a design object according to the first embodiment. 図１８は、第１実施形態に係るＣＡＤ形状に関わる代表語の判定結果の一例を示す図である。FIG. 18 is a diagram showing an example of a determination result of a representative word related to a CAD shape according to the first embodiment. 図１９は、第１実施形態に係る指示内容一覧表の構成図である。FIG. 19 is a diagram showing the configuration of an instruction content list according to the first embodiment. 図２０は、第１実施形態に係る集計表の構成図である。FIG. 20 is a diagram showing the layout of a summary table according to the first embodiment. 図２１は、第１実施形態に係る集計画面の構成図である。FIG. 21 is a diagram showing the configuration of the tally screen according to the first embodiment. 図２２は、第２実施形態に係る文書内指示抽出システムの機能構成図である。FIG. 22 is a functional configuration diagram of a document instruction extraction system according to the second embodiment. 図２３は、第２実施形態に係るＣＡＤモデル表示画面の構成図である。FIG. 23 is a diagram showing the configuration of a CAD model display screen according to the second embodiment.

実施形態について、図面を参照して説明する。なお、以下に説明する実施形態は特許請求の範囲に係る発明を限定するものではなく、また実施形態の中で説明されている諸要素及びその組み合わせの全てが発明の解決手段に必須であるとは限らない。 Embodiments will be described with reference to the drawings. Note that the embodiments described below do not limit the invention as defined in the claims, and not all of the elements and combinations thereof described in the embodiments are necessarily essential to the solution of the invention.

なお、本実施形態では、辞書や表における項目については、〈〉で囲んで表記し、また、具体的なデータの値の例について、『』で囲んで表記し、図中では斜体文字にて表記する。 In this embodiment, items in dictionaries and tables are enclosed in <>, and specific examples of data values are enclosed in " " and written in italics in figures.

（第１実施形態）
文書内指示抽出システム１０１の説明をする前に、文書内指示抽出システム１０１で利用する文書（仕様書）の対象としている製品の一例と、製品に関する文書の一例としての仕様書について説明する。 (First embodiment)
Before describing the intra-document instruction extraction system 101, an example of a product that is the subject of a document (specification) used in the intra-document instruction extraction system 101, and a specification as an example of a document related to the product will be described.

図１は、第１実施形態に係る製品の一例の構成図である。 Figure 1 is a diagram showing an example of the configuration of a product according to the first embodiment.

製品３００は、製品本体３０１と、部品３０２（部品Ａともいう）とにより組み立てられる。部品３０２には、ハーネス３０３が付帯されている。製品３００において、製品本体３０１の上部には、リブ３０７が形成されている。製品本体３０１の本体前面３０４には、部品３０２が取り付けられる部品取付部３０５が形成されている。また、製品本体３０１の上部には、部品３０２のハーネス３０３を固定するためのボス３０６が設けられている。 Product 300 is assembled from product body 301 and part 302 (also referred to as part A). Part 302 is attached with harness 303. In product 300, rib 307 is formed on the top of product body 301. A part mounting portion 305 to which part 302 is attached is formed on the front surface 304 of product body 301. In addition, a boss 306 is provided on the top of product body 301 for fixing harness 303 of part 302.

図２は、第１実施形態に係る仕様書の一例の構成図である。図２の仕様書は、図１に示す製品３００に対する設計変更の指示文（指示内容）が記載された仕様書の例を示す。 Figure 2 is a diagram illustrating an example of a specification document according to the first embodiment. The specification document in Figure 2 shows an example of a specification document that describes instructions (instructions) for design changes to the product 300 shown in Figure 1.

仕様書２０１は、大タイトル表示領域２０２と、発行日表示領域２０３と、仕様書番号表示領域２０４と、製品型式表示領域２０５と、変更内容タイトル表示領域２０６と、指示文記載範囲２０７と、要期表示領域２０８と、ページ表示領域２０９とを含む。 The specification 201 includes a main title display area 202, an issue date display area 203, a specification number display area 204, a product model display area 205, a change title display area 206, a directive description range 207, a summary display area 208, and a page display area 209.

大タイトル表示領域２０２には、大タイトルである『仕様書』が記載されている。発行日表示領域２０３には、小タイトルである『発行日』と、仕様書２０１の発行日である『２０２２／０４／０１』が記載されている。仕様書番号表示領域２０４には、小タイトルである『仕様書番号』と、仕様書番号である『Ｋ１００１』が記載されている。製品型式表示領域２０５には、小タイトルである『製品型式』と、製品３００の型式である『ＫＡ１』が記載されている。変更内容タイトル表示領域２０６には、小タイトルである『変更内容』が記載されている。指示文記載範囲２０７には、製品３００の設計についての変更の内容に対応する１以上の指示文が記載されている。図２の例では、指示文記載範囲２０７には、それぞれの指示を示す指示文が、所定の行頭文字を先頭として、箇条書きで記載されている。要期表示領域２０８には、小タイトルである『要期』と、製品３００の要期である『２０２２／０５／０１』が記載されている。ページ表示領域２０９には、『頁』との記述と、頁番号である『１／２』が記載されている。 The main title display area 202 contains the main title "Specifications." The issue date display area 203 contains the subtitle "Issue Date" and the issue date of the specifications 201, "2022/04/01." The specification number display area 204 contains the subtitle "Specifications Number" and the specification number "K1001." The product model display area 205 contains the subtitle "Product Model" and the model of the product 300, "KA1." The change title display area 206 contains the subtitle "Changes." The instruction statement area 207 contains one or more instruction statements corresponding to the changes to the design of the product 300. In the example of Figure 2, the instruction statement area 207 contains instruction statements indicating each instruction, listed in bullet points, with a specified first character at the beginning of each line. The deadline display area 208 displays the subtitle "Date" and the deadline for product 300, "2022/05/01." The page display area 209 displays the word "Page" and the page number, "1/2."

文書内指示抽出システム１０１は、複数の図２に示すような仕様書に基づいて、設計変更の傾向を把握したい時に、仕様書の指示文を特定し、指示文から指示の本来意味していることを表す代表語を特定して分類する処理を行う。 When it is desired to grasp trends in design changes based on multiple specifications such as those shown in Figure 2, the document instruction extraction system 101 identifies instruction sentences in the specifications and performs processing to identify and classify representative words that express the original meaning of the instructions from the instruction sentences.

この処理においては、指示文に含まれる語（特徴語）を単に取り出して、辞書と照合するだけでは、本来意味している代表語を特定して分類することができないことがある。 In this process, simply extracting the words (characteristic words) contained in the instruction sentence and comparing them with a dictionary may not be enough to identify and classify the representative word that originally meant the instruction.

例えば、設計変更の指示文が『部品Ａ固定部上部Ｒ拡大』である場合に、この指示文が対象としている製品の部分の代表語を特定しようとしても、指示文の語句には、指示が意図している製品の本体前面であるという情報が含まれていない。また、指示文が『部品Ａハーネス用ボス根元Ｒ拡大』である場合に、この指示文が対象としている製品の部分の代表語を特定しようとしても、指示文の語句には、指示が意図している本体上部であるという情報が含まれていない。また、これらの２つの指示文には『部品Ａ』という同一の語が入っているが、これらの指示文が対象としている製品３００の部分は異なっている。 For example, if the instruction for a design change is "Expand the R of the upper part of the fixed part of part A," even if an attempt is made to identify the representative word for the part of the product that this instruction applies to, the words in the instruction do not contain information that the part is the front of the main body of the product that the instruction intends. Also, if the instruction is "Expand the R of the base of the harness boss of part A," even if an attempt is made to identify the representative word for the part of the product that this instruction applies to, the words in the instruction do not contain information that the part is the upper part of the main body that the instruction intends. Furthermore, although these two instruction statements contain the same word, "part A," the parts of product 300 that these instruction statements apply to are different.

さらに、指示文『部品Ａ固定部上部Ｒ拡大』には、『上部』という語が入っているが、この上部は、製品３００の本体上部を意味していない。したがって、指示文の語句のみからは、製品３００における対象の部分を示す代表語を特定することはできない。 Furthermore, the instruction "Enlarge upper R of fixed portion of part A" contains the word "upper portion," but this upper portion does not refer to the upper portion of the main body of product 300. Therefore, it is not possible to identify a representative word indicating the target portion of product 300 from the words and phrases in the instruction alone.

また、別の例として、設計変更の指示文から設計変更の種類を表す代表語を特定しようとする場合に、『部品Ａ固定部上部Ｒ拡大』との指示文では、『Ｒ拡大』は、例えば、部品Ａの固定部における凸状のＲの拡大を意図しているのに対して、『部品Ａハーネス用ボス根元Ｒ拡大』との指示文では、ボスの根元の凹状のＲの拡大を意図しており、同じ『Ｒ拡大』という語句であっても、異なる種類の設計変更であり、指示文の語句から設計変更の種類に対応する代表語を直接特定することはできない。 As another example, when trying to identify a representative word that represents the type of design change from a design change instruction, in the instruction "Enlarge the radius of the upper part of the fixing part of part A," "enlarge radius" refers to, for example, enlarging the convex radius of the fixing part of part A, whereas in the instruction "Enlarge the radius of the base of the harness boss of part A," it refers to enlarging the concave radius of the base of the boss. Even though the phrase "enlarge radius" is the same, these are different types of design changes, and it is not possible to directly identify a representative word that corresponds to the type of design change from the phrase in the instruction.

また、工業製品の設計では、型式のマイナーチェンジや製造時期により、製品の部品や一部分の形状の位置関係を大きく変える場合がある。例えば、製品３００の部品Ａのハーネス３０３を固定するためのボス３０６が、或る時期を境に、製品本体３０１の底面に配置されることがあり得る。このように、ボス３０６の位置が変更された場合以降においては、指示文『部品Ａハーネス用ボス根元Ｒ拡大』の意図する内容としては、本体底面に関する設計変更であるとして認識して、代表語を特定する必要がある。 Furthermore, in the design of industrial products, minor changes to the model or the time of production can significantly change the relative positions of the product's parts or the shapes of certain parts. For example, boss 306 for securing harness 303 of part A of product 300 may be positioned on the bottom surface of product body 301 at a certain point in time. In this way, if the position of boss 306 is changed, the intended content of the instruction "Expand the base radius of the boss for part A's harness" must be recognized as a design change related to the bottom surface of the body, and a representative term must be specified.

このように、指示文から適切な代表語を特定して分類するためには、指示文の語の組合せや、その順番等に応じて、本来の意図する内容を認識する必要がある。また、同じ語句が使われていても、その他の条件によって異なることを意図していることを認識する必要がある。 In this way, in order to identify and classify the appropriate representative word from a directive sentence, it is necessary to recognize the original intended content based on the combination of words in the directive sentence and their order. It is also necessary to recognize that even if the same words are used, different intentions may be intended depending on other conditions.

文書内指示抽出システム１０１は、上記したような状況を考慮して、指示文から指示の本来意味していることを表している代表語を適切に特定して分類する処理を行う。 The document instruction extraction system 101 takes into account the above-mentioned circumstances and performs processing to appropriately identify and classify representative words that express the original meaning of instructions from instruction sentences.

次に、文書内指示抽出システム１０１について詳細に説明する。 Next, we will explain the document instruction extraction system 101 in detail.

（ハードウェア構成）
図３は、第１実施形態に係る文書内指示抽出システムの全体構成図である。 (Hardware configuration)
FIG. 3 is a diagram showing the overall configuration of the document instruction extraction system according to the first embodiment.

文書内指示抽出システム１０１は、例えば、ＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）や、汎用サーバ等のコンピュータにより構成される。文書内指示抽出システム１０１は、プロセッサの一例としてのＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）１１と、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）１２と、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）１３と、記憶装置１４と、通信インターフェース（Ｉ／Ｆ）１５と、記録媒体読取装置１７と、入力装置１８と、表示装置１９とを含む。ＣＰＵ１１と、ＲＡＭ１２と、ＲＯＭ１３と、記憶装置１４と、通信Ｉ／Ｆ１５と、記録媒体読取装置１７と、入力装置１８と、表示装置１９とは、例えば、バスを介して通信可能に接続されている。 The document instruction extraction system 101 is composed of a computer, such as a PC (Personal Computer) or a general-purpose server. The document instruction extraction system 101 includes a CPU (Central Processing Unit) 11 (an example of a processor), a RAM (Random Access Memory) 12, a ROM (Read Only Memory) 13, a storage device 14, a communication interface (I/F) 15, a recording medium reader 17, an input device 18, and a display device 19. The CPU 11, RAM 12, ROM 13, storage device 14, communication I/F 15, recording medium reader 17, input device 18, and display device 19 are communicatively connected, for example, via a bus.

ＣＰＵ１１は、ＲＡＭ１２やＲＯＭ１３に格納されたプログラムを実行することにより、各種演算を行って、文書内指示抽出システム１０１の各部を集中的に制御する。ＲＡＭ１２は、ＣＰＵ１１で実行されるプログラムや、必要な情報を記憶する作業エリアとして機能する。ＲＯＭ１３は、ＢＩＯＳ（ＢａｓｉｃＩｎｐｕｔＯｕｔｐｕｔＳｙｓｔｅｍ）を記憶する。 The CPU 11 executes programs stored in the RAM 12 and ROM 13 to perform various calculations and centrally control each part of the document instruction extraction system 101. The RAM 12 functions as a work area for storing the programs executed by the CPU 11 and necessary information. The ROM 13 stores the BIOS (Basic Input Output System).

記憶装置１４は、記憶部の一例であり、例えば、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）や、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等の不揮発性の記憶装置であり、各種情報を記憶する。本実施形態では、記憶装置１４は、文書内指示抽出プログラム２０や、１以上の文書ファイル２１を格納する。文書ファイル２１は、例えば、製品の設計や製造に関する文書（例えば、仕様書）のファイルであり、文書編集ソフト等で作成されたテキストデータや画像を含んでもよく、仕様書の画像データに対してＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅｃｏｇｎｉｔｉｏｎ）を実行して画像データ中の文字をテキストデータ化したものであってもよい。 The storage device 14 is an example of a storage unit and is a non-volatile storage device such as a hard disk drive (HDD) or solid state drive (SSD), which stores various information. In this embodiment, the storage device 14 stores an in-document instruction extraction program 20 and one or more document files 21. The document files 21 are, for example, files of documents (e.g., specifications) related to product design or manufacturing, and may include text data and images created using document editing software, or may be files created by performing OCR (Optical Character Recognition) on image data of specifications to convert characters in the image data into text data.

通信Ｉ／Ｆ１５は、図示しないインターネット等のネットワークを介して他の装置と通信するためのインターフェースである。記録媒体読取装置１７は、ブルーレイディスク（Ｂｌｕ－ｒａｙ（登録商標）Ｄｉｓｃ）、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）、ＣＤ（ＣｏｍｐａｃｔＤｉｓｃ）等の記録媒体１６のデータを読み取る装置である。 The communication I/F 15 is an interface for communicating with other devices via a network such as the Internet (not shown). The recording medium reader 17 is a device that reads data from a recording medium 16 such as a Blu-ray (registered trademark) Disc, a DVD (Digital Versatile Disc), or a CD (Compact Disc).

入力装置１８は、例えば、キーボード、マウス等であり、ユーザによる情報の入力を受け付ける。表示装置１９は、例えば、液晶ディスプレイ、有機ＥＬディスプレイ等であり、各種情報を含むユーザインターフェースを表示出力する。 The input device 18 is, for example, a keyboard, mouse, etc., and accepts information input by the user. The display device 19 is, for example, a liquid crystal display, organic EL display, etc., and displays and outputs a user interface including various information.

文書内指示抽出システム１０１における、記憶装置１４にセットアップされている文書内指示抽出プログラム２０は、通信Ｉ／Ｆ１５を介して、インターネット上の装置からダウンロードして取得するようにしてもよいし、記録媒体読取装置１７により、文書内指示抽出プログラム２０が記録された記録媒体１６から読み取ってもよい。 In the document instruction extraction system 101, the document instruction extraction program 20 set up in the storage device 14 may be obtained by downloading it from a device on the Internet via the communication I/F 15, or it may be read by the recording medium reading device 17 from the recording medium 16 on which the document instruction extraction program 20 is recorded.

なお、図３では、文書内指示抽出システム１０１をひとつのコンピュータにより構成された例を示したが、本発明はこれに限られず、文書内指示抽出システム１０１を複数のサーバ装置で構成するようにしてもよい。このように文書内指示抽出システム１０１を複数のサーバ装置で構成する場合には、文書内指示抽出プログラム２０を複数のサーバ装置が連携して実行するようにしてもよい。この場合には、文書内指示抽出プログラム２０は、複数台のサーバ装置に分散してセットアップされているプログラム群の集合体に相当する。 Note that while Figure 3 shows an example in which the intra-document instruction extraction system 101 is configured using a single computer, the present invention is not limited to this, and the intra-document instruction extraction system 101 may be configured using multiple server devices. When the intra-document instruction extraction system 101 is configured using multiple server devices in this manner, the intra-document instruction extraction program 20 may be executed in cooperation with the multiple server devices. In this case, the intra-document instruction extraction program 20 corresponds to a collection of programs that are distributed and set up across multiple server devices.

次に、文書内指示抽出システム１０１の機能構成について説明する。 Next, we will explain the functional configuration of the document instruction extraction system 101.

（機能構成）
図４は、第１実施形態に係る文書内指示抽出システムの機能構成図である。 (Functional configuration)
FIG. 4 is a functional configuration diagram of the document instruction extraction system according to the first embodiment.

文書内指示抽出システム１０１は、辞書データ記憶部１０７と、判定サブプログラム記憶部１０９と、ファイル入力部１０２と、テキストデータ抽出部１０３と、指示文抽出部１０４と、代表語特定部１０５と、集計部１１０と、出力部１１１と、を備える。辞書データ記憶部１０７と、判定サブプログラム記憶部１０９とは、主に、ＲＡＭ１２及び／又は記憶装置１４により構成され、ファイル入力部１０２と、テキストデータ抽出部１０３と、指示文抽出部１０４と、代表語特定部１０５と、集計部１１０と、出力部１１１とは、主に、ＣＰＵ１１が文書内指示抽出プログラム２０を実行することにより構成される。 The intra-document instruction extraction system 101 includes a dictionary data storage unit 107, a judgment subprogram storage unit 109, a file input unit 102, a text data extraction unit 103, a reference sentence extraction unit 104, a representative word identification unit 105, a counting unit 110, and an output unit 111. The dictionary data storage unit 107 and the judgment subprogram storage unit 109 are primarily configured by the RAM 12 and/or the storage device 14, while the file input unit 102, the text data extraction unit 103, the reference sentence extraction unit 104, the representative word identification unit 105, the counting unit 110, and the output unit 111 are primarily configured by the CPU 11 executing the intra-document instruction extraction program 20.

辞書データ記憶部１０７は、メタ情報手掛り語辞書Ｄ１と、指示文手掛り語辞書Ｄ２と、１以上の代表語辞書Ｄ３とを記憶する。これらの辞書の詳細については、後述する。判定サブプログラム記憶部１０９は、代表語を判定するための１以上の判定サブプログラム（判定プログラムの一例）を格納する。判定サブプログラムは、第２判定部１０８から呼び出されて実行される。 The dictionary data storage unit 107 stores a meta-information clue word dictionary D1, a demonstrative clue word dictionary D2, and one or more representative word dictionaries D3. Details of these dictionaries will be described later. The determination subprogram storage unit 109 stores one or more determination subprograms (an example of a determination program) for determining representative words. The determination subprograms are called and executed by the second determination unit 108.

ファイル入力部１０２は、処理対象とする文書ファイル２１を記憶装置１４から読み込み、文書ファイルのデータをテキストデータ抽出部１０３に渡す。なお、文書ファイル２１は、ネットワークを介して外部の装置から読み込むようにしてもよい。テキストデータ抽出部１０３は、文書ファイル２１のデータからテキストデータを抽出し、指示文抽出部１０４へ渡す。 The file input unit 102 reads the document file 21 to be processed from the storage device 14 and passes the document file data to the text data extraction unit 103. Note that the document file 21 may also be read from an external device via a network. The text data extraction unit 103 extracts text data from the data in the document file 21 and passes it to the instruction sentence extraction unit 104.

指示文抽出部１０４は、文書ファイルのテキストデータと、メタ情報手掛り語辞書Ｄ１及び指示文手掛り語辞書Ｄ２とに基づいて、文書ファイルにおける指示内容の指示文を抽出し、指示文のリスト（指示文リスト）とメタ情報データとを含むデータセットを生成し、代表語特定部１０５に渡す。 The instruction sentence extraction unit 104 extracts instruction sentences of the instruction content in the document file based on the text data of the document file, the meta information clue word dictionary D1, and the instruction sentence clue word dictionary D2, generates a data set including a list of instruction sentences (instruction sentence list) and meta information data, and passes this to the target word identification unit 105.

代表語特定部１０５は、第１判定部１０６と、第２判定部１０８とを有する。第１判定部１０６は、指示文リストと、代表語辞書Ｄ３とに基づいて、指示文における特徴語を検索し、代表語の候補を特定する処理をする。第２判定部１０８は、代表語の候補が代表語である否かを判定するために判定サブプログラムを実行する必要がある場合に、判定サブプログラムを実行させて、代表語であるか否かを判定し、その結果に基づいて、代表語のリスト（代表語リスト）を生成し、代表語リスト、指示文リスト、及びメタ情報データを集計部１１０へ渡す。 The representative word identification unit 105 has a first determination unit 106 and a second determination unit 108. The first determination unit 106 searches for characteristic words in instruction sentences based on the instruction sentence list and the representative word dictionary D3, and performs processing to identify candidate representative words. When it is necessary to execute a determination subprogram to determine whether a candidate representative word is a representative word, the second determination unit 108 executes the determination subprogram to determine whether the candidate is a representative word, and based on the result, generates a list of representative words (representative word list), and passes the representative word list, instruction sentence list, and meta information data to the aggregation unit 110.

集計部１１０は、メタ情報データ、指示文リスト、及び代表語リストをまとめたデータセットと、このデータセットに基づいて集計したデータ（集計データ：図２０参照）を出力部１１１に渡す。出力部１１１は、渡されたデータセットや集計データを記憶装置１４に格納したり、データセットの情報を含む指示内容一覧表（図１９参照）の画面や集計データに基づく集計画面（図２１参照）を表示装置１９に出力したりする。 The aggregation unit 110 passes a data set that compiles the meta information data, instruction sentence list, and representative word list, as well as data aggregated based on this data set (aggregated data: see Figure 20) to the output unit 111. The output unit 111 stores the passed data set and aggregated data in the storage device 14, and outputs to the display device 19 a screen showing an instruction content list (see Figure 19) that includes information from the data set, and a aggregation screen based on the aggregated data (see Figure 21).

（辞書）
次に、メタ情報手掛り語辞書Ｄ１、指示文手掛り語辞書Ｄ２、及び代表語辞書Ｄ３について詳細に説明する。 (dictionary)
Next, the meta-information cue word dictionary D1, the instruction cue word dictionary D2, and the representative word dictionary D3 will be described in detail.

図５は、第１実施形態に係るメタ情報手掛り語辞書の構成図である。 Figure 5 is a diagram showing the structure of the meta information clue word dictionary according to the first embodiment.

メタ情報手掛り語辞書Ｄ１は、〈仕様書番号開始ワード〉、〈仕様書番号書式〉、〈仕様書日付開始ワード〉、及び〈仕様書日付書式〉の項目を含む。 Meta information clue word dictionary D1 includes the entries <specification number start word>, <specification number format>, <specification date start word>, and <specification date format>.

〈仕様書番号開始ワード〉には、仕様書における仕様書番号の開始を示すワードが格納される。図５の例では、〈仕様書番号開始ワード〉には、『仕様書番号』、『仕様書＃』等が格納されている。〈仕様書番号書式〉には、仕様書番号の書式が格納される。図５の例では、仕様書の書式として、『Ａ０００１』が格納される。〈仕様書日付開始ワード〉には、仕様書における仕様書日付の開始を示すワードが格納される。図５の例では、〈仕様書日付開始ワード〉には、『作成日』、『登録日』等が格納される。〈仕様書日付書式〉には、仕様書日付の書式が格納される。図５の例では、〈仕様書日付書式〉には、『００００／００／００』が格納されている。 <Specification Number Start Word> stores a word that indicates the start of the specification number in the specification. In the example of Figure 5, <Specification Number Start Word> stores "Specification Number", "Specification #", etc. <Specification Number Format> stores the format of the specification number. In the example of Figure 5, "A0001" is stored as the specification format. <Specification Date Start Word> stores a word that indicates the start of the specification date in the specification. In the example of Figure 5, <Specification Date Start Word> stores "Creation Date", "Registration Date", etc. <Specification Date Format> stores the format of the specification date. In the example of Figure 5, <Specification Date Format> stores "0000/00/00".

図６は、第１実施形態に係る指示文手掛り語辞書の構成図である。 Figure 6 is a diagram showing the configuration of the instruction sentence cue word dictionary according to the first embodiment.

指示文手掛り語辞書Ｄ２は、指示内容特定用情報及び個別指示特定情報の一例であり、〈仕様書開始ワード〉、〈仕様書終了ワード〉、〈指示文記載範囲開始ワード〉、〈指示文記載範囲終了ワード〉、〈行頭文字種類〉、〈行番号〉、及び〈行頭文字〉の項目を含む。 The instruction clue word dictionary D2 is an example of instruction content identification information and individual instruction identification information, and includes the following items: <specification start word>, <specification end word>, <instruction statement range start word>, <instruction statement range end word>, <line start character type>, <line number>, and <line start character>.

〈仕様書開始ワード〉には、仕様書における処理対象とする範囲の開始位置を示すワードが格納される。図６の例では、〈仕様書開始ワード〉には、『仕様書』、『仕様書』等が格納されている。〈仕様書終了ワード〉には、仕様書における処理対象とする範囲の終了位置を示すワードが格納される。図６の例では、〈仕様書終了ワード〉には、『頁』等が格納されている。 The <specification start word> stores a word that indicates the start position of the range in the specification to be processed. In the example in Figure 6, <specification start word> stores "specification", "specification document", etc. The <specification end word> stores a word that indicates the end position of the range in the specification to be processed. In the example in Figure 6, <specification end word> stores "page", etc.

〈指示文記載範囲開始ワード〉には、仕様書における指示文が記載される範囲（指示文記載範囲）の開始位置を示すワードが格納される。図６の例では、〈指示文記載範囲開始ワード〉には、『変更内容』が格納される。〈指示文記載範囲終了ワード〉には、仕様書における指示文記載範囲の終了位置を示すワードが格納される。図６の例では、〈指示文記載範囲終了ワード〉には、『要期』が格納されている。 The <start word of directive description range> field stores a word that indicates the start position of the range in the specification where the directive is written (directive description range). In the example in Figure 6, the <start word of directive description range> field stores "Change content." The <end word of directive description range> field stores a word that indicates the end position of the range in the specification where the directive is written. In the example in Figure 6, the <end word of directive description range> field stores "Essential."

〈行頭文字種類〉には、指示文記載範囲における箇条書きとなっている指示文の行頭を示す文字種類が格納される。図６の例では、〈行頭文字種類〉には、行頭文字が［ｎ］の種類であることを示す『ＮｕｍＴｙｐｅ１』、行頭文字が（ｎ）の種類であることを示す『ＮｕｍＴｙｐｅ２』等が格納されている。〈行番号〉には、指示文記載範囲における指示文の行番号が格納される。図６の例では、〈行番号〉には、行頭文字種類ごとに、１から順番に所定の数までの番号が格納されている。〈行頭文字〉には、文字行頭種類の行番号において表記される行頭文字が格納される。 <Line start character type> stores the character type that indicates the start of a line of an itemized directive within the directive description range. In the example in Figure 6, <Line start character type> stores "Num Type1", which indicates that the line start character is type [n], "Num Type2", which indicates that the line start character is type (n), etc. <Line number> stores the line number of the directive within the directive description range. In the example in Figure 6, <Line number> stores numbers from 1 to a predetermined number in order for each line start character type. <Line start character> stores the line start character that is indicated by the line number of the character line start type.

例えば、図６の例では、“〈行頭文字種類〉，〈行番号〉，〈行頭文字〉”として、“『ＮｕｍＴｙｐｅ１』，『１』，『［１］』”、“『ＮｕｍＴｙｐｅ１』、『２』、『［２］』”、・・・、“『ＮｕｍＴｙｐｅ２』、『１』、『（１）』”、“『ＮｕｍＴｙｐｅ２』、『２』、『（２）』”、・・・が格納されている。 For example, in the example in Figure 6, the following are stored as "〈Line start character type〉,〈Line number〉,〈Line start character〉": "'Num Type1', '1', '[1]'", "'Num Type1', '2', '[2]'", ..., "'Num Type2', '1', '(1)'", "'Num Type2', '2', '(2)'", ....

本実施形態では、代表語辞書Ｄ３（代表語特定情報の一例）は、カテゴリの異なる代表語についての複数の代表語辞書として、設計対象に関する代表語辞書Ｄ３－１と、ＣＡＤ形状に関する代表語辞書Ｄ３－２と、変更の種類に関する代表語辞書Ｄ３－３とを含む。 In this embodiment, the representative word dictionary D3 (an example of representative word identification information) includes multiple representative word dictionaries for representative words of different categories: a representative word dictionary D3-1 related to the design object, a representative word dictionary D3-2 related to CAD shapes, and a representative word dictionary D3-3 related to the type of change.

図７は、第１実施形態に係る設計対象に関する代表語辞書の構成図である。 Figure 7 is a diagram showing the configuration of the representative word dictionary for the design object in the first embodiment.

代表語辞書Ｄ３－１は、設計対象（カテゴリの一例）に関する代表語を特定するための辞書であり、〈処理番号〉、〈代表語〉、〈判定コード〉、〈特徴語（１）〉、〈特徴語（２）〉等の項目を含む。なお、図７の例では、特徴語を格納する項目は、〈特徴語（１）〉及び〈特徴語（２）〉の２つとなっているが、特徴語が多い場合には、項目を３以上としてもよい。 The representative word dictionary D3-1 is a dictionary for identifying representative words related to the design object (an example of a category), and includes items such as <process number>, <representative word>, <determination code>, <characteristic word (1)>, and <characteristic word (2)>. In the example of Figure 7, there are two items for storing characteristic words: <characteristic word (1)> and <characteristic word (2)>. However, if there are many characteristic words, there may be three or more items.

〈処理番号〉には、代表語辞書Ｄ３－１におけるレコードについての通し番号（処理番号）が格納される。〈代表語〉には、代表語の候補（代表語）が格納される。〈判定コード〉には、レコードに対応する〈代表語〉の代表語の候補が代表語であるか否かを判定するための判定処理に関する判定処理情報が格納される。判定処理情報としては、判定処理に利用する判定サブプログラムを特定するプログラム特定情報と、その判定サブプログラムに渡す引数との情報を含んでもよい。例えば、処理番号が２のレコードの〈判定コード〉における『Ｎｏ－部品Ａ、部品Ｂ』の『ｎｏ－』は、判定サブプログラム「Ｓｕｂ－ｎ」を示すプログラム特定情報であり、『部品Ａ、部品Ｂ』が引数である。なお、判定サブプログラム「Ｓｕｂ－ｎ」は、例えば、指示文が引数に該当する文字列を含まないことに適合するか否かを判定する判定処理を実行するプログラムである。また、処理番号が４のレコードの〈判定コード〉における『ｄａｔｅ‐ａｆｔｅｒ－２０２１０５０１』の『ｄａｔｅ－ａｆｔｅｒ』は、判定サブプログラム「ｄａｔｅ－ａｆｔｅｒ」を示すプログラム特定情報であり、『２０２１０５０１』が引数である。なお、判定サブプログラム「ｄａｔｅ－ａｆｔｅｒ」は、例えば、引数の日付以降の指示文であることに適合するか否かを判定する判定処理を実行するプログラムである。なお、〈判定コード〉にデータが設定されていない場合には、〈代表語〉の語が判定処理を行うことなく代表語となる。 <Processing Number> stores the serial number (processing number) for the record in the target word dictionary D3-1. <Target Word> stores the target word candidate (target word). <Determination Code> stores determination process information related to the determination process used to determine whether the target word candidate for the <Target Word> corresponding to the record is the target word. The determination process information may include program-specific information that identifies the determination subprogram used in the determination process and information on the arguments passed to that determination subprogram. For example, the "no-" in "No-Part A, Part B" in the <Determination Code> of a record with a processing number of 2 is program-specific information that indicates the determination subprogram "Sub-n," and "Part A, Part B" are the arguments. Note that the determination subprogram "Sub-n" is a program that executes a determination process that determines, for example, whether a directive conforms to the condition that it does not contain a string corresponding to the argument. Additionally, the "date-after" in "date-after-20210501" in the <judgment code> of the record with processing number 4 is program-specific information that indicates the judgment subprogram "date-after," and "20210501" is the argument. Note that the judgment subprogram "date-after" is a program that executes judgment processing to determine whether a directive is satisfied because it is dated after the argument date. Note that if no data is set in the <judgment code>, the word in the <representative word> becomes the representative word without performing judgment processing.

〈特徴語（１）〉、〈特徴語（２）〉には、レコードの代表語として判定される条件となる１以上の特徴語が格納される。本実施形態では、指示文に〈特徴語（１）のいずれかの特徴語と、〈特徴語（２）〉のいずれかの特徴語との組み合わせがあることが、そのレコードの〈代表語〉の語が、代表語となる条件の一つである。 <Feature Word (1)> and <Feature Word (2)> store one or more feature words that are the conditions for determining a record's representative word. In this embodiment, one of the conditions for a word in <Representative Word> to become a representative word is that the instruction statement contains a combination of one of the feature words in <Feature Word (1)> and one of the feature words in <Feature Word (2)>.

例えば、図７の４つ目のレコード、すなわち、〈処理番号〉が『４』のレコードは、〈代表語〉には、『本体底面』が格納され、〈判定コード〉には、『ｄａｔａ‐ａｆｔｅｒ－２０２１０５０１』が格納され、〈特徴語（１）〉には、『部品Ａ』が格納され、〈特徴語（２）〉には、『ハーネス、コード』が格納されている。このレコードは、部品Ａ及びハーネス、又は部品Ａ及びコードの組み合わせが指示文に含まれ、且つ『ｄａｔａ‐ａｆｔｅｒ』が示す判定サブプログラム「ｄａｔｅ－ａｆｔｅｒ」を、『２０２１０５０１』を引数として判定処理を行って適合すると判定された場合に、代表語が『本体底面』と特定されることを示している。 For example, in the fourth record in Figure 7, i.e., the record with a <processing number> of "4," the <representative word> stores "bottom of main body," the <determination code> stores "data-after-20210501," the <characteristic word (1)> stores "Part A," and the <characteristic word (2)> stores "harness, cord." This record indicates that if the combination of Part A and harness, or Part A and cord, is included in the directive, and the determination subprogram "date-after" indicated by "data-after" performs a determination process using "20210501" as an argument and determines that the combination is compatible, then the representative word is identified as "bottom of main body."

ここで、製品のある特定の方向については、製品についての通常時における方向に基づく表現により、製品についての設計図面上での方向に基づく表現により、或いは製品の製造時の製造工程における方向に基づく表現により等といったように多種の表現のし方がある。このため、同一の方向を示す語句を用いたとしても、実際に意味する方向が全く異なる場合がある。そこで、代表語辞書においては、同一の代表語に対して、製品の通常時における方向に関する特徴語（第１特徴語）との対応関係と、製品の設計図面における方向に関する特徴語（第２特徴語）との対応関係と、製品の製造時における方向に関する特徴語（第３特徴語）との対応関係とのうちの複数の対応関係を含んでもよい。例えば、代表語辞書に、代表語である本体上部についての製品の通常時における方向に関する特徴語を含むレコードと、本体上部についての製品の設計図面における方向に関する特徴語を含むレコードと、本体上部についての製品の製造時における方向に関する特徴語を含むレコードとの少なくとも複数のレコードを格納するようにしてもよい。このようにすると、いずれの表現により方向が表されている場合であっても、同一の代表語を特定することができるようになる。 A specific product orientation can be expressed in a variety of ways, such as by expressions based on the orientation of the product in its normal state, by expressions based on the orientation of the product on design drawings, or by expressions based on the orientation during the manufacturing process. Therefore, even when the same word or phrase is used to indicate a direction, the actual meaning of the direction may be completely different. Therefore, the target word dictionary may include multiple correspondences for the same target word: a correspondence with a characteristic word (first characteristic word) related to the orientation of the product in its normal state, a correspondence with a characteristic word (second characteristic word) related to the orientation on the product's design drawings, and a correspondence with a characteristic word (third characteristic word) related to the orientation during manufacturing. For example, the target word dictionary may store at least multiple records: a record containing a characteristic word related to the orientation of the target word for the upper part of the main body in its normal state, a record containing a characteristic word related to the orientation of the upper part of the main body on the design drawings, and a record containing a characteristic word related to the orientation of the upper part of the main body during manufacturing. This makes it possible to identify the same target word regardless of the expression used to indicate the direction.

図８は、第１実施形態に係るＣＡＤ形状に関する代表語辞書の構成図である。 Figure 8 is a diagram showing the configuration of the representative word dictionary for CAD shapes in the first embodiment.

代表語辞書Ｄ３－２は、ＣＡＤ形状（カテゴリの一例）に関する代表語を特定するための辞書であり、登録されるデータがＣＡＤ形状に関する代表語に関するものであるという点が代表語辞書Ｄ３－１と違うだけであり、項目の構成は、同様である。 The representative word dictionary D3-2 is a dictionary for identifying representative words related to CAD shapes (an example of a category), and differs from the representative word dictionary D3-1 only in that the registered data relates to representative words related to CAD shapes; the item structure is the same.

図９は、第１実施形態に係る変更の種類に関する代表語辞書の構成図である。 Figure 9 is a diagram showing the configuration of the representative word dictionary for change types in the first embodiment.

代表語辞書Ｄ３－３は、変更種類（カテゴリの一例）に関する代表語を特定するための辞書であり、登録されるデータが製品の設計内容の変更種類に関する代表語に関するものであるという点が代表語辞書Ｄ３－１と違うだけであり、項目の構成は、同様である。 The representative word dictionary D3-3 is a dictionary for identifying representative words related to change types (an example of a category). The only difference from the representative word dictionary D3-1 is that the registered data relates to representative words related to change types in product design content; the item structure is the same.

（処理動作）
次に、文書内指示抽出システム１０１による処理動作について説明する。 (Processing Operation)
Next, the processing operation of the document instruction extraction system 101 will be described.

（指示文抽出・代表語特定処理）
まず、指示文抽出・代表語特定処理について説明する。 (Instruction sentence extraction and representative word identification processing)
First, the instruction sentence extraction and group word specification process will be described.

図１０は、第１実施形態に係る指示文抽出・代表語特定処理のフローチャートである。 Figure 10 is a flowchart of the instruction sentence extraction and group word identification process according to the first embodiment.

文書内指示抽出システム１０１は、待機状態にあるときに、処理開始のトリガー（例えば、入力装置１８を介してのユーザによる処理開始指示）を受けると、ファイル入力部１０２は、ユーザから処理対象とする１以上の文書ファイルの選択を受け付け、記憶装置１４から選択された文書ファイルを読み込み、文書ファイルをデータとしてテキストデータ抽出部１０３に渡す（Ｓ１０１）。 When the intra-document instruction extraction system 101 receives a trigger to start processing (e.g., a processing start instruction from the user via the input device 18) while in standby mode, the file input unit 102 accepts a selection from the user of one or more document files to be processed, reads the selected document files from the storage device 14, and passes the document files as data to the text data extraction unit 103 (S101).

次いで、テキストデータ抽出部１０３は、文書ファイルのデータからテキストデータを抽出し、指示文抽出部１０４へ渡す（Ｓ１０２）。 Next, the text data extraction unit 103 extracts text data from the document file data and passes it to the instruction statement extraction unit 104 (S102).

次いで、指示文抽出部１０４は、文書ファイルのテキストデータと、メタ情報手掛り語辞書Ｄ１及び指示文手掛り語辞書Ｄ２とに基づいて、文書ファイルのメタ情報と、文書ファイル中の指示文を抽出するメタ情報・指示文抽出処理（図１１参照）を実行し、抽出された指示文のリスト（指示文リスト）とメタ情報データとを含むデータセットを代表語特定部１０５に渡す（Ｓ１０３）。 Next, the instruction sentence extraction unit 104 executes a meta information/instruction sentence extraction process (see FIG. 11) to extract meta information of the document file and instruction sentences in the document file based on the text data of the document file, the meta information clue word dictionary D1, and the instruction sentence clue word dictionary D2, and passes a data set including a list of the extracted instruction sentences (instruction sentence list) and meta information data to the representative word identification unit 105 (S103).

次いで、代表語特定部１０５は、指示文リスト、メタ情報データ、及び代表語辞書Ｄ３に基づいて、特徴語を検索し、代表語を特定する代表語特定処理（図１３参照）を実行して代表語リストを生成し、代表語リスト、指示文リスト、及びメタ情報データを集計部１１０へ渡す（Ｓ１０４）。本実施形態では、代表語特定部１０５は、ステップＳ１０４の処理を、処理対象のすべての文書ファイルを対象に、各文書ファイルに対して代表語の各カテゴリを対象に実行する。例えば、代表語特定部１０５は、処理対象のすべての文書ファイルを対象にして得られた、代表語リスト、指示文リスト、及びメタ情報データを、指示内容一覧表Ｒ１（図１９参照）に対応する表形式のファイルとして出力する。 Next, the target word identification unit 105 searches for characteristic words based on the instruction sentence list, meta information data, and target word dictionary D3, executes a target word identification process (see FIG. 13) to identify target words, generates a target word list, and passes the target word list, instruction sentence list, and meta information data to the aggregation unit 110 (S104). In this embodiment, the target word identification unit 105 performs the process of step S104 for all document files to be processed, and for each category of target words for each document file. For example, the target word identification unit 105 outputs the target word list, instruction sentence list, and meta information data obtained for all document files to be processed as a table-format file corresponding to the instruction content list R1 (see FIG. 19).

次いで、集計部１１０は、代表語特定部１０５から出力された表形式のファイルを読み込み、代表語のカテゴリごとに、特定された代表語の数をカウントして集計表Ｒ２（図２０参照）を作成し、記憶装置１４に記憶するとともに、表形式のファイルとして出力する（Ｓ１０５）。 Next, the counting unit 110 reads the tabular file output from the target word identification unit 105, counts the number of target words identified for each target word category, creates a count table R2 (see Figure 20), stores it in the storage device 14, and outputs it as a tabular file (S105).

次いで、出力部１１１は、集計表Ｒ２に基づいて、代表語のカテゴリごとに、各代表語の特定された数に関するグラフを描画して画像ファイルとして記憶装置１４に出力するとともに、グラフを含む集計画面４０１（図２１参照）を表示装置１９に表示し（Ｓ１０６）、処理を終了する。 Next, the output unit 111 draws a graph showing the identified number of representative words for each representative word category based on the summary table R2, outputs this to the storage device 14 as an image file, and displays the summary screen 401 (see Figure 21) including the graph on the display device 19 (S106), thereby terminating the process.

（メタ情報・指示文抽出処理）
次に、ステップＳ１０３のメタ情報・指示文抽出処理について説明する。 (Meta information and instruction sentence extraction processing)
Next, the meta-information/instruction sentence extraction process in step S103 will be described.

図１１は、第１実施形態に係るメタ情報・指示文抽出処理のフローチャートである。 Figure 11 is a flowchart of the meta information/instruction sentence extraction process according to the first embodiment.

まず、指示文抽出部１０４は、テキストデータ抽出部１０３から文書ファイルのテキストデータを受け付け、そのテキストデータを記憶する（Ｓ２０１）。 First, the instruction sentence extraction unit 104 receives text data of a document file from the text data extraction unit 103 and stores that text data (S201).

次いで、指示文抽出部１０４は、テキストデータの中から、メタ情報手掛り語辞書Ｄ１の〈仕様書番号開始ワード〉及び〈仕様書番号書式〉のデータに基づいて、仕様書番号を抽出し、記憶する（Ｓ２０２）。 Next, the instruction sentence extraction unit 104 extracts and stores the specification number from the text data based on the data in the <specification number start word> and <specification number format> in the meta information clue word dictionary D1 (S202).

次いで、指示文抽出部１０４は、テキストデータの中から、メタ情報手掛り語辞書Ｄ１の〈仕様書日付開始ワード〉及び〈仕様書日付書式〉のデータに基づいて、仕様書日付を抽出し、記憶する（Ｓ２０３）。 Next, the instruction sentence extraction unit 104 extracts and stores the specification date from the text data based on the <specification date start word> and <specification date format> data in the meta information clue word dictionary D1 (S203).

次いで、指示文抽出部１０４は、テキストデータの中から、指示文手掛り語辞書Ｄ２の〈指示文記載範囲開始ワード〉及び〈指示文記載範囲終了ワード〉のデータに基づいて、指示文記載範囲を抽出し、記憶する（Ｓ２０４）。 Next, the instruction sentence extraction unit 104 extracts and stores the instruction sentence description range from the text data based on the data for the <instruction sentence description range start word> and <instruction sentence description range end word> in the instruction sentence clue word dictionary D2 (S204).

次いで、指示文抽出部１０４は、指示文記載範囲を、指示文手掛り語辞書Ｄ２の〈行頭文字種類〉、〈行番号〉、及び〈行頭文字〉のデータに基づいて、指示文記載範囲の内容をそれぞれの指示文に分割する指示文分割処理（図１２参照）を実行する（Ｓ２０５）。 Next, the instruction sentence extraction unit 104 performs an instruction sentence division process (see FIG. 12) to divide the contents of the instruction sentence description range into individual instruction sentences based on the data for <first character type>, <line number>, and <first character> in the instruction sentence clue word dictionary D2 (S205).

次いで、指示文抽出部１０４は、記憶された仕様書番号、仕様書日付等のメタ情報データ、及び指示文のデータリストを、代表語特定部１０５で利用可能なデータとして記憶し（Ｓ２０６）、処理を終了する。 Next, the instruction sentence extraction unit 104 stores the stored meta-information data such as the specification number and specification date, as well as the instruction sentence data list, as data available to the representative word identification unit 105 (S206), and then ends the process.

（指示文分割処理）
次に、ステップＳ２０５の指示文分割処理について説明する。 (Directive Sentence Splitting Processing)
Next, the instruction statement division process in step S205 will be described.

図１２は、第１実施形態に係る指示文分割処理のフローチャートである。 Figure 12 is a flowchart of the directive segmentation process according to the first embodiment.

指示文抽出部１０４は、処理開始のトリガーを受け付けると、抽出済フラグの値を“Ｆａｌｓｅ”にリセットし、指示文リストを空の配列変数として準備する（Ｓ２１１）。ここで、指示文リストは、例えば、番号と、対応する１つの指示文との組をひとつの要素としている。 When the instruction statement extraction unit 104 receives a trigger to start processing, it resets the value of the extracted flag to "False" and prepares an instruction statement list as an empty array variable (S211). Here, the instruction statement list has, for example, a pair of a number and a corresponding instruction statement as one element.

次いで、指示文抽出部１０４は、変数ｔ１にステップＳ２０４で記憶された指示文記載範囲を代入し、指示文手掛り語辞書Ｄ２の〈行頭文字種類〉のデータを重複しないように配列変数Ｌ１に代入する（Ｓ２１２）。この結果、配列変数Ｌ１には、指示文手掛り語辞書Ｄ２の〈行頭文字種類〉に登録されている全種類の行頭文字種類が格納されることとなる。 Next, the instruction sentence extraction unit 104 assigns the instruction sentence description range stored in step S204 to variable t1, and assigns the data of <first character type> in the instruction sentence cue word dictionary D2 to array variable L1 so as not to overlap (S212). As a result, array variable L1 stores all types of first character types registered in <first character type> in the instruction sentence cue word dictionary D2.

次いで、指示文抽出部１０４は、配列変数Ｌ１の値を１つずつ変数ｎに代入することにより、ループＡの処理（Ｓ２１３～Ｓ２２０）を繰り返す。 The directive extraction unit 104 then repeats the processing of loop A (S213 to S220) by assigning the values of array variable L1 one by one to variable n.

ループＡの処理において、指示文抽出部１０４は、抽出済フラグの値をチェックし（Ｓ２１３）、抽出済みフラグが“Ｔｒｕｅ”である場合（Ｓ２１３：Ｙｅｓ）には、指示文を分割したことを示しているので、ループＡを抜けて処理を終了する。 In the processing of Loop A, the instruction extraction unit 104 checks the value of the extracted flag (S213). If the extracted flag is "True" (S213: Yes), this indicates that the instruction has been split, and the process exits Loop A and ends.

一方、抽出済フラグが“Ｆａｌｓｅ”である場合（Ｓ２１３：Ｎｏ）には、指示文抽出部１０４は、指示文手掛り語辞書Ｄ２の〈行頭文字種類〉＝変数ｎに対応する、〈行番号〉の行番号と〈行頭文字〉の行頭文字との組のすべてのリストを配列変数Ｌ２に代入する（Ｓ２１４）。 On the other hand, if the extracted flag is "False" (S213: No), the instruction sentence extraction unit 104 assigns to array variable L2 all lists of pairs of line numbers in <line number> and line start characters in <line start character> that correspond to <line start character type> = variable n in the instruction sentence clue word dictionary D2 (S214).

次いで、指示文抽出部１０４は、配列変数Ｌ２について〈行番号〉＝１から順に、変数ｍに〈行番号〉の行番号を代入し、変数ｓに〈行頭文字〉の行頭文字を代入することにより、ループＢの処理（Ｓ２１５～Ｓ２２０）を繰り返す。 Next, the directive extraction unit 104 repeats the processing of loop B (S215 to S220) by assigning the line number of <line number> to variable m and the first character of <line first character> to variable s, starting from <line number> = 1 for array variable L2.

ループＢの処理において、指示文抽出部１０４は、変数ｔ１内で変数ｓを検索し、変数ｔ１内に変数ｓがあるか否かを判定する（Ｓ２１５）。この結果、変数ｔ１内に変数ｓがない場合（Ｓ２１５：Ｎｏ）には、指示文抽出部１０４は、変数ｍの値が１であるか否かを判定し（Ｓ２２１）、変数ｍが１であると判定した場合（Ｓ２２１：Ｙｅｓ）、すなわち、ループＢの１巡目の処理である場合には、ループＢ処理を抜ける。一方、変数ｍが１でないと判定した場合（Ｓ２２１：Ｎｏ）、すなわち、ループＢの２巡目以降の処理である場合には、指示文抽出部１０４は、変数ｔ１をｍ－１番目の指示文として、指示文リストに追加し（Ｓ２２０）、ループＢ処理を抜ける。 In processing loop B, the directive extraction unit 104 searches for variable s within variable t1 and determines whether variable s is present within variable t1 (S215). As a result, if variable s is not present within variable t1 (S215: No), the directive extraction unit 104 determines whether the value of variable m is 1 (S221). If it is determined that variable m is 1 (S221: Yes), that is, if this is the first iteration of loop B, the directive extraction unit 104 exits loop B. On the other hand, if it is determined that variable m is not 1 (S221: No), that is, if this is the second or subsequent iteration of loop B, the directive extraction unit 104 adds variable t1 to the directive list as the (m-1)th directive (S220) and exits loop B.

一方、変数ｔ１内に変数ｓがある場合（Ｓ２１５：Ｙｅｓ）には、指示文抽出部１０４は、変数ｔ１内の変数ｓが見つかった位置をｐとし、抽出済フラグに“Ｔｒｕｅ”を代入し、変数ｔ１内のｐから末端までを変数ｔ２に代入する（Ｓ２１６）。 On the other hand, if variable s is found in variable t1 (S215: Yes), the directive extraction unit 104 sets the position where variable s is found in variable t1 to p, assigns "True" to the extraction flag, and assigns everything from p to the end of variable t1 to variable t2 (S216).

次いで、指示文抽出部１０４は、変数ｍの値が１であるか否かを判定し（Ｓ２１７）、変数ｍが１であると判定した場合（Ｓ２１７：Ｙｅｓ）、すなわち、ループＢの１巡目の処理である場合には、処理をステップＳ２１９に進める一方、変数ｍが１でないと判定した場合（Ｓ２１７：Ｎｏ）、すなわち、ループＢの２巡目以降の処理である場合には、処理をステップＳ２１８に進める。 The instruction sentence extraction unit 104 then determines whether the value of variable m is 1 (S217). If it determines that variable m is 1 (S217: Yes), i.e., if this is the first iteration of loop B, the processing proceeds to step S219. If it determines that variable m is not 1 (S217: No), i.e., if this is the second or subsequent iteration of loop B, the processing proceeds to step S218.

ステップＳ２１８では、指示文抽出部１０４は、変数ｔ１の先頭からｐまでをｍ－１番目の指示文として指示文リストに追加し、処理をステップＳ２１９に進める。ステップＳ２１９では、指示文抽出部１０４は、変数ｔ１に変数ｔ２を代入する（Ｓ２１９）。 In step S218, the instruction statement extraction unit 104 adds the first instruction statement of variable t1 to p as the (m-1)th instruction statement to the instruction statement list, and proceeds to step S219. In step S219, the instruction statement extraction unit 104 assigns variable t2 to variable t1 (S219).

ステップＳ２１９の処理を終えた場合には、指示文抽出部１０４は、次の行番号を処理対象として、ループＢの処理を行い、配列変数のすべての行番号を対象としてループＢの処理を行った場合には、ループＢを抜ける。 When the processing of step S219 is completed, the directive extraction unit 104 performs processing of loop B on the next line number, and when processing of loop B has been completed for all line numbers of the array variable, it exits loop B.

次いで、指示文抽出部１０４は、配列変数Ｌ１の次の値を対象として、ループＡの処理を行い、配列変数Ｌ１のすべての値を対象としてループＡの処理を行った場合には、ループＡを抜けて、指示文分割処理を終了する。 The directive extraction unit 104 then processes loop A, targeting the next value of array variable L1, and when loop A has been processed for all values of array variable L1, it exits loop A and terminates the directive splitting process.

（代表語特定処理）
次に、ステップＳ１０４の代表語特定処理について説明する。 (Representative word identification process)
Next, the representative word specification process in step S104 will be described.

図１３は、第１実施形態に係る代表語特定処理のフローチャートである。 Figure 13 is a flowchart of the target word identification process according to the first embodiment.

まず、代表語特定部１０５の第１判定部１０６は、処理開始のトリガーを受け付けると、指示文抽出部１０４により生成された指示文リストを記憶する（Ｓ３０１）。 First, when the first determination unit 106 of the target word identification unit 105 receives a trigger to start processing, it stores the instruction sentence list generated by the instruction sentence extraction unit 104 (S301).

次いで、第１判定部１０６は、指示文リストの値を１つずつ変数ｔ３に代入することにより、ループＣの処理（Ｓ３０２～Ｓ３０９）を繰り返す。 Then, the first determination unit 106 repeats the processing of loop C (S302 to S309) by assigning the values of the directive statement list one by one to variable t3.

ループＣの処理において、第１判定部１０６は、代表語辞書Ｄ３の上から順に１つのレコードの〈代表語〉、〈第２判定コード〉、〈特徴語（１）〉、〈特徴語（２）〉のデータを取得することにより、ループＤの処理（Ｓ３０２～Ｓ３０９）を繰り返す。 In the processing of Loop C, the first judgment unit 106 repeats the processing of Loop D (S302 to S309) by obtaining the data for the representative word, second judgment code, characteristic word (1), and characteristic word (2) of one record in the representative word dictionary D3, starting from the top.

ループＤの処理では、第１判定部１０６は、〈特徴語（１）〉と〈特徴語（２）〉とのそれぞれのデータを読点『、』で分割し、〈特徴語（１）〉のデータを分割してできた文字列と、〈特徴語（２）〉のデータを分割してできた文字列とを総当たりで組合せて連結することによりキーワードを作成し、このキーワードのリスト（キーワードリスト）を生成する（Ｓ３０２）。例えば、図７の〈処理番号〉が４のレコードを対象としている場合には、『部品Ａハーネス』と、『部品Ａコード』とのキーワードを含むリストが生成される。 In the processing of Loop D, the first determination unit 106 divides the data for each of <Feature Word (1)> and <Feature Word (2)> at the comma ",", and creates keywords by combining and concatenating the character strings created by dividing the data for <Feature Word (1)> with the character strings created by dividing the data for <Feature Word (2)> in a brute-force manner, and generates a list of these keywords (keyword list) (S302). For example, if the target record is one with a <Processing Number> of 4 in Figure 7, a list containing the keywords "Part A Harness" and "Part A Code" will be generated.

次いで、第１判定部１０６は、変数ｔ３においてキーワードリストのキーワードを検索し、キーワードが見つかったか否かを判定する（Ｓ３０３）。 Next, the first determination unit 106 searches for keywords in the keyword list using variable t3 and determines whether the keywords are found (S303).

この結果、変数ｔ３においてキーワードが見つかった場合（Ｓ３０３：Ｙｅｓ）には、第１判定部１０６は、処理をステップＳ３０４に進める一方、変数ｔ３においてキーワードが見つからなかった場合（Ｓ３０３：Ｎｏ）には、第１判定部１０６は、処理をステップＳ３０８に進める。 As a result, if a keyword is found in variable t3 (S303: Yes), the first determination unit 106 proceeds to step S304, whereas if a keyword is not found in variable t3 (S303: No), the first determination unit 106 proceeds to step S308.

ステップＳ３０４では、第１判定部１０６は、レコードの〈判定コード〉にデータが格納されているか否かを判定する。この結果、〈判定コード〉にデータが格納されている場合（Ｓ３０４：Ｙｅｓ）には、第１判定部１０６は、〈判定コード〉に含まれているデータに基づいて第２判定部１０８に判定処理（図１４参照）を実行させる（Ｓ３０５）。 In step S304, the first determination unit 106 determines whether data is stored in the <determination code> of the record. As a result, if data is stored in the <determination code> (S304: Yes), the first determination unit 106 causes the second determination unit 108 to execute a determination process (see FIG. 14) based on the data contained in the <determination code> (S305).

次いで、第１判定部１０６は、判定処理からの判定結果を受け取り、判定結果が代表語として適合しているか否かを判定する（Ｓ３０６）。この結果、判定結果が代表語として適合しているである場合（Ｓ３０６：Ｙｅｓ）には、第１判定部１０６は、処理をステップＳ３０７に進める一方、判定結果が代表語として適合していないである場合（Ｓ３０６：Ｎｏ）には、処理をステップＳ３０８に進める。 Next, the first determination unit 106 receives the determination result from the determination process and determines whether the determination result is suitable as a representative word (S306). As a result, if the determination result is suitable as a representative word (S306: Yes), the first determination unit 106 proceeds to step S307, whereas if the determination result is not suitable as a representative word (S306: No), the first determination unit 106 proceeds to step S308.

ステップＳ３０４で〈判定コード〉にデータが格納されていない場合（Ｓ３０４：Ｎｏ）、又は、ステップＳ３０６で判定結果が代表語として適合しているである場合（Ｓ３０６：Ｙｅｓ）には、第１判定部１０６は、レコードの〈代表語〉の文字列を代表語と決定し、変数ｔ３（指示文）と対応付けて代表語リストに追加する（Ｓ３０７）。 If no data is stored in the <determination code> in step S304 (S304: No), or if the determination result in step S306 indicates that the record is suitable as a representative word (S306: Yes), the first determination unit 106 determines the character string in the <representative word> of the record as the representative word, associates it with variable t3 (instruction statement), and adds it to the representative word list (S307).

ステップＳ３０８では、第１判定部１０６は、代表語辞書に次のレコードがあるか否かを判定し、この結果、次のレコードがない場合（Ｓ３０８：Ｎｏ）には、代表語を「未確定」として、代表語リストに追加する一方、次のレコードがある場合（Ｓ３０８：Ｙｅｓ）には、このレコードに対する処理を終了する。 In step S308, the first judgment unit 106 determines whether there is a next record in the representative word dictionary. If there is no next record (S308: No), the representative word is added to the representative word list as "undetermined," whereas if there is a next record (S308: Yes), processing for this record is terminated.

１つのレコードに対するループＤの処理を終えた場合には、第１判定部１０６は、次のレコードを処理対象として、ループＤの処理を行い、すべてのレコードを対象としてループＤの処理を行った場合には、ループＤを抜ける。 When loop D processing for one record has finished, the first determination unit 106 performs loop D processing on the next record, and when loop D processing has been performed on all records, it exits loop D.

ループＤを抜けた場合には、第１判定部１０６は、指示文リストの次の値を処理対象として、ループＣの処理を行い、指示文リストのすべての値を対象としてループＣの処理を行った場合には、ループＣを抜けて、代表語特定処理を終了する。 When loop D is exited, the first determination unit 106 processes loop C with the next value in the instruction statement list as the processing target, and when loop C has been processed with all values in the instruction statement list as the processing target, loop C is exited and the target word identification process is terminated.

（判定処理）
次に、ステップＳ３０５の判定処理について説明する。 (Determination process)
Next, the determination process in step S305 will be described.

図１４は、第１実施形態に係る判定処理のフローチャートである。 Figure 14 is a flowchart of the determination process according to the first embodiment.

第２判定部１０８は、判定コードに基づいて、判定処理に使用する判定サブプログラム及び判定サブプログラムにおける引数を決定する（Ｓ４０１）。例えば、判定コードが、図７の〈処理番号〉が２のレコードの判定コードである場合には、『ｎｏ－』に対応する判定サブプログラム『Ｓｕｂｎｏ』を使用する判定サブプログラムと決定し、引数を『部品Ａ、部品Ｂ』と決定する。次いで、第２判定部１０８は、決定した引数を使って、決定した判定サブプログラムを実行することにより、判定サブプログラム処理（図１５参照）を実行する（Ｓ４０２）。次いで、第２判定部１０８は、判定サブプログラム処理による判定結果を返し（Ｓ４０３）、判定処理を終了する。 The second judgment unit 108 determines the judgment subprogram and arguments in the judgment subprogram to be used in the judgment process based on the judgment code (S401). For example, if the judgment code is the judgment code for a record with a <Processing Number> of 2 in Figure 7, it determines that the judgment subprogram to use is the judgment subprogram "Sub no" corresponding to "no-", and determines the arguments to be "Part A, Part B". Next, the second judgment unit 108 executes the determined judgment subprogram using the determined arguments, thereby performing the judgment subprogram process (see Figure 15) (S402). Next, the second judgment unit 108 returns the judgment result from the judgment subprogram process (S403) and ends the judgment process.

（判定サブプログラム処理）
次に、ステップＳ４０２の判定サブプログラム処理の一例について説明する。 (Decision subprogram processing)
Next, an example of the determination subprogram process in step S402 will be described.

図１５は、第１実施形態に係る判定サブプログラム処理の一例のフローチャートである。図１５の判定サブプログラム処理は、第２判定部１０８が判定サブプログラム「Ｓｕｂ－ｎ」を実行することにより、実現される。 Figure 15 is a flowchart of an example of the judgment subprogram processing according to the first embodiment. The judgment subprogram processing in Figure 15 is realized by the second judgment unit 108 executing the judgment subprogram "Sub-n."

判定サブプログラム「Ｓｕｂ－ｎ」を実行する第２判定部１０８は、引数からキーワードのリストを生成する（Ｓ５０１）。 The second judgment unit 108, which executes the judgment subprogram "Sub-n," generates a list of keywords from the arguments (S501).

次いで、第２判定部１０８は、変数ｔ３（指示文）においてキーワードリストのキーワードを検索し、キーワードが見つかったか否かを判定する（Ｓ５０２）。 Next, the second determination unit 108 searches for keywords in the keyword list in variable t3 (instruction statement) and determines whether the keywords are found (S502).

この結果、キーワードが見つかった場合（Ｓ５０２：Ｙｅｓ）には、処理対象のレコードの〈代表語〉の文字列が代表語でないことを意味しているので、第２判定部１０８は、判定結果を不適合に決定し（Ｓ５０３）、処理をステップＳ５０５に進める。 If a keyword is found (S502: Yes), this means that the character string in the <target word> of the record being processed is not a target word, so the second judgment unit 108 determines the judgment result to be incompatible (S503) and proceeds to step S505.

一方、キーワードが見つからなかった場合（Ｓ５０２：Ｎｏ）には、処理対象のレコードの〈代表語〉の文字列が代表語であることを意味しているので、第２判定部１０８は、判定結果を適合に決定し（Ｓ５０４）、処理をステップＳ５０５に進める。 On the other hand, if the keyword is not found (S502: No), this means that the character string in the <target word> of the record being processed is the target word, so the second judgment unit 108 determines the judgment result to be a match (S504) and proceeds to step S505.

ステップＳ５０５では、第２判定部１０８は、判定結果を返して判定サブプログラム処理を終了する。 In step S505, the second judgment unit 108 returns the judgment result and terminates the judgment subprogram processing.

次に、ステップＳ４０２の判定サブプログラム処理の他の例について説明する。 Next, we will explain another example of the judgment subprogram processing in step S402.

図１６は、第１実施形態に係る判定サブプログラム処理の他の例のフローチャートである。図１６の判定サブプログラム処理は、第２判定部１０８が判定サブプログラム「ｄａｔｅ－ａｆｔｅｒ」を実行することにより、実現される。 Figure 16 is a flowchart of another example of the judgment subprogram processing according to the first embodiment. The judgment subprogram processing in Figure 16 is realized by the second judgment unit 108 executing the judgment subprogram "date-after."

判定サブプログラム「ｄａｔｅ－ａｆｔｅｒ」を実行する第２判定部１０８は、引数を基準日付として取得する（Ｓ６０１）。 The second judgment unit 108, which executes the judgment subprogram "date-after," acquires the argument as the reference date (S601).

次いで、第２判定部１０８は、変数ｔ３の指示文を含んでいる仕様書の日付（仕様書日付）と、基準日付とを比較し（Ｓ６０２）、仕様書日付が基準日付よりも後か否かを判定する（Ｓ６０３）。 Next, the second determination unit 108 compares the date of the specification containing the directive of variable t3 (specification date) with the reference date (S602) and determines whether the specification date is later than the reference date (S603).

この結果、仕様書日付が基準日付よりも後である場合（Ｓ６０３：Ｙｅｓ）には、処理対象のレコードの〈代表語〉の文字列が代表語であることを意味しているので、第２判定部１０８は、判定結果を適合に決定し（Ｓ６０４）、処理をステップＳ６０６に進める。 As a result, if the specification date is later than the reference date (S603: Yes), this means that the character string in the <target word> of the record being processed is a target word, so the second judgment unit 108 determines the judgment result to be conforming (S604) and proceeds to step S606.

一方、仕様書日付が基準日付よりも後でない場合（Ｓ６０３：Ｎｏ）には、処理対象のレコードの〈代表語〉の文字列が代表語でないことを意味しているので、第２判定部１０８は、判定結果を不適合に決定し（Ｓ６０５）、処理をステップＳ６０６に進める。 On the other hand, if the specification date is not later than the reference date (S603: No), this means that the character string in the <target word> of the record being processed is not a target word, so the second judgment unit 108 determines the judgment result to be non-conforming (S605) and proceeds to step S606.

ステップＳ６０６は、第２判定部１０８は、判定結果を返して判定サブプログラム処理を終了する。 In step S606, the second judgment unit 108 returns the judgment result and terminates the judgment subprogram processing.

なお、判定サブプログラムとして、判定サブプログラム「Ｓｕｂ－ｎ」と判定サブプログラム「ｄａｔｅ－ａｆｔｅｒ」との２つを例として説明したが、より多くの判定サブプログラムを利用するようにしてもよい。この場合には、判定サブプログラム記憶部１０９に各判定サブプログラムを格納し、〈判定コード〉には、第２判定部１０８が判定サブプログラムを特定することのできるプログラム特定情報を格納し、第２判定部１０８は、プログラム特定情報に対応する判定サブプログラムを特定して実行するようにすればよい。また、新たな判定サブプログラムを追加する場合には、判定サブプログラム記憶部１０９に判定サブプログラムを格納し、第２判定部１０８を判定コードから新たな判定サブプログラムを特定できるように構成すればよい。これにより、代表語であるか否かを適切な判定処理を行う判定サブプログラムにより容易に実現することができる。 Note that while two judgment subprograms, "Sub-n" and "date-after," have been described as examples of judgment subprograms, more judgment subprograms may be used. In this case, each judgment subprogram is stored in the judgment subprogram storage unit 109, and program-specific information that enables the second judgment unit 108 to identify the judgment subprogram is stored in the <judgment code>, and the second judgment unit 108 identifies and executes the judgment subprogram corresponding to the program-specific information. Furthermore, when adding a new judgment subprogram, the judgment subprogram is stored in the judgment subprogram storage unit 109, and the second judgment unit 108 is configured to be able to identify the new judgment subprogram from the judgment code. This makes it possible to easily determine whether a word is a representative word using a judgment subprogram that performs appropriate judgment processing.

（代表語特定処理の判定結果）
次に、設計対象に関わる代表語についての代表語特定処理の判定結果について説明する。 (Results of the target word identification process)
Next, the determination result of the target word identification process for target words related to the design object will be described.

図１７は、第１実施形態に係る設計対象に関わる代表語の判定結果の一例を示す図である。図１７は、図２に示す仕様書２０１を処理対象として、図７に示す設計対象に関わる代表語の代表語辞書Ｄ３を用いて代表語特定処理を行った際の判定結果を示す。 Figure 17 is a diagram showing an example of the results of determining representative words related to the design object according to the first embodiment. Figure 17 shows the results of determining representative words when the specification 201 shown in Figure 2 is used as the processing object and representative word identification processing is performed using the representative word dictionary D3 for representative words related to the design object shown in Figure 7.

図１７に示す判定結果は、各指示文と、各指示文に対する代表語辞書の各レコードについての適合結果と、結果として特定された代表語とを示している。具体的には、指示文『部品Ａ固定部上部Ｒ拡大』は、代表語辞書の処理番号１のレコードに適合し、これにより、『本体前面』が代表語として特定されている。また、指示文『上部リブ根元Ｒ拡大』は、代表語辞書の処理番号２のレコードに適合し、これにより、『本体上部』が代表語として特定されている。また、指示文『部品Ａハーネス用ボス根元Ｒ拡大』は、代表語辞書の処理番号３のレコードに適合し、これにより、『本体上部』が代表語として特定されている。 The determination results shown in Figure 17 show each instruction sentence, the matching results for each record in the target word dictionary for each instruction sentence, and the target word identified as a result. Specifically, the instruction sentence "Enlarge R of upper fixing part of part A" matches the record with process number 1 in the target word dictionary, thereby identifying "front surface of main body" as the target word. Furthermore, the instruction sentence "Enlarge R of base of upper rib" matches the record with process number 2 in the target word dictionary, thereby identifying "top of main body" as the target word. Furthermore, the instruction sentence "Enlarge R of base of boss for part A harness" matches the record with process number 3 in the target word dictionary, thereby identifying "top of main body" as the target word.

次に、ＣＡＤ形状に関わる代表語についての代表語特定処理の判定結果について説明する。 Next, we will explain the results of the target word identification process for target words related to CAD shapes.

図１８は、第１実施形態に係るＣＡＤ形状に関わる代表語の判定結果の一例を示す図である。図１８は、図２に示す仕様書２０１を処理対象として、図８に示すＣＡＤ形状に関わる代表語の代表語辞書Ｄ３を用いて代表語特定処理を行った際の判定結果を示す。 Figure 18 is a diagram showing an example of the results of determining representative words related to CAD shapes in the first embodiment. Figure 18 shows the results of determining representative words when the specification 201 shown in Figure 2 is processed and representative word identification processing is performed using the representative word dictionary D3 for representative words related to CAD shapes shown in Figure 8.

図１８に示す判定結果は、各指示文と、各指示文に対する代表語辞書の各レコードについての適合結果と、結果として特定された代表語とを示している。具体的には、指示文『部品Ａ固定部上部Ｒ拡大』は、代表語辞書の処理番号３のレコードに適合し、これにより、『その他』が代表語として特定されている。また、指示文『上部リブ根元Ｒ拡大』は、代表語辞書の処理番号１のレコードに適合し、これにより、『リブ』が代表語として特定されている。また、指示文『部品Ａハーネス用ボス根元Ｒ拡大』は、代表語辞書の処理番号２のレコードに適合し、これにより、『ボス』が代表語として特定されている。 The determination results shown in Figure 18 show each instruction statement, the matching results for each record in the target word dictionary for each instruction statement, and the target word identified as a result. Specifically, the instruction statement "Enlarge R of upper fixed part of part A" matches the record with process number 3 in the target word dictionary, thereby identifying "other" as the target word. Furthermore, the instruction statement "Enlarge R of upper rib base" matches the record with process number 1 in the target word dictionary, thereby identifying "rib" as the target word. Furthermore, the instruction statement "Enlarge R of base of boss for part A harness" matches the record with process number 2 in the target word dictionary, thereby identifying "boss" as the target word.

（指示内容一覧表） (Instruction List)

次に、指示内容一覧表Ｒ１について説明する。 Next, we will explain the instruction list R1.

図１９は、第１実施形態に係る指示内容一覧表の構成図である。 Figure 19 is a diagram showing the structure of the instruction content list table according to the first embodiment.

指示内容一覧表Ｒ１は、〈仕様書番号〉、〈作成日〉、〈行番号〉、〈指示文〉、〈設計対象〉、〈ＣＡＤ形状〉、及び〈変更種類〉の項目を有する。 Instruction content list R1 has the following fields: <Specification number>, <Creation date>, <Line number>, <Instruction statement>, <Design object>, <CAD shape>, and <Change type>.

〈仕様書番号〉には、仕様書番号が格納される。〈作成日〉には、仕様書が作成された作成日が格納される。〈行番号〉には、行に対応する指示文の指示文記載範囲での行番号が格納される。〈指示文〉には、行に対応する指示文が格納される。〈設計対象〉には、行に対応する指示文における設計対象に関する代表語が格納される。〈ＣＡＤ形状〉には、行に対応する指示文におけるＣＡＤ形状に関する代表語が格納される。〈変更種類〉には、行に対応する指示文における変更種類に関する代表語が格納される。 <Specification number> stores the specification number. <Creation date> stores the date the specification was created. <Line number> stores the line number within the directive range of the directive corresponding to the line. <Directive> stores the directive corresponding to the line. <Design object> stores the representative word related to the design object in the directive corresponding to the line. <CAD shape> stores the representative word related to the CAD shape in the directive corresponding to the line. <Change type> stores the representative word related to the change type in the directive corresponding to the line.

（集計表）
次に、集計表Ｒ２について説明する。 (Summary table)
Next, the summary table R2 will be described.

図２０は、第１実施形態に係る集計表の構成図である。 Figure 20 is a diagram showing the layout of a summary table according to the first embodiment.

集計表Ｒ２は、〈指示カテゴリ〉、〈代表語〉、及び〈指示の数〉の項目を有する。〈指示カテゴリ〉には、代表語についてのカテゴリ、例えば、設計対象、ＣＡＤ形状、変更種類等が格納される。〈代表語〉には、特定された代表語が格納される。〈指示の数〉には、代表語が特定された指示の数が格納される Summary table R2 has the following fields: <Instruction Category>, <Representative Word>, and <Number of Instructions>. <Instruction Category> stores the category of the representative word, such as design object, CAD shape, or change type. <Representative Word> stores the identified representative word. <Number of Instructions> stores the number of instructions for which the representative word is identified.

次に、集計画面４０１について説明する。 Next, we will explain the tally screen 401.

図２１は、第１実施形態に係る集計画面の構成図である。 Figure 21 is a diagram showing the configuration of the tally screen in the first embodiment.

集計画面４０１は、設計対象の代表語についての特定された数についてのグラフが表示される設計対象グラフ表示領域４０２と、ＣＡＤ形状の代表語についての特定された数についてのグラフが表示されるＣＡＤ形状グラフ表示領域４０３と、変更の種類の代表語についての特定された数についてのグラフが表示される変更種類グラフ表示領域４０４とを含む。 The summary screen 401 includes a design object graph display area 402 that displays a graph showing the number of identified representative words for the design object, a CAD shape graph display area 403 that displays a graph showing the number of identified representative words for the CAD shape, and a change type graph display area 404 that displays a graph showing the number of identified representative words for the change type.

集計画面４０１によると、各カテゴリごとに、特定された代表語と、その代表語の割合とを容易に把握することができる。これにより、文書において、どの代表語が注目されているかを容易に把握することができる。 The tally screen 401 allows you to easily see the identified representative words and their proportions for each category. This makes it easy to see which representative words are attracting attention in a document.

（第２実施形態）
次に、第２実施形態に係る文書内指示抽出システム１０１Ａについて説明する。なお、第１実施形態に係る文書内指示抽出システム１０１と同様な構成については、同一の符号を付すこととする。 Second Embodiment
Next, a document instruction extraction system 101A according to a second embodiment will be described. Note that the same components as those in the document instruction extraction system 101 according to the first embodiment will be denoted by the same reference numerals.

（機能構成）
図２２は、第２実施形態に係る文書内指示抽出システムの機能構成図である。 (Functional configuration)
FIG. 22 is a functional configuration diagram of a document instruction extraction system according to the second embodiment.

文書内指示抽出システム１０１Ａは、文書内指示抽出システム１０１に対して、さらに、製品ＣＡＤモデル記憶部１１２と、ＣＡＤ表示部１１３とを含む。 The document instruction extraction system 101A further includes a product CAD model storage unit 112 and a CAD display unit 113 in addition to the document instruction extraction system 101.

製品ＣＡＤモデル記憶部１１２は、製品を構成する各オブジェクト（構成部）の形状情報（図形データ）と、各オブジェクトを示す代表語とが対応付けられた製品ＣＡＤモデル情報を記憶する。ＣＡＤ表示部１１３は、出力部１１１から、例えば、指示内容一覧表Ｒ１又は集計表Ｒ２を取得し、特定された数の多い代表語に対応付けられているオブジェクトを特定し、特定したオブジェクトを強調表示（例えば、ハイライト表示）した製品のＣＡＤモデルを含むＣＡＤモデル表示画面５０５（図２３参照）を表示する。例えば、特定された数の多い代表語としては、特定された数が最も多い代表語としてもよく、特定された数が所定の数よりも多い１以上の代表語としてもよい。 The product CAD model storage unit 112 stores product CAD model information in which shape information (graphic data) of each object (component) that makes up the product is associated with a representative word that indicates each object. The CAD display unit 113 obtains, for example, an instruction content list R1 or a summary table R2 from the output unit 111, identifies objects associated with representative words that are frequently identified, and displays a CAD model display screen 505 (see FIG. 23) that includes a CAD model of the product in which the identified objects are emphasized (e.g., highlighted). For example, the representative word that is frequently identified may be the representative word that is identified the most, or it may be one or more representative words that are identified a number of times greater than a predetermined number.

次に、ＣＡＤモデル表示画面５０５について説明する。 Next, we will explain the CAD model display screen 505.

図２３は、第２実施形態に係るＣＡＤモデル表示画面の構成図である。なお、図２３のＣＡＤモデル表示画面５０５は、本体側面のボス５０３が代表語として最も多く特定された場合に表示される画面である。 Figure 23 is a diagram showing the configuration of a CAD model display screen according to the second embodiment. Note that the CAD model display screen 505 in Figure 23 is the screen that is displayed when the boss 503 on the side of the main body is most frequently identified as the representative word.

ＣＡＤモデル表示画面５０５において、製品のＣＡＤモデル５０２が表示され、そのＣＡＤモデル５０２において、代表語として最も多く特定された本体側面のボス５０３がハイライト表示される。このＣＡＤモデル表示画面５０５によると、代表語として最も多く特定された部分、すなわち、指示文において最も多く指定された部分を容易に把握することができる。 The CAD model display screen 505 displays the CAD model 502 of the product, and highlights the boss 503 on the side of the main body that is most frequently identified as the representative word in the CAD model 502. This CAD model display screen 505 makes it easy to see which part is most frequently identified as the representative word, i.e., which part is most frequently specified in instruction sentences.

なお、本発明は、上述の実施形態に限定されるものではなく、本発明の趣旨を逸脱しない範囲で、適宜変形して実施することが可能である。 The present invention is not limited to the above-described embodiments, and can be modified and implemented as appropriate without departing from the spirit of the present invention.

例えば、上記実施形態では、ＣＰＵが行っていた処理の一部又は全部を、ハードウェア回路で行うようにしてもよい。また、上記実施形態におけるプログラムは、プログラムソースからインストールされてよい。プログラムソースは、プログラム配布サーバ又は記憶メディア（例えば可搬型の記憶メディア）であってもよい。 For example, in the above embodiments, some or all of the processing performed by the CPU may be performed by hardware circuits. Furthermore, the programs in the above embodiments may be installed from a program source. The program source may be a program distribution server or a storage medium (e.g., a portable storage medium).

１１…ＣＰＵ、１４…記憶装置、２０…文書内指示抽出プログラム、２１…文書ファイル、１０１，１０１Ａ…文書内指示抽出システム、Ｄ１…メタ情報手掛り語辞書、Ｄ２…指示文手掛り語辞書、Ｄ３…代表語辞書

11...CPU, 14...storage device, 20...intra-document instruction extraction program, 21...document file, 101, 101A...intra-document instruction extraction system, D1...meta-information clue word dictionary, D2...reference sentence clue word dictionary, D3...representative word dictionary

Claims

A document instruction extraction system that extracts instruction content from a document containing the instruction content and identifies a representative word that is a representative word corresponding to a phrase included in the instruction content,
The document instruction extraction system includes a processor and a storage unit.
The storage unit
one or more of said documents;
instruction content identification information for identifying the instruction content from the document;
and target word specification information for specifying the target word from the instruction content,
The processor:
Identifying the instruction content from the document based on the instruction content identification information;
Identifying a target word from the instruction content based on the target word identification information;
The representative word identification information corresponds to one or more characteristic words included in the instruction content, candidate representative words corresponding to the one or more characteristic words, and determination process information related to a determination process for determining that the candidate representative words are the representative words corresponding to the one or more characteristic words,
The processor:
determining whether the candidate for the target word is a target word based on the determination process information;
The storage unit
storing one or more determination programs for executing a determination process;
the determination process information includes information for identifying a determination program to be used in the determination process and information on arguments to be passed to the determination program;
The processor:
The determination program is executed using the arguments to determine whether the candidate representative word is a representative word.
A system for extracting instructions from documents.

A document instruction extraction system that extracts instruction content from a document containing the instruction content and identifies a representative word that is a representative word corresponding to a phrase included in the instruction content,
The document instruction extraction system includes a processor and a storage unit.
The storage unit
one or more of said documents;
instruction content identification information for identifying the instruction content from the document;
and target word specification information for specifying the target word from the instruction content,
The processor:
Identifying instruction content from the document based on the instruction content identification information;
Identifying a target word from the instruction content based on the target word identification information;
For each target term identified from the plurality of documents, tallying the number of instructions in which the target term was identified;
Based on the aggregated results, create and display a graph showing the number of instructions for multiple representative words.
A system for extracting instructions from documents.

A document instruction extraction system that extracts instruction content from a document containing the instruction content and identifies a representative word that is a representative word corresponding to a phrase included in the instruction content,
The document instruction extraction system includes a processor and a storage unit.
The storage unit
one or more of said documents;
instruction content identification information for identifying the instruction content from the document;
and target word specification information for specifying the target word from the instruction content,
The processor:
Identifying instruction content from the document based on the instruction content identification information;
Identifying a target word from the instruction content based on the target word identification information;
The storage unit
storing graphic data in which representative words are associated with components of the product;
The processor:
When displaying an image of a product based on the graphic data, a component of the product corresponding to a representative word identified from a document relating to the product is highlighted.
A system for extracting instructions from documents.

A document instruction extraction system that extracts instruction content from a document containing the instruction content and identifies a representative word that is a representative word corresponding to a phrase included in the instruction content,
The document instruction extraction system includes a processor and a storage unit.
The storage unit
one or more of said documents;
instruction content identification information for identifying the instruction content from the document;
and target word specification information for specifying the target word from the instruction content,
The processor:
Identifying instruction content from the document based on the instruction content identification information;
Identifying a target word from the instruction content based on the target word identification information;
The document is a document related to the design and manufacturing of a specific product,
The target word specifying information is
For the same representative word, a plurality of correspondence relationships are included between at least a plurality of characteristic words among a first characteristic word relating to the direction of the product in normal times, a second characteristic word relating to the direction in design drawings of the product, and a third characteristic word relating to the direction of the product during manufacturing.
A system for extracting instructions from documents.

A document instruction extraction method using a document instruction extraction system that extracts instruction content from a document containing the instruction content and extracts representative words that are representative words corresponding to words included in the instruction content,
The document instruction extraction system includes:
Identifying the instruction content from the document based on instruction content identification information for identifying the instruction content from the document;
Identifying a target word from the instruction content based on target word identification information for identifying the target word from the instruction content;
The representative word identification information corresponds to one or more characteristic words included in the instruction content, candidate representative words corresponding to the one or more characteristic words, and determination process information related to a determination process for determining that the candidate representative words are the representative words corresponding to the one or more characteristic words,
The document instruction extraction system includes:
determining whether the candidate for the target word is a target word based on the determination process information;
The document instruction extraction system includes:
storing one or more determination programs for executing a determination process;
the determination process information includes information for identifying a determination program to be used in the determination process and information on arguments to be passed to the determination program;
The document instruction extraction system includes:
The determination program is executed using the arguments to determine whether the candidate for the target word is the target word.
Intra-document instruction extraction method.

A document instruction extraction method using a document instruction extraction system that extracts instruction content from a document containing the instruction content and extracts representative words that are representative words corresponding to words included in the instruction content,
The document instruction extraction system includes:
Identifying the instruction content from the document based on instruction content identification information for identifying the instruction content from the document;
Identifying a target word from the instruction content based on target word identification information for identifying the target word from the instruction content;
For each target term identified from the plurality of documents, tallying the number of instructions in which the target term was identified;
Based on the aggregated results, create and display a graph showing the number of instructions for multiple representative words.
Intra-document instruction extraction method.

A document instruction extraction method using a document instruction extraction system that extracts instruction content from a document containing the instruction content and extracts representative words that are representative words corresponding to words included in the instruction content,
The document instruction extraction system includes:
Identifying the instruction content from the document based on instruction content identification information for identifying the instruction content from the document;
Identifying a target word from the instruction content based on target word identification information for identifying the target word from the instruction content;
When displaying an image of a product based on graphic data in which a representative word is associated with a component part of the product, the component part of the product corresponding to the representative word identified from a document relating to the product is highlighted.
Intra-document instruction extraction method.

A document instruction extraction method using a document instruction extraction system that extracts instruction content from a document containing the instruction content and extracts representative words that are representative words corresponding to words included in the instruction content,
The document instruction extraction system includes:
Identifying the instruction content from the document based on instruction content identification information for identifying the instruction content from the document;
Identifying a target word from the instruction content based on target word identification information for identifying the target word from the instruction content;
The document is a document related to the design and manufacturing of a specific product,
The target word specifying information is
For the same representative word, a plurality of correspondence relationships are included between at least a plurality of characteristic words among a first characteristic word relating to the direction of the product in normal times, a second characteristic word relating to the direction in design drawings of the product, and a third characteristic word relating to the direction of the product during manufacturing.
Intra-document instruction extraction method.

A document instruction extraction program executed by a computer to extract instruction contents from a document containing the instruction contents and extract representative words that are representative words corresponding to words included in the instruction contents,
The computer,
Identifying the instruction content from the document based on instruction content identification information for identifying the instruction content from the document;
Identifying a target word from the instruction content based on target word identification information for identifying the target word from the instruction content;
The representative word identification information corresponds to one or more characteristic words included in the instruction content, candidate representative words corresponding to the one or more characteristic words, and determination process information related to a determination process for determining that the candidate representative words are the representative words corresponding to the one or more characteristic words,
The computer,
determining whether the candidate representative word is a representative word based on the determination processing information;
the determination process information includes information for identifying a determination program to be used in the determination process and information on arguments to be passed to the determination program;
The computer,
The determination program is executed using the arguments to determine whether the candidate for the representative word is the representative word.
Document instruction extractor.

A document instruction extraction program executed by a computer to extract instruction contents from a document containing the instruction contents and extract representative words that are representative words corresponding to words included in the instruction contents,
The computer,
Identifying the instruction content from the document based on instruction content identification information for identifying the instruction content from the document;
Identifying a target word from the instruction content based on target word identification information for identifying the target word from the instruction content;
For each target word identified from the plurality of documents, counting the number of instructions in which the target word was identified;
Based on the aggregated results, create and display a graph showing the number of instructions for multiple representative words.
Document instruction extractor.

A document instruction extraction program executed by a computer to extract instruction contents from a document containing the instruction contents and extract representative words that are representative words corresponding to words included in the instruction contents,
The computer,
Identifying the instruction content from the document based on instruction content identification information for identifying the instruction content from the document;
Identifying a target word from the instruction content based on target word identification information for identifying the target word from the instruction content;
When displaying an image of a product based on graphic data in which a representative word is associated with a component part of the product, the component part of the product corresponding to the representative word identified from a document relating to the product is highlighted.
Document instruction extractor.

A document instruction extraction program executed by a computer to extract instruction contents from a document containing the instruction contents and extract representative words that are representative words corresponding to words included in the instruction contents,
The computer,
Identifying the instruction content from the document based on instruction content identification information for identifying the instruction content from the document;
Identifying a target word from the instruction content based on target word identification information for identifying the target word from the instruction content;
The document is a document related to the design and manufacturing of a specific product,
The target word specifying information is
For the same representative word, a plurality of correspondence relationships are included between at least a plurality of characteristic words among a first characteristic word relating to the direction of the product in normal operation, a second characteristic word relating to the direction in the design drawing of the product, and a third characteristic word relating to the direction of the product during manufacturing.
Document instruction extractor.