JP7171482B2

JP7171482B2 - Business Exceptional Case Extraction Support System and Business Exceptional Case Extraction Support Method

Info

Publication number: JP7171482B2
Application number: JP2019056444A
Authority: JP
Inventors: 治高田; 美奈子鳥羽; 光浩笈川; 裕和長瀬; 伸寛鶴崎; 彰彦菊池; 啓前澤; 頌田口
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2019-03-25
Filing date: 2019-03-25
Publication date: 2022-11-15
Anticipated expiration: 2039-03-25
Also published as: JP2020160546A

Description

本発明は、業務の外れケース抽出を支援する技術に関する。 TECHNICAL FIELD The present invention relates to technology for assisting the extraction of out-of-business cases.

特許文献１に、審査部門における資料の検証を効率化するために、資料に関連づけた検証項目の組み合わせを格納しておき、新たな資料に対して必要な検証項目を絞り込んで提示することで、検証を支援する技術が記載されている。 In Patent Document 1, in order to improve the efficiency of verification of materials in the examination department, a combination of verification items associated with materials is stored, and necessary verification items for new materials are narrowed down and presented. Techniques to assist verification are described.

特開２００９－２３８１７３号公報Japanese Patent Application Laid-Open No. 2009-238173

申請書に対する業務結果の正確性を検証する、すなわち検証により業務誤りケースを検出する場合に、申請書全件に対して検証を実施することは人員リソースの観点で困難であるため、抽出した一部の申請書に対して検証を行わざるを得ない。申請書全件からランダムに一部の申請書を抽出して検証を行う場合には、業務誤りケースの検出率が低いことが課題である。 When verifying the accuracy of business results for applications, that is, when detecting business error cases through verification, it is difficult to verify all applications from the perspective of human resources. There is no choice but to verify the department's application form. The problem is that the detection rate of business error cases is low when some applications are randomly selected from all applications for verification.

特許文献１には検証項目を絞り込むことで検証を効率化する方法が開示されているが、申請書に対する業務のように業務が複雑に入り組んでいる場合には、検証項目の絞り込みが困難であり適用できない。 Patent Document 1 discloses a method for streamlining verification by narrowing down verification items. Not applicable.

本発明は、申請書全件から一部の申請書を抽出して検証を行う場合に、業務誤りケースの検出率を高くすることが可能な業務の外れケース抽出支援システム、および業務の外れケース抽出支援方法を提供することを目的とする。 The present invention provides a business exception case extraction support system capable of increasing the detection rate of business error cases when extracting and verifying some application forms from all application forms, and a business exception case extraction support system. An object of the present invention is to provide an extraction support method.

本発明にかかる業務の外れケース抽出支援システムは、申請書の審査業務の外れケースを抽出する業務の外れケース抽出支援システムであって、前記申請書に含まれる名義尺度データと、前記名義尺度データに対応するカテゴリ値とに基づいて、名義尺度データについての第１の１／０データを生成し、前記申請書に含まれる順序／間隔／比尺度データと、前記順序／間隔／比尺度データに対応するカテゴリ値とに基づいて、順序／間隔／比尺度データについての第２の１／０データを生成し、前記申請書についての前記第１の１／０データと前記第２の１／０データとを含む１／０テーブルを生成する１／０データ作成部と、生成した前記１／０テーブルの申請書間の距離に基づいてクラスタ分析を行い、前記申請書のクラスタを算出するクラスタ分析部と、を備えることを特徴とする業務の外れケース抽出支援システムとして構成される。 A business failure case extraction support system according to the present invention is a business failure case extraction support system for extracting failure cases of examination business of an application, comprising nominal scale data included in the application, and the nominal scale data. generating first 1/0 data for the nominal scale data based on the category values corresponding to the ordinal/interval/ratio scale data contained in the application and generating a second 1/0 data for the ordinal/interval/ratio scale data based on the corresponding category values and the first 1/0 data and the second 1/0 data for the application form; a 1/0 data creation unit that generates a 1/0 table containing data, and a cluster analysis that performs cluster analysis based on the distance between the application forms in the generated 1/0 table and calculates the clusters of the application forms. , and is configured as a business exception case extraction support system.

本発明によれば、申請書全件から一部の申請書を抽出して検証を行う場合に、業務誤りケースの検出率を高くできる。 According to the present invention, it is possible to increase the detection rate of business error cases when extracting and verifying some application forms from all the application forms.

システム構成図の例である。It is an example of a system configuration diagram. 業務支援サーバの例である。It is an example of a business support server. 業務サーバの例である。This is an example of a business server. 業務の外れケース抽出システムの全体処理フローの例である。It is an example of the overall processing flow of the out-of-business case extraction system. 業務の外れケース抽出システムの全体処理フローの例である。It is an example of the overall processing flow of the out-of-business case extraction system. 業務の外れケース抽出システムの全体処理フローの例である。It is an example of the overall processing flow of the out-of-business case extraction system. 業務支援サーバの処理フローの例である。It is an example of the processing flow of a business support server. 業務支援サーバの処理フローの例である。It is an example of the processing flow of a business support server. 申請書情報データの例である。It is an example of application form information data. 名義尺度データのカテゴリ定義の例である。It is an example of category definition for nominal scale data. 順序、間隔、比尺度データのカテゴリ定義の例である。Examples of category definitions for ordinal, interval, and ratio scale data. カテゴリ定義に基づく１/０データの例である。It is an example of 1/0 data based on category definition. 申請書－クラスタ対応表の例である。This is an example of an application-cluster correspondence table. クラスタ情報の例である。It is an example of cluster information. 主要クラスタ間距離情報の例である。It is an example of distance information between major clusters. クラスタ情報確認画面の例である。It is an example of a cluster information confirmation screen. 業務の外れケース抽出結果画面の例である。It is an example of an out-of-work case extraction result screen. 前年度と今年度の変化差分抽出結果画面の例である。It is an example of the change difference extraction result screen of last year and this year.

以下、図面を参照して本発明の実施形態を説明する。以下の記載および図面は、本発明を説明するための例示であって、説明の明確化のため、適宜、省略および簡略化がなされている。本発明は、他の種々の形態でも実施する事が可能である。特に限定しない限り、各構成要素は単数でも複数でも構わない。 Embodiments of the present invention will be described below with reference to the drawings. The following description and drawings are examples for explaining the present invention, and are appropriately omitted and simplified for clarity of explanation. The present invention can also be implemented in various other forms. Unless otherwise specified, each component may be singular or plural.

図面において示す各構成要素の位置、大きさ、形状、範囲などは、発明の理解を容易にするため、実際の位置、大きさ、形状、範囲などを表していない場合がある。このため、本発明は、必ずしも、図面に開示された位置、大きさ、形状、範囲などに限定されない。 The position, size, shape, range, etc. of each component shown in the drawings may not represent the actual position, size, shape, range, etc., in order to facilitate understanding of the invention. As such, the present invention is not necessarily limited to the locations, sizes, shapes, extents, etc., disclosed in the drawings.

以下の説明では、「テーブル」、「リスト」等の表現にて各種情報を説明することがあるが、各種情報は、これら以外のデータ構造で表現されていてもよい。データ構造に依存しないことを示すために「ＸＸテーブル」、「ＸＸリスト」等を「ＸＸ情報」と呼ぶことがある。識別情報について説明する際に、「識別情報」、「識別子」、「名」、「ＩＤ」、「番号」等の表現を用いるが、これらについてはお互いに置換が可能である。 In the following description, various types of information may be described using expressions such as “table” and “list”, but various types of information may be expressed in data structures other than these. "XX table", "XX list", etc. are sometimes referred to as "XX information" to indicate that they do not depend on the data structure. When describing identification information, expressions such as “identification information”, “identifier”, “name”, “ID”, and “number” are used, but these can be replaced with each other.

同一あるいは同様な機能を有する構成要素が複数ある場合には、同一の符号に異なる添字を付して説明する場合がある。ただし、これらの複数の構成要素を区別する必要がない場合には、添字を省略して説明する場合がある。 When there are a plurality of components having the same or similar functions, they may be described with the same reference numerals and different suffixes. However, if there is no need to distinguish between these multiple constituent elements, the subscripts may be omitted in the description.

また、以下の説明では、プログラムを実行して行う処理を説明する場合があるが、プログラムは、プロセッサ（例えばＣＰＵ、ＧＰＵ）によって実行されることで、定められた処理を、適宜に記憶資源（例えばメモリ）および／またはインターフェースデバイス（例えば通信ポート）等を用いながら行うため、処理の主体がプロセッサとされてもよい。同様に、プログラムを実行して行う処理の主体が、プロセッサを有するコントローラ、装置、システム、計算機、ノードであってもよい。プログラムを実行して行う処理の主体は、演算部であれば良く、特定の処理を行う専用回路（例えばＦＰＧＡやＡＳＩＣ）を含んでいてもよい。 Also, in the following description, there are cases where processing performed by executing a program is described, but the program is executed by a processor (for example, CPU, GPU) to appropriately perform the specified processing using storage resources ( For example, a memory) and/or an interface device (for example, a communication port) or the like is used, so processing may be performed by a processor. Similarly, a main body of processing executed by executing a program may be a controller having a processor, a device, a system, a computer, or a node. The subject of the processing performed by executing the program may be an arithmetic unit, and may include a dedicated circuit (for example, FPGA or ASIC) that performs specific processing.

プログラムは、プログラムソースから計算機のような装置にインストールされてもよい。プログラムソースは、例えば、プログラム配布サーバまたは計算機が読み取り可能な記憶メディアであってもよい。プログラムソースがプログラム配布サーバの場合、プログラム配布サーバはプロセッサと配布対象のプログラムを記憶する記憶資源を含み、プログラム配布サーバのプロセッサが配布対象のプログラムを他の計算機に配布してもよい。また、以下の説明において、２以上のプログラムが１つのプログラムとして実現されてもよいし、１つのプログラムが２以上のプログラムとして実現されてもよい。 A program may be installed on a device, such as a computer, from a program source. The program source may be, for example, a program distribution server or a computer-readable storage medium. When the program source is a program distribution server, the program distribution server may include a processor and storage resources for storing the distribution target program, and the processor of the program distribution server may distribute the distribution target program to other computers. Also, in the following description, two or more programs may be implemented as one program, and one program may be implemented as two or more programs.

図１は、業務システムの一例として、業務支援サーバ１０と業務サーバ２０と業務クライアント端末３０がネットワーク５を介して相互に接続している業務システム１０００の構成を示す。業務オペレータ１が業務クライアント端末３０を使って業務システム１０００を利用する。 FIG. 1 shows a configuration of a business system 1000 in which a business support server 10, a business server 20, and a business client terminal 30 are interconnected via a network 5, as an example of a business system. A business operator 1 uses a business client terminal 30 to utilize a business system 1000 .

図２は、業務支援サーバ１０の構成例である。業務支援サーバ１０は記憶部１０１と演算部１０２で構成する。記憶部１０１は、ハードウェアとしては、例えば、ＨＤＤ（Hard Disk Drive）やＳＳＤ（Solid State Drive）等の記憶装置から構成される。また、演算部１０２は、ハードウェアとしては、例えば、ＣＰＵ（Central Processing Unit）やＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）を有した演算装置から構成される。 FIG. 2 is a configuration example of the business support server 10. As shown in FIG. The business support server 10 is composed of a storage unit 101 and a calculation unit 102 . The storage unit 101 is configured by, for example, a storage device such as an HDD (Hard Disk Drive) or an SSD (Solid State Drive) as hardware. Further, the computing unit 102 is configured by, for example, a computing device having a CPU (Central Processing Unit), RAM (Random Access Memory), and ROM (Read Only Memory) as hardware.

記憶部１０１は、申請書情報データ１１と、名義尺度データのカテゴリ定義１２と、順序、間隔、比尺度データのカテゴリ定義１３と、カテゴリ定義に基づく１／０データ１４と、申請書－クラスタ対応表１５と、クラスタ情報１６と、主要クラスタ間距離情報１７と、で構成する。 The storage unit 101 stores application form information data 11, category definitions 12 of nominal scale data, category definitions 13 of order, interval, and ratio scale data, 1/0 data 14 based on the category definitions, and application form-cluster correspondence. It consists of Table 15, cluster information 16, and distance information 17 between major clusters.

演算部１０２は、カテゴリ定義に基づく１／０データ作成部１Ａと、クラスタ分析部１Ｂと、業務の外れケース抽出部（業務外れケース抽出部）１Ｃと、申請書２に該当するクラスタの業務の外れケース抽出部（クラスタ業務外れケース抽出部）１Ｄと、クラスタ情報間差分抽出部１Ｅと、で構成する。 The calculation unit 102 includes a 1/0 data creation unit 1A based on the category definition, a cluster analysis unit 1B, a business exception case extraction unit (business exception case extraction unit) 1C, and a business cluster corresponding to the application form 2. It is composed of an out-of-case extraction unit (cluster out-of-business case extraction unit) 1D and an inter-cluster information difference extraction unit 1E.

演算部１０２が有する各部は、プログラムの実行により実現される。例えば、ＣＰＵが、ＲＯＭからプログラムを読み出して実行することにより、演算部１０２の各部の機能が実現される。上記プログラムは、ＵＳＢ(Universal Serial Bus)メモリ等の記憶媒体から読み出されたり、ネットワークを介した他のコンピュータからダウンロードする等して、業務支援サーバ１０に提供されてもよい。 Each unit included in the calculation unit 102 is implemented by executing a program. For example, the function of each part of the arithmetic unit 102 is realized by the CPU reading out and executing a program from the ROM. The program may be provided to the business support server 10 by being read from a storage medium such as a USB (Universal Serial Bus) memory or downloaded from another computer via a network.

図３は、業務サーバ２０の構成例である。業務サーバ２０は、業務支援サーバ１０の申請書情報データ１１と同内容の申請書情報データ２１を保管する記憶部２０１と、業務結果記録部２２から構成する演算部２０２と、で構成する。記憶部２０１、演算部２０２は、ハードウェアとしては、業務支援サーバ１０と同様、従来から知られている一般的なコンピュータにより構成される。 FIG. 3 is a configuration example of the business server 20. As shown in FIG. The business server 20 comprises a storage unit 201 for storing application information data 21 having the same content as the application information data 11 of the business support server 10 and a calculation unit 202 comprising a business result recording unit 22 . As hardware, the storage unit 201 and the calculation unit 202 are configured by a conventionally known general computer, like the business support server 10 .

図４、図５、図６に３つの全体シーケンス例を示す。 4, 5 and 6 show three overall sequence examples.

図４は、申請書情報データを分析、加工し、クラスタ情報および業務外れケース抽出結果を表示する全体シーケンスの例である。以下シーケンスの各ステップについて説明する。 FIG. 4 is an example of an overall sequence for analyzing and processing application form information data and displaying cluster information and results of extracting non-work cases. Each step of the sequence will be described below.

業務オペレータ１は、業務クライアント端末３０を介して、業務サーバ２０の業務結果記録部２２を呼び出し、当該業務結果記録部２２が申請書情報データ２１を記録する。申請書情報データ２１は、図９の申請書情報データ１１と同じ形式である。例えば、業務オペレータ１は、一つの申請書の内容を、申請書情報データ２１（申請書情報データ１１と同じ形式）のレコード１の１１０２～１１１０、…に入力し、申請書が審査基準に合致している場合には、審査結果１１１１に”ＯＫ”を入力する。この操作を、申請書の数だけ繰り返して、業務結果記録部２２が申請書数と同数のレコードを作成する。（Ｓ２０１）
業務支援サーバ１０は、業務サーバ２０の申請書情報データ２１を受け取り、申請書情報データ１１にコピーし（Ｓ１００）、図７、８の処理により、申請書情報データ１１を分析、加工する。（Ｓ１０１～Ｓ１０７）
業務支援サーバ１０は、業務クライアント端末３０を介して業務オペレータ１にクラスタ情報確認画面３００を表示する。業務オペレータ１は、表示されたクラスタ情報確認画面３００を確認することで、申請書にどのようなパターン（クラスタ）が含まれており、それぞれのクラスタがどのような特徴を持つかを確認することができる。また、表示された樹形図（デンドログラム）３０１を確認することで、クラスタ間の類似性を確認することができる。これらにより、業務オペレータ１は、申請書の全体傾向を把握することができる。（Ｓ１１０）
業務支援サーバ１０は、業務クライアント端末３０を介して業務オペレータ１に業務の外れケース抽出結果画面３１０を表示する。業務オペレータ１は、業務の外れケース抽出結果画面３１０に表示された、めずらしいレア業務ケース３１１と、誤りの多い業務ケース３１２と、よくあるメジャーな誤りの少ない業務ケース３１３と、を確認できる。（Ｓ１１１）めずらしいレア業務ケース３１１、誤りの多い業務ケース３１２、よくあるメジャーな誤りの少ない業務ケース３１３は、図７、８の処理において判断される。 The business operator 1 calls the business result recording unit 22 of the business server 20 via the business client terminal 30 , and the business result recording unit 22 records the application form information data 21 . The application form information data 21 has the same format as the application form information data 11 in FIG. For example, the business operator 1 inputs the contents of one application form to 1102 to 1110, . If they match, enter “OK” in the examination result 1111 . This operation is repeated by the number of applications, and the work result recording unit 22 creates the same number of records as the number of applications. (S201)
The business support server 10 receives the application information data 21 of the business server 20, copies it to the application information data 11 (S100), and analyzes and processes the application information data 11 by the processes of FIGS. (S101-S107)
The business support server 10 displays a cluster information confirmation screen 300 to the business operator 1 via the business client terminal 30 . The business operator 1 checks the displayed cluster information confirmation screen 300 to check what patterns (clusters) are included in the application form and what characteristics each cluster has. can be done. Also, by checking the displayed tree diagram (dendrogram) 301, it is possible to check the similarity between clusters. These allow the business operator 1 to grasp the overall trend of the application forms. (S110)
The business support server 10 displays a business exception case extraction result screen 310 to the business operator 1 via the business client terminal 30 . The business operator 1 can confirm rare business cases 311, business cases 312 with many errors, and business cases 313 with few major errors that are displayed on the business exception case extraction result screen 310. FIG. (S111) A rare business case 311, a business case 312 with many errors, and a business case 313 with few major errors are determined in the processing of FIGS.

Ｓ１１１で表示された業務の外れケース抽出結果画面３１０の２つの活用例について以下説明する。
１つ目の活用例を説明する。業務誤りがないかのチェックを担当するオペレータ１は、申請書内容がめずらしいレア申請書に業務誤りが多いと想定されるため、画面３１０のめずらしいレア業務ケース３１１に該当する申請書情報データ１１をクラスタの特徴１６４の観点を中心に、優先的にチェックすることで、効率的に業務誤りがないかのチェックを行うことができる。 Two utilization examples of the business exception case extraction result screen 310 displayed in S111 will be described below.
The first application example will be explained. Operator 1, who is in charge of checking whether there are any business errors, is assumed to have many business errors in rare application forms with rare application content. By preferentially checking from the viewpoint of the cluster feature 164, it is possible to efficiently check whether there is an operational error.

２つ目の活用例を説明する。申請書に対する審査結果１１１１＝ＮＧ判定（否判定）とすべきものの取りこぼしがないかをチェックする担当の業務オペレータ１は、審査結果＝ＮＧの申請書と類似内容の申請書は同様に審査結果＝ＮＧとすべきである可能性が高いため、画面３１０の誤りの多い業務ケース３１２に該当する申請書情報データ１１をクラスタの特徴１６４の観点を中心に、優先的にチェックすることで、効率的に審査結果１１１１＝ＮＧ判定とすべきものの取りこぼしがないかチェックを行うことができる。 A second application example will be described. Examination result 1111 for the application form = NG judgment (no judgment) Business operator 1 in charge of checking whether there is any omission, the examination result = NG application form and similar examination result = Since there is a high possibility that it should be NG, the application form information data 11 corresponding to the business case 312 with many errors on the screen 310 is preferentially checked mainly from the viewpoint of the cluster characteristics 164. Then, it is possible to check whether or not there is anything that should be judged as the examination result 1111 = NG.

図５は、申請書情報データ１１を分析、加工しておき、新たな申請書２に対して、業務外れケース抽出結果の該当部分を表示する全体シーケンスの例である。以下シーケンスの各ステップについて説明する。 FIG. 5 is an example of an overall sequence in which the application form information data 11 is analyzed and processed, and the relevant part of the result of the out-of-work case extraction is displayed for the new application form 2 . Each step of the sequence will be described below.

まず、図４と同様のＳ２０１、Ｓ１００、Ｓ１０１～Ｓ１０７の処理を行っておく。 First, the processes of S201, S100, and S101 to S107 similar to those in FIG. 4 are performed.

業務オペレータ１は、新たな申請書２の情報（申請書情報データ１１の１１０２～１１１０、…の情報）を、業務クライアント端末３０を介して、業務支援サーバ１０に入力する（Ｓ１１２）。 The business operator 1 inputs the information of the new application form 2 (the information of 1102 to 1110, . . . of the application form information data 11) to the business support server 10 via the business client terminal 30 (S112).

業務支援サーバ１０は、Ｓ１１２で入力された情報が業務の外れケース抽出結果画面３１０の３つのケース（めずらしいレア業務ケース３１１、誤りの多い業務ケース３１２、よくあるメジャーな誤りの少ない業務ケース３１３）のいずれかあるいは複数に該当するかを判定し、該当するケース（クラスタ）のみを業務クライアント端末３０の業務の外れケース抽出結果画面３１０に表示する（図１７の例で該当するケース（クラスタ）のみを表示）。（Ｓ１１３）
業務の外れケース抽出結果画面３１０に表示される情報の２つの活用例について以下説明する。
１つ目の活用例を説明する。申請書の審査業務を行うスタッフに、審査ルールに詳しい経験のあるベテランスタッフと審査ルールに詳しくない初心者スタッフがいる場合を想定する。ベテランスタッフと初心者スタッフに申請書を振り分けるオペレータ１は、申請書２情報をＳ１１２で入力する。Ｓ１１３で、業務支援サーバ１０が、めずらしいレア業務ケース３１１や誤りの多い業務ケース３１２と判定して表示した場合には、申請書２をベテランスタッフに振り分け、かつ、該当するクラスタの特徴１６４をそのベテランスタッフに提供する。Ｓ１１３で、業務支援サーバ１０が、よくあるメジャーな誤りの少ない業務ケース３１３に該当と判定して表示した場合には、申請書２を初心者スタッフに振り分ける。このように申請書の振り分けの最適化を行うことで、審査業務全体を、より正確に、より迅速にすることができる。 The business support server 10 extracts the information input in S112 in three cases (a rare business case 311, a business case 312 with many errors, and a business case 313 with few major errors). , and only the applicable cases (clusters) are displayed on the business exception case extraction result screen 310 of the business client terminal 30 (in the example of FIG. 17, only the applicable cases (clusters) Show). (S113)
Two utilization examples of the information displayed on the out-of-work case extraction result screen 310 will be described below.
The first application example will be explained. It is assumed that there are experienced staff members who are familiar with the examination rules and novice staff members who are not familiar with the examination rules among the staff members who examine applications. The operator 1 who distributes the application form to the experienced staff and the novice staff inputs the application form 2 information in S112. In S113, when the business support server 10 determines and displays a rare business case 311 or a business case 312 with many errors, the application form 2 is sorted to the veteran staff, and the feature 164 of the corresponding cluster is displayed. Provide veteran staff. In S113, when the business support server 10 determines that it corresponds to the common business case 313 with few major errors and displays it, it distributes the application form 2 to the novice staff. By optimizing the allocation of applications in this way, the entire examination process can be made more accurate and faster.

２つ目の活用例を説明する。オペレータ１が申請書全体の中で特異な申請書を検出したいとする。この場合、Ｓ１１３で、業務支援サーバ１０が、業務のはずれケース抽出結果画面３１０に、めずらしいレア業務ケース３１１に該当すると判定した場合に、特異な申請書を検出したと判定することもできる。 A second application example will be described. Suppose operator 1 wishes to detect a unique application form among all application forms. In this case, in S113, when the business support server 10 determines that the business failure case extraction result screen 310 corresponds to the rare business case 311, it can be determined that a peculiar application form has been detected.

図６は、前年度の申請書群と今年度の申請書群をそれぞれ分析、加工し、それぞれクラスタ情報１６を作成し、その差分を抽出し表示する全体シーケンスの例である。以下シーケンスの各ステップについて説明する。 FIG. 6 is an example of an overall sequence in which the group of application forms for the previous year and the group of application forms for the current year are analyzed and processed, respectively, the cluster information 16 is created, and the difference between them is extracted and displayed. Each step of the sequence will be described below.

まず、前年度の申請書群と今年度の申請書群に対してそれぞれ図４と同じＳ２０１、Ｓ１００、Ｓ１０１～Ｓ１０７の処理を行っておく。 First, the same processing of S201, S100, and S101 to S107 as in FIG. 4 is performed for the group of application forms of the previous year and the group of application forms of the current year.

次に、業務支援サーバ１０は、得られた、前年度分のクラスタ情報１６と、今年度分のクラスタ情報１６について、クラスタに所属する申請書レコード（申請書情報データ１１の１１０２～１１１０、…）が同一あるいはとても類似性が高い場合に、前年度分のクラスタと今年度分のクラスタを対応づける（図１８のクラスタ変化差分３２１の”クラスタ１前年度”，”クラスタ１今年度”のように対応付ける）。対応づけてその差分を表示した例が図１８の前年度と今年度の変化差分抽出結果画面３２０である。（Ｓ１１４）上記レコードの類似性がとても高いとは、例えば、あらかじめ定められた所定の基準を満たす場合（例えば、全体の項目数の８０％以上が一致している場合）である。 Next, the business support server 10 extracts the cluster information 16 for the previous year and the cluster information 16 for the current year obtained from the application records belonging to the clusters (1102 to 1110 of the application information data 11, . . . ). ) are the same or very similar, the clusters for the previous year and the clusters for the current year are associated with each other (such as "Cluster 1 last year" and "Cluster 1 this year" in the cluster change difference 321 in FIG. 18). ). An example in which the differences are displayed in association is the change difference extraction result screen 320 of FIG. 18 between the previous year and the current year. (S114) The similarity of the records is very high, for example, when a predetermined criterion is satisfied (for example, when 80% or more of the total number of items match).

変化差分抽出結果画面３２０に表示される情報の３つの活用例について以下説明する。 Three utilization examples of the information displayed on the change difference extraction result screen 320 will be described below.

１つ目の活用例を説明する。前年度と今年度で、申請書内容や審査業務に大きな変化がない場合に、申請書と審査結果の変化を確認したいオペレータ１がいるとする。オペレータ１は、Ｓ１１４で表示された前年度と今年度の変化差分抽出結果画面３２０で、クラスタ１変化差分３２１やクラスタ４変化差分３２２や今年度新規クラスタ３２３を糸口に、前年度と今年度間の変化の理由を探ることができる。 The first application example will be explained. Assume that there is an operator 1 who wants to check the changes in the application form and examination results when there is no significant change in the application form content or the examination work between the previous year and the current year. Operator 1 uses the cluster 1 change difference 321, the cluster 4 change difference 322, and the current year new cluster 323 on the change difference extraction result screen 320 of the previous year and the current year displayed in S114 to extract the difference between the previous year and the current year. It is possible to explore the reasons for the change in

２つ目の活用例を説明する。制度の改訂前と改訂後の申請書内容や審査業務の変化を把握したいオペレータ１がいるとする。制度の改訂前と改訂後の申請書群に対して、図６同様の処理を行うことで、オペレータは、Ｓ１１４で表示された変化差分抽出結果画面３２０を確認することで、制度の改訂前と改訂後の申請書内容や審査業務の変化を把握することができる。 A second application example will be described. Suppose that there is an operator 1 who wants to grasp changes in application contents and examination work before and after revision of the system. By performing the same processing as in FIG. 6 on the application forms before and after the revision of the system, the operator can check the change difference extraction result screen 320 displayed in S114 to obtain the group of application forms before and after the revision of the system. It is possible to understand changes in application content and examination work after revision.

３つ目の活用例を説明する。毎月の申請書内容や審査業務の変化を把握したいオペレータ１がいるとする。月毎の申請書群に対して、図６と同様の処理を行うことで、オペレータ１は、Ｓ１１４で表示された変化差分抽出結果画面３２０を確認することで、毎月の申請書内容や審査業務の変化を把握することができる。 A third application example will be described. Suppose that there is an operator 1 who wants to grasp monthly changes in application forms and examination work. By performing the same processing as in FIG. 6 for the group of monthly application forms, the operator 1 can check the change difference extraction result screen 320 displayed in S114, thereby confirming the content of the monthly application forms and examination work. change can be grasped.

図７と図８は、業務支援サーバ１０が申請書情報データ１１を加工してカテゴリ定義に基づく１／０データ１４を作成し（Ｓ１０１～Ｓ１０４）、クラスタ分析を行い（Ｓ１０５）、クラスタ情報１６４を作成する（Ｓ１０６～Ｓ１０７）処理フローを示す図である。以下、フローの各ステップについて説明する。 7 and 8, the business support server 10 processes the application form information data 11 to create 1/0 data 14 based on the category definition (S101 to S104), performs cluster analysis (S105), cluster information 164 is created (S106-S107) processing flow. Each step of the flow will be described below.

業務支援サーバ１０は、申請書情報データ２１のコピーを業務サーバ２０より受け取り、申請書情報データ１１に格納する（Ｓ１０１）。 The business support server 10 receives a copy of the application information data 21 from the business server 20 and stores it in the application information data 11 (S101).

ここで、申請書情報データ１１の「審査結果１１１」以外の各フィールド（１１０１～１１１０、…）の種類について説明する。データの各フィールドには２つの種類がある。 Here, the types of each field (1101 to 1110, . Each field of data is of two types.

１つ目は、ある値が他とは異なるか同一かの意味を持つデータであり、“名義尺度データ”と呼ぶ。例えば、フィールド１１０１、１１０３、１１０６、１１０７、１１０８、１１１０がこれに該当する。一般的には、性別、同居／別居、等の区分が名義尺度データの例である。 The first is data that has the meaning of whether a certain value is different from or the same as another, and is called "nominal scale data". For example, fields 1101, 1103, 1106, 1107, 1108, and 1110 correspond to this. In general, categories such as gender, cohabitation/separation, etc. are examples of nominal scale data.

２つ目は、値の順序や間隔や比率の意味を持つデータであり、“順序、間隔、比尺度データ”と呼ぶ。例えば、フィールド１１０２、１１０４、１１０５、１１０９がこれに該当する。一般的には、年月は順序および間隔尺度データ、金額は比尺度データ、温度は比尺度データ、資格の等級は順序尺度データ、の例である。 The second type is data having the meaning of order, interval, and ratio of values, and is called "order, interval, ratio scale data". For example, fields 1102, 1104, 1105 and 1109 correspond to this. In general, years and months are examples of ordinal and interval scale data, money is ratio scale data, temperature is ratio scale data, and qualification grades are ordinal scale data.

業務オペレータ１は、業務クライアント端末３０を介して業務支援サーバ１０にアクセスし、１／０データ作成部１Ａが、名義尺度データのカテゴリ定義１２を作成する。 The business operator 1 accesses the business support server 10 via the business client terminal 30, and the 1/0 data creation unit 1A creates the category definition 12 of the nominal scale data.

すなわち、業務オペレータ１からの操作により、１／０データ作成部１Ａは、名義尺度データのカテゴリ定義１２のカテゴリ値１２２を、業務サーバ２０の申請書情報データ２１の設計書から転記する。例えば、申請書情報のフィールド名である申請者区分１１０３に対応するカテゴリ値１２２欄は、申請者区分１１０３が４つの値“申請者区分Ａ～Ｄ”をとりうることを意味する。 That is, according to the operation of the business operator 1, the 1/0 data creation unit 1A transcribes the category value 122 of the category definition 12 of the nominal scale data from the design document of the application form information data 21 of the business server 20. For example, the category value 122 column corresponding to the applicant category 1103, which is the field name of the application form information, means that the applicant category 1103 can take four values "applicant categories A to D."

さらに、業務オペレータ１からの操作により、１／０データ作成部１Ａは、各フィールド名のカテゴリ値１２２に対して、一つ以上のカテゴリ値１２２をまとめた上位カテゴリ値があれば、これを上位カテゴリ値１２３に記載する。例えば、申請書情報のフィールド名である申請者区分“長男”、“長女”“次男”“次女”の４つのカテゴリ値があった場合に、これらを上位カテゴリ１２３“子”に対応づけることができる。 Furthermore, according to the operation from the business operator 1, the 1/0 data creation section 1A, if there is a superordinate category value that summarizes one or more category values 122 for the category value 122 of each field name, converts it to a superordinate category value. Described in the category value 123. For example, if there are four category values of applicant categories "eldest son", "eldest daughter", "second son", and "second daughter", which are field names of application information, these can be associated with the upper category 123 "child". can.

業務オペレータ１からの操作により、１／０データ作成部１Ａは、これらの処理を申請書情報データ２１の全ての名義尺度データフィールド（１１０１、１１０３、１１０６、１１０７、１１０８、１１１０、…）に対して繰り返して、名義尺度データのカテゴリ定義１２を作成する。（Ｓ１０２）
業務オペレータ１は、業務クライアント端末３０を介して業務支援サーバ１０にアクセスし、１／０データ作成部１Ａが、順序、間隔、比尺度データのカテゴリ定義１３を作成する。 By the operation from the business operator 1, the 1/0 data creation unit 1A performs these processes on all nominal scale data fields (1101, 1103, 1106, 1107, 1108, 1110, . . . ) of the application form information data 21. to create a category definition 12 for nominal scale data. (S102)
The business operator 1 accesses the business support server 10 via the business client terminal 30, and the 1/0 data creation unit 1A creates category definitions 13 for order, interval, and ratio scale data.

業務オペレータ１からの操作により、１／０データ作成部１Ａは、業務の区切りとなる値１３２と、値１３２に対応するカテゴリ値１３３との対応を、業務オペレータ１により定められた所定の業務観点から決め、入力する。例えば、申請者生年月日フィールド１１０２は、生年の値１３２に対応する４カテゴリ（カテゴリ値１３３）に区分することを示している。これにより、順序、間隔、比尺度データを、カテゴリ値１３３に対応づけることができる。 By an operation from the business operator 1, the 1/0 data creation unit 1A determines the correspondence between the value 132, which is the delimiter of the business, and the category value 133 corresponding to the value 132, from a predetermined business viewpoint determined by the business operator 1. Decide from and enter. For example, the applicant date of birth field 1102 indicates that it is divided into 4 categories (category value 133) corresponding to the value 132 of the year of birth. This allows order, interval, and ratio scale data to be associated with category values 133 .

業務オペレータ１からの操作により、１／０データ作成部１Ａは、これら処理を申請書情報データ２１の全ての順序、間隔、比尺度データフィールド（１１０２、１１０４、１１０５、１１０９、…）に対して繰り返して、名義尺度データのカテゴリ定義１２を作成する。（Ｓ１０３）
業務支援サーバ１０のカテゴリ定義に基づく１／０データ作成部１Ａは、名義尺度データのカテゴリ定義１２と、順序、間隔、比尺度データのカテゴリ定義１３を参照して、申請書情報データ１１をカテゴリ値（１or０）に変換し、カテゴリ定義に基づく１/０データ１４に格納する。 By the operation from the business operator 1, the 1/0 data creation unit 1A performs these processes on all the order, interval, ratio scale data fields (1102, 1104, 1105, 1109, . . . ) of the application form information data 21. A category definition 12 for nominal scale data is created repeatedly. (S103)
The 1/0 data creation unit 1A based on the category definition of the business support server 10 refers to the category definition 12 of the nominal scale data and the category definition 13 of the order, interval, and ratio scale data, and categorizes the application form information data 11 into categories. It is converted to a value (1 or 0) and stored in 1/0 data 14 based on the category definition.

名義尺度データの場合の具体例を示す。業務支援サーバ１０の１／０データ作成部１Ａは、申請書情報データ１１の申請者区分１１０３のレコード１（１１２）の値“申請者区分Ａ”を取得し、名義尺度データのカテゴリ定義１２の“申請者区分Ａ”に対応する１２０１行を参照し、“申請者区分Ａ”の上位カテゴリ値”申請者上位区分１”を取得する。次に、１／０データ作成部１Ａは、カテゴリ定義に基づく１/０データ１４の、申請者区分１１０３行のレコード１列（１４３）に対して、“申請者区分Ａ”に対応するレコード１列（１４３）の値に”１”を格納し、それ以外の”申請者区分Ｂ～Ｄ”に対応するレコード１列（１４３）の値に値”０”を格納する。さらに、１／０データ作成部１Ａは、申請者上位区分１１２１行のレコード１列（１４３）に対して、“申請者上位区分１”に対応するレコード１列（１４３）の値に”１”を格納し、それ以外の”申請者上位区分２”に対応するレコード１列（１４３）の値に”０”を格納する（第１の１／０データ）。 A specific example in the case of nominal scale data is shown. The 1/0 data creation unit 1A of the business support server 10 acquires the value “applicant category A” of the record 1 (112) of the applicant category 1103 of the application form information data 11, Refer to line 1201 corresponding to "Applicant Category A" to acquire the upper category value "Applicant Upper Category 1" of "Applicant Category A". Next, the 1/0 data creation unit 1A creates a record 1 corresponding to "applicant classification A" for the record 1 column (143) of the applicant classification 1103 row of the 1/0 data 14 based on the category definition. "1" is stored in the value of column (143), and "0" is stored in the value of record 1 column (143) corresponding to the other "applicant categories B to D". Furthermore, the 1/0 data creation unit 1A sets the value of the record 1 column (143) corresponding to the "applicant upper category 1" to "1" for the record 1 column (143) of the applicant upper category 1121 row. is stored, and "0" is stored in the value of the other record 1 column (143) corresponding to "applicant upper division 2" (first 1/0 data).

次に、順序、間隔、比尺度データの場合の具体例を示す。業務支援サーバ１０の１／０データ作成部１Ａは、申請書情報データ１１の申請者生年月日１１０２のレコード１（１１２）の値”1950/01/01”を取得し、順序、間隔、比尺度データのカテゴリ定義１３の、”1950/01/01”に対応する１３０３行を参照し、カテゴリ値”申請者生年月日区分３”を取得する。次に、カテゴリ定義に基づく１/０データ１４の、申請者生年月日１１０２行のレコード１列（１４３）に対して、“申請者生年月日区分３”に対応するレコード１列（１４３）の値に”１”を格納し、それ以外の”申請者生年月日区分１、２、４”に対応するレコード１列（１４３）の値に”０”を格納する（第２の１／０データ）。 Next, specific examples for order, interval, and ratio scale data will be shown. The 1/0 data creation unit 1A of the business support server 10 acquires the value "1950/01/01" of the record 1 (112) of the applicant date of birth 1102 of the application form information data 11, Refer to line 1303 corresponding to "1950/01/01" in the scale data category definition 13 to obtain the category value "applicant's date of birth division 3". Next, in the 1/0 data 14 based on the category definition, for the record 1 column (143) of the applicant's date of birth 1102 row, the record 1 column (143) corresponding to "applicant date of birth classification 3" "1" is stored in the value of , and "0" is stored in the value of record 1 column (143) corresponding to other "applicant's date of birth classification 1, 2, 4" (second 1/ 0 data).

業務支援サーバ１０の１／０データ作成部１Ａは、これらの操作を、申請書情報データ１１の全てのレコードに対する「審査結果フィールド」以外の全てのフィールド１１１に対して行い、カテゴリ定義に基づく１／０データ１４を作成する。（Ｓ１０４）
業務支援サーバ１０のクラスタ分析部１Ｂは、カテゴリ定義に基づく１／０データ１４のレコード群（１４３，１４４，．．．）を、申請書ＩＤ１１０１以外の値（１or０）を使って、レコード間の総当りの距離を算出し、算出した距離に基づいてクラスタ分析を行う。なお、距離はＬ１距離を用いて算出する。クラスタ分析は、階層クラスタ分析あるいは非階層クラスタ分析を、機械学習ソフトウェアライブラリ等を用いて行う。 The 1/0 data creation unit 1A of the business support server 10 performs these operations on all fields 111 other than the "examination result field" for all records of the application form information data 11, and creates 1 data based on the category definition. /0 data 14 is created. (S104)
The cluster analysis unit 1B of the business support server 10 divides the records (143, 144, . . . ) of the 1/0 data 14 based on the category definition into A round-robin distance is calculated, and cluster analysis is performed based on the calculated distance. Note that the distance is calculated using the L1 distance. For cluster analysis, hierarchical cluster analysis or non-hierarchical cluster analysis is performed using a machine learning software library or the like.

業務支援サーバ１０のクラスタ分析部１Ｂは、クラスタ分析を行った結果を申請書－クラスタ対応表１５に格納する。図１３は、申請書ＩＤ００００１のレコード１はクラスタ１に所属し、申請書ＩＤ００００２のレコード２はクラスタ２に所属するというクラスタ分析結果が得られた場合の申請書－クラスタ対応表１５の例である。（Ｓ１０５）
業務支援サーバ１０のクラスタ分析部１Ｂは、申請書情報データ１１の審査結果１１１１と、申請書－クラスタ対応表１５と、を参照して、クラスタ情報１６の所属レコード数１６１と、主要クラスタ判定結果１６２と、審査結果＝ＮＧの数１６３と、を算出し格納する。 The cluster analysis unit 1B of the business support server 10 stores the results of the cluster analysis in the application form-cluster correspondence table 15. FIG. FIG. 13 is an example of the application form-cluster correspondence table 15 when the cluster analysis result indicates that record 1 with application form ID 00001 belongs to cluster 1 and record 2 with application form ID 00002 belongs to cluster 2. . (S105)
The cluster analysis unit 1B of the business support server 10 refers to the examination result 1111 of the application form information data 11 and the application form-cluster correspondence table 15 to determine the number of belonging records 161 of the cluster information 16 and the main cluster determination result. 162 and the number 163 of examination results=NG are calculated and stored.

以下、それぞれの算出方法を説明する。 Each calculation method will be described below.

クラスタ分析部１Ｂは、所属レコード数１６１として、申請書－クラスタ対応表１５の所属クラスタ１５０１行で合致するレコード数をカウントし格納する。例えば、クラスタ分析部１Ｂは、クラスタ１に関して所属クラスタ１５０１行で“クラスタ１”となっているレコード数をカウントし（図１３の例ではレコード１で１つカウント）格納する。クラスタ情報１６のクラスタ１の所属レコード数１６１の値が”２０００”とは、申請書－クラスタ対応表１５の、所属クラスタ１５０１行に“クラスタ１”が２０００レコードあることを示す。 The cluster analysis unit 1B counts and stores the number of records that match the belonging cluster 1501 row of the application form-cluster correspondence table 15 as the belonging record number 161. FIG. For example, the cluster analysis unit 1B counts the number of records with "Cluster 1" in the row 1501 of the belonging cluster for cluster 1 (in the example of FIG. 13, one is counted for record 1) and stored. A value of “2000” in the number of records belonging to cluster 1 161 of the cluster information 16 indicates that the application form-cluster correspondence table 15 has 2000 records of “cluster 1” in the belonging cluster 1501 row.

クラスタ分析部１Ｂは、主要クラスタ判定結果１６２として、所属レコード数１６１があらかじめ定めた閾値（例：所属レコード数＝１０）以上であれば値“○”を格納し、未満であれば“Ｘ”を格納する。これにより、閾値以上の所属レコード数１６１があるクラスタを識別（○orＸ）できる。すなわち、よくあるメジャーな類似する申請書群（クラスタ）を抽出（○）できる。 As the main cluster determination result 162, the cluster analysis unit 1B stores a value "○" if the number of belonging records 161 is equal to or greater than a predetermined threshold value (eg, number of belonging records = 10), and stores "X" if less than. to store As a result, clusters having the number of belonging records 161 equal to or greater than the threshold can be identified (○ or X). In other words, it is possible to extract (○) a common, major, similar application form group (cluster).

クラスタ分析部１Ｂは、審査結果＝ＮＧの数１６３として、各クラスタに関して申請書－クラスタ対応表１５で該当する申請書ＩＤ１１０１を抽出し、抽出した申請書ＩＤ１１０１に該当する申請書情報データ１１の審査結果行１１１１のＮＧの数をカウントして格納する。例えば、クラスタ分析部１Ｂは、クラスタ２に関して、申請書－クラスタ対応表１５の所属クラスタ１５０１の値が”クラスタ２”である申請書ＩＤ１１０１の値”00002”を抽出し、申請書情報データ１１の申請書ＩＤ１１０１の値が”00002”である１１３列の審査結果１１１１の値”ＮＧ”を参照し、”ＮＧ”1件分カウントする。クラスタ分析部１Ｂは、カウントした値を審査結果＝ＮＧの数１６３に格納する。これらをクラスタ情報１６の各クラスタに対して行うことで、各クラスタの審査結果＝ＮＧの数がわかる。すなわち、申請書グループ（クラスタ）が、業務誤りが少ないグループなのか、業務誤りが多いグループなのかがわかる。（Ｓ１０６）
業務オペレータ１は、業務クライアント端末３０を介して業務支援サーバ１０にアクセスし、クラスタ分析部１Ｂが、申請書－クラスタ対応表１５と、申請書情報データ１１と、を参照して、クラスタ情報１６のクラスタ特徴１６４を調査し入力する。 The cluster analysis unit 1B extracts the corresponding application form ID 1101 from the application form-cluster correspondence table 15 for each cluster as examination result = NG number 163, and examines the application form information data 11 corresponding to the extracted application form ID 1101. The number of NG in the result row 1111 is counted and stored. For example, the cluster analysis unit 1B extracts the value "00002" of the application form ID 1101 for which the value of the belonging cluster 1501 of the application form-cluster correspondence table 15 is "cluster 2" for cluster 2, The value "NG" of the examination result 1111 in the 113 column whose value of the application form ID 1101 is "00002" is referenced, and one "NG" is counted. The cluster analysis unit 1B stores the counted value in the number 163 of examination result=NG. By performing these operations for each cluster in the cluster information 16, the number of examination results=NG for each cluster can be obtained. That is, it can be understood whether the application form group (cluster) is a group with few business errors or a group with many business errors. (S106)
The business operator 1 accesses the business support server 10 via the business client terminal 30, and the cluster analysis unit 1B refers to the application form-cluster correspondence table 15 and the application form information data 11 to obtain the cluster information 16. are examined and input the cluster features 164 of .

２つのクラスタ特徴１６４の調査、入力の例を説明する。 An example of searching and inputting two cluster features 164 will now be described.

１つ目の例を説明する。業務オペレータ１はクラスタ情報１６のクラスタ１に着目し、申請書－クラスタ対応表１５を参照して対応するレコード１を特定し、申請書情報データ１１のレコード１（１１２）の値を参照し、業務知識を活用しながら、クラスタ情報１６のクラスタ１のクラスタの特徴１６４における”子扶養正規申請パタン”の文書を作文し、クラスタ分析部１Ｂが、入力する。なお、対応するレコードが複数ある場合には、複数レコードの値の統計情報を、クラスタ分析部１Ｂが統計ツール等を用いて参照して、クラスタ特徴１６４を作文し、入力する。さらに、業務オペレータ１はクラスタ情報１６の所属レコード数”2000”を参照し、他のクラスタと比較して所属レコード数が多いことから、クラスタ１のクラスタの特徴１６４における”典型的な”の文書を作文し、クラスタ分析部１Ｂが、追加入力する。 A first example will be described. The business operator 1 focuses on the cluster 1 of the cluster information 16, refers to the application-cluster correspondence table 15 to identify the corresponding record 1, refers to the value of the record 1 (112) of the application information data 11, While utilizing business knowledge, the document of "regular child support application pattern" in the cluster feature 164 of the cluster 1 of the cluster information 16 is composed, and the cluster analysis unit 1B inputs it. If there are a plurality of corresponding records, the cluster analysis unit 1B uses a statistical tool or the like to refer to the statistical information of the values of the plurality of records, and composes and inputs the cluster feature 164 . Further, the business operator 1 refers to the number of belonging records "2000" in the cluster information 16, and since the number of belonging records is larger than that of other clusters, the "typical" document in the cluster feature 164 of the cluster 1 and the cluster analysis unit 1B additionally inputs.

２つ目の例を説明する。業務オペレータ１はクラスタ情報１６のクラスタ２に着目し、申請書－クラスタ対応表１５を参照して対応するレコード２を特定し、申請書情報データ１１のレコード２（１１３）の値を参照し、レコード２の特徴”申請者生年月日1102と申請者区分1103が矛盾”しており、審査結果がＮＧとなっていることを、業務マニュアル等から確認し、クラスタ情報１６のクラスタ２のクラスタ特徴１６４を作文し、クラスタ分析部１Ｂが、入力する。これにより、クラスタ２の審査結果＝ＮＧの数が多い理由を、クラスタ特徴１６４に記載することができる。（Ｓ１０７）
図９は、申請書情報データ１１の例である。なお、申請書情報データ２１も同内容である。フィールド名１１１に対応する各レコード１１２、１１３、…、の値が格納されている。申請書情報データ１１は、例えば住民が公共機関に認定等の申請書類を提出する場合の申請書類の内容である。公共機関に申請書類が、書面あるいはオンラインで提出された後、その内容を公共機関の職員が入力したものである。レコード１（１１２）、レコード２（１１３）が、各々一申請書類に対応する。 A second example will be described. The business operator 1 focuses on the cluster 2 of the cluster information 16, refers to the application-cluster correspondence table 15 to identify the corresponding record 2, refers to the value of the record 2 (113) of the application information data 11, The feature of record 2 is that "applicant's date of birth 1102 and applicant classification 1103 are inconsistent", and the examination result is NG. 164, and input by the cluster analysis unit 1B. As a result, the reason for the large number of examination results=NG in cluster 2 can be described in the cluster feature 164 . (S107)
FIG. 9 is an example of the application form information data 11. As shown in FIG. The application form information data 21 has the same contents. The values of each record 112, 113, . . . corresponding to the field name 111 are stored. The application form information data 11 is, for example, the contents of an application form when a resident submits an application form for certification or the like to a public institution. After the application documents are submitted to the public institution in writing or online, the content is entered by the staff of the public institution. Record 1 (112) and record 2 (113) each correspond to one application document.

例えば、レコード１（１１２）の例では、申請書ＩＤ”00001”が申請書に振られた番号であり、申請者生年月日１１０２から、控除区分１１１０、…、が申請書に記載された情報である。申請書は、公共機関の職員により審査され、申請書が審査基準を満たしていれば、審査結果１１１１が”ＯＫ“と入力され、申請書が審査基準を満たしていなければ、審査結果１１１１が”ＮＧ“と入力される。なお、審査結果は、”ＯＫ”or”ＮＧ”のような２値でなくてもよい。例えば、申請に基づく給付額”１万円”，”２万円”，”３万円”であってもよいし、認定区分”区分Ａ”，”区分Ｂ”，”区分Ｃ”であってもよい。 For example, in the example of record 1 (112), the application form ID "00001" is the number assigned to the application form, and from the applicant's date of birth 1102, the deduction category 1110, ... is the information described in the application form. is. The application is examined by a staff member of the public institution, and if the application satisfies the examination criteria, the examination result 1111 is entered as "OK", and if the application does not meet the examination criteria, the examination result 1111 is ". NG" is entered. The examination result does not have to be binary such as "OK" or "NG". For example, the benefit amount based on the application may be "10,000 yen", "20,000 yen", or "30,000 yen", or the certified classification "Division A", "Division B", or "Division C". good too.

図１０は、名義尺度データのカテゴリ定義１２の例である。名義尺度データとは、値が他とは異なるか同一かの意味を持つデータであり、例えば、申請書情報データ１１の申請書ＩＤ１１０１、申請者区分１１０３、申請者に対する続柄１１０６、扶養者区分１１０７、同居／別居区分１１０８、控除区分１１１０がこれに該当する。例えば、申請書区分フィールド１１０３は、４つのカテゴリ値１２２”申請者区分Ａ”～”申請者区分Ｄ”をとりうることを示している。上位カテゴリ値１２３は、一つあるいは複数のカテゴリ値１２２を束ねたカテゴリ値であり、カテゴリ値１２２”申請者区分Ａ”，”申請者区分Ｂ”，”申請者区分Ｃ”を束ねた上位カテゴリ値１２３が”申請者上位区分１”であることを意味する。 FIG. 10 is an example of category definition 12 of nominal scale data. Nominal scale data is data that has the meaning of whether the value is different from others or the same. , cohabitation/separate division 1108, and deduction division 1110 correspond to this. For example, the application category field 1103 indicates that four category values 122 "applicant category A" to "applicant category D" can be taken. The upper category value 123 is a category value that bundles one or more category values 122, and is a higher category that bundles the category values 122 "Applicant Category A", "Applicant Category B", and "Applicant Category C". A value of 123 means "applicant superclass 1".

図１１は、順序、間隔、比尺度データのカテゴリ定義１３の例である。値の順序や間隔や比率の意味を持つデータであり、例えば、申請者年月日１１０２、申請者所得１１０４、扶養者生年月日１１０５、年間所得１１０９がこれに該当する。年月は順序および間隔尺度データ、金額は比尺度データ、温度は比尺度データ、資格の等級は順序尺度データ、の一例である。 FIG. 11 is an example of category definitions 13 for ordinal, interval, and ratio scale data. It is data having the meaning of the order, intervals, and ratios of values. Years and months are ordinal and interval scale data, monetary amounts are ratio scale data, temperatures are ratio scale data, and qualification grades are ordinal scale data.

申請者生年月日１１０２の例では、値１３２に示されるような４つの値レンジに区分けされ、それぞれカテゴリ値１３３と対応づけられている（１３０１、１３０２、１３０３、１３０４）。 In the example of applicant date of birth 1102, it is divided into four value ranges as indicated by value 132, which are associated with category values 133 respectively (1301, 1302, 1303, 1304).

図１２は、カテゴリ定義に基づく１/０データ１４の例である。申請書データ１１の１１０２～１１１０、…、に対して、名義尺度データのカテゴリ定義１２と、順序、間隔、比尺度データのカテゴリ定義１３と、を使ってデータ変換（Ｓ１０４）することで得られる。 FIG. 12 is an example of 1/0 data 14 based on category definitions. , of the application form data 11 are converted (S104) using the category definition 12 of nominal scale data and the category definition 13 of order, interval, and ratio scale data .

フィールド名１４１の、申請書ＩＤ１１０１、申請者生年月日１１０２、申請者区分１１０３、申請者上位区分１１０４、…、控除区分１１１０は、申請書データ１１のフィールド名１１１に対応している。また、１１２２は、名義尺度データのカテゴリ定義１２の上位カテゴリ値１２３に対応している。カテゴリ値１４２は、名義尺度データのカテゴリ定義１２のカテゴリ値１２２と上位カテゴリ値１２３と、順序、間隔、比尺度データのカテゴリ定義１３のカテゴリ値１３３を列挙する。 Application form ID 1101 , applicant date of birth 1102 , applicant classification 1103 , applicant upper level classification 1104 , . 1122 corresponds to the upper category value 123 of the category definition 12 of the nominal scale data. The category values 142 list the category values 122 and upper category values 123 of the category definition 12 for nominal scale data and the category values 133 of the category definition 13 for ordinal, interval, and ratio scale data.

例えば、レコード１（１４３）の申請者生年月日１１０２は、”申請者生年月日区分３”に該当する（値＝”1”）ことを示している。 For example, the applicant's date of birth 1102 of record 1 (143) indicates that it falls under the “applicant's date of birth classification 3” (value=“1”).

図１３は、クラスタ分析の結果得られる申請書－クラスタ対応表１５の例である。クラスタ分析では、各レコードをレコード間の距離を使っていずれかのクラスタに分類する。例えば、申請書ＩＤ１１０１＝”00001”のレコード１の所属クラスタ１５０１は、”クラスタ１”であることを示している。 FIG. 13 is an example of an application form-cluster correspondence table 15 obtained as a result of cluster analysis. In cluster analysis, each record is classified into one of clusters using the distance between records. For example, the belonging cluster 1501 of record 1 with application form ID 1101="00001" indicates that it is "cluster 1".

図１４は、クラスタ分析の結果得られた各クラスタの特徴を示すクラスタ情報１６の例である。 FIG. 14 is an example of cluster information 16 indicating the characteristics of each cluster obtained as a result of cluster analysis.

所属レコード数１６１は、クラスタに所属するレコード数を示しており、例えば、クラスタ１には２０００レコードが所属し、クラスタ２には５レコードが所属することを示している。ここから、業務外れケース抽出部１Ｃは、クラスタ１に所属している申請書が2000個あり、クラスタ２に所属している申請書が5個あると判断する。ここから、業務外れケース抽出部１Ｃは、クラスタ１に所属する申請書はとてもよくあるメジャーな申請書であり、クラスタ２に所属する申請書は珍しいレアケースの申請書であることが推定できる。あらかじめ定められた所定の閾値を満たす場合に、上記とてもよくあるメジャーな申請書であると判断することができる。上記閾値は、業務の内容により任意に定めることができる。 The number of belonging records 161 indicates the number of records belonging to the cluster. For example, cluster 1 has 2000 records and cluster 2 has 5 records. From this, the out-of-work case extraction unit 1C determines that there are 2000 applications belonging to cluster 1 and 5 applications belonging to cluster 2. FIG. From this, the out-of-work case extraction unit 1C can infer that the application forms belonging to cluster 1 are very common major application forms, and the application forms belonging to cluster 2 are rare case application forms. If a predetermined threshold is satisfied, it can be determined that the application is a very common major application. The above threshold can be arbitrarily determined according to the content of the work.

主要クラスタ判定結果１６２は、所属レコード数１６１がある閾値以上か否かで”○”，”Ｘ”を付与したものである。例えば、業務外れケース抽出部１Ｃは、クラスタ１の所属レコード数”2000”に、閾値10より大きいので”○”と設定し、クラスタ２の所属レコード数”5”に、閾値10より小さいので”Ｘ”と設定する。”○”のクラスタのみに着目すれば、メジャーな申請書パタンを一覧でき、”Ｘ”のクラスタに着目すれば、レアケースの申請書パタンを一覧できる。 The main cluster determination result 162 is obtained by assigning “◯” and “X” depending on whether or not the number of belonging records 161 is equal to or greater than a certain threshold. For example, the out-of-work case extraction unit 1C sets the number of records belonging to cluster 1 to “2000” because it is greater than the threshold value of 10, so it sets “○”, and the number of records belonging to cluster 2, “5”, because it is smaller than the threshold value of 10. X”. Focusing only on clusters marked with "○" allows a list of major application patterns, and focusing on clusters marked with "X" allows a list of application patterns for rare cases.

審査結果＝ＮＧの数１６３は、クラスタに所属するレコードの中で、申請書情報データ１１の審査結果１１１１＝”ＮＧ”の数をカウントしたものである。業務外れケース抽出部１Ｃは、クラスタの中に、審査で”ＮＧ”となった申請書がいくつ含まれているかをカウントし、その結果を出力する。なお、上記図９の説明で記載したように、審査結果１１１１は、例えば、申請に基づく給付額”１万円”，”２万円”，”３万円”であってもよい。このような場合には、審査結果＝ＮＧの数１６３の代わりに、審査結果＝給付額”１万円”の数、審査結果＝給付額”２万円”の数、審査結果＝給付額”３万円”の数のような列を作成し、それぞれカウントしてもよい。 The examination result=NG number 163 is obtained by counting the number of examination results 1111="NG" in the application information data 11 among the records belonging to the cluster. The out-of-work case extraction unit 1C counts how many application forms judged as "NG" in the examination are included in the cluster, and outputs the result. Incidentally, as described in the explanation of FIG. 9 above, the examination result 1111 may be, for example, the benefit amount "10,000 yen", "20,000 yen", or "30,000 yen" based on the application. In such a case, instead of examination result = NG number 163, examination result = number of payment amount "10,000 yen", examination result = number of payment amount "20,000 yen", examination result = payment amount You may create a column such as the number of 30,000 yen and count each.

クラスタの特徴１６４には、所属レコード数１６１や主要クラスタ判定結果１６２や審査結果＝ＮＧの数１６３と業務マニュアル等を参考に、業務オペレータ１が各クラスタが持つ特徴を作文、入力した文書を入力し、業務外れケース抽出部１Ｃが格納する。 In the cluster characteristics 164, the number of belonging records 161, the main cluster judgment result 162, the number of examination results = NG 163, and the document written and input by the business operator 1 with reference to the business manual etc. and is stored by the non-business case extraction unit 1C.

図１５は、主要クラスタ間距離情報１７の例である。主要クラスタ間距離情報１７は、クラスタ分析処理Ｓ１０５の中で中間的に生成される情報の一部である。クラスタ情報間差分抽出部１Ｅは、クラスタ情報１６の主要クラスタ判定結果＝”○”のクラスタについて、各クラスタ間の距離を算出し、その結果を格納する。図１５の例では、クラスタ１からみた場合に、クラスタ３よりもクラスタ５が離れている（３＜１０）ことを示している。すなわち、クラスタ１に所属する申請書群とクラスタ３に所属する申請書群の類似性は、クラスタ１に所属する申請書群とクラスタ５に所属する申請書群の類似性よりも高いことを意味する。 FIG. 15 is an example of the distance information 17 between major clusters. The main inter-cluster distance information 17 is part of the information generated intermediately in the cluster analysis processing S105. The inter-cluster information difference extracting unit 1E calculates the distance between each cluster for the clusters with the main cluster determination result=“◯” in the cluster information 16, and stores the result. In the example of FIG. 15, when viewed from cluster 1, cluster 5 is further away than cluster 3 (3<10). That is, it means that the similarity between the group of application forms belonging to cluster 1 and the group of application forms belonging to cluster 3 is higher than the similarity between the group of application forms belonging to cluster 1 and the group of application forms belonging to cluster 5. do.

図１６は、業務オペレータ１が操作する業務クライアント端末３０に表示するクラスタ情報確認画面３００の例である。画面上部の樹形図（デンドログラム）３０１は、主要クラスタ間距離情報１７を使って描いた木構造であり、ソフトウェアツール等で作成する。主要クラスタ１と主要クラスタ３との距離が近く、主要クラスタ１と主要クラスタ３から少し離れて主要クラスタ５があることを示している。これにより、業務オペレータ１が、各主要クラスタが、距離の観点でどのような関係にあるかが確認できる。 FIG. 16 is an example of a cluster information confirmation screen 300 displayed on the business client terminal 30 operated by the business operator 1 . A tree diagram (dendrogram) 301 at the top of the screen is a tree structure drawn using the distance information 17 between major clusters, and is created by a software tool or the like. It shows that the distance between the main cluster 1 and the main cluster 3 is short, and the main cluster 5 is a little away from the main cluster 1 and the main cluster 3. This allows the business operator 1 to confirm what kind of relationship each main cluster has in terms of distance.

画面下部のクラスタ情報１６は、業務支援サーバ１０のクラスタ情報１６を表示したものである。申請書には、クラスタ１、クラスタ２、…のようなパタンがあり、各パタンの特性を一覧できる。当該画面は、クラスタ情報間差分抽出部１Ｅ（またはクラスタ分析部１Ｂにより表示される）。なお、図５に示した全体シーケンスの例では、クラスタ分析部１Ｂは、既に説明した上記手法を用いて、入力された新たな申請書のレコードが属するクラスタを判定し、該当するクラスタに所属するレコード数と審査結果が否であったレコード数とを、それぞれ表示部に表示してもよい。これにより、新たな申請書のレコードが属するクラスタ、当該クラスタに所属するレコード数、審査結果が否であったレコード数が一目で確認することができる。 The cluster information 16 at the bottom of the screen displays the cluster information 16 of the business support server 10 . The application form has patterns such as cluster 1, cluster 2, . . . , and the characteristics of each pattern can be listed. The screen is displayed by the inter-cluster information difference extraction unit 1E (or the cluster analysis unit 1B). In the example of the overall sequence shown in FIG. 5, the cluster analysis unit 1B uses the above-described method to determine the cluster to which the input new application record belongs, and the cluster to which the new application record belongs. The number of records and the number of records for which the screening result was rejected may be displayed on the display unit. As a result, the cluster to which the new application record belongs, the number of records belonging to the cluster, and the number of records for which the examination result was rejected can be confirmed at a glance.

図１７は、業務オペレータ１が操作する業務クライアント端末３０に表示する、業務の外れケース抽出結果画面３１０の例である。 FIG. 17 is an example of a business exception case extraction result screen 310 displayed on the business client terminal 30 operated by the business operator 1 .

クラスタ業務外れケース抽出部１Ｄは、画面上部のめずらしいレア業務ケース３１１として、業務支援サーバ１０のクラスタ情報１６から、主要クラスタ判定結果１６２＝”Ｘ”の行を抽出し、業務の外れケース抽出結果画面３１０に出力する。ここには、レアな業務ケース（Ｓ２０１において申請書情報を業務オペレータがたまたま誤って記録したことによるレアな業務ケースの場合もありうる）が表示される。 The cluster work failure case extraction unit 1D extracts the main cluster determination result 162 = "X" from the cluster information 16 of the work support server 10 as the rare business case 311 at the top of the screen, and extracts the work failure case extraction result. Output to screen 310 . Rare business cases (there may be cases where the business operator accidentally recorded the application form information in S201) are displayed here.

クラスタ業務外れケース抽出部１Ｄは、画面中部の誤りの多い業務ケース３１２として、業務支援サーバ１０のクラスタ情報１６から、所属レコード数１６１に対する審査結果＝ＮＧの数１６３の比率があらかじめ定めた閾値以上の行を抽出し、業務の外れケース抽出結果画面３１０に出力する。抽出の結果として、クラスタの特徴１６４欄には、誤りの多い業務ケースの特徴が表示される。 The cluster work failure case extraction unit 1D extracts a business case 312 with many errors in the middle part of the screen, from the cluster information 16 of the business support server 10, the ratio of the examination result = NG number 163 to the number of belonging records 161 is greater than or equal to a predetermined threshold. line is extracted and output to the business exception case extraction result screen 310 . As a result of the extraction, the Cluster Features 164 column displays the features of error-prone business cases.

クラスタ業務外れケース抽出部１Ｄは、画面下部のよくあるメジャーな誤りの少ない業務ケース３１３として、業務支援サーバ１０のクラスタ情報１６から、所属レコード数１５１があらかじめ定めた閾値以上であり、かつ、所属レコード数１６１に対する審査結果＝ＮＧの数１６３があらかじめ定めた閾値以下である行を抽出し、業務の外れケース抽出結果画面３１０に出力する。例えば、Ｓ１１３の抽出結果が、網掛けで示した「クラスタ５」のようなケースとして表示される。ここには、典型的なよくある申請パタンで申請の誤りや業務誤りが少ないために、審査結果＝ＮＧが少ないものが表示されることが期待される。 The cluster-out-of-work case extraction unit 1D extracts a business case 313 with few common major errors at the bottom of the screen from the cluster information 16 of the business support server 10. Rows in which the number 163 of examination results=NG for the number of records 161 is equal to or less than a predetermined threshold are extracted and output to the out-of-business case extraction result screen 310 . For example, the extraction result of S113 is displayed as a case such as "Cluster 5" indicated by shading. Here, it is expected that typical common application patterns with few application errors and business errors, and therefore few examination results (NG) will be displayed.

図１８は、業務オペレータ１が操作する業務クライアント端末３０に表示する、前年度と今年度の変化差分抽出結果画面３２０の例である。 FIG. 18 is an example of a change difference extraction result screen 320 between the previous year and the current year displayed on the business client terminal 30 operated by the business operator 1 .

クラスタ業務外れケース抽出部１Ｄ（またはクラスタ分析部１Ｂ）は、クラスタ１変化差分３２１として、前年度申請書群に対するクラスタ情報１６と、今年度申請書群に対するクラスタ情報１６から、同一あるいはとても類似性の高いクラスタを抽出し、並列させて表示し（クラスタ１前年度行と、クラスタ１今年度行）加えてその差分行を表示している。なお、差分があらかじめ定めた閾値よりも大きい箇所を太枠で表示している。クラスタ４変化差分３２２も同様である。このように、図１８では、１／０データ作成部１Ａが、第１の申請書群（例えば、前年度申請書群）と第２の申請書群（例えば、今年度申請書群）のそれぞれについて、１／０テーブルを生成し、クラスタ分析部１Ｂが、第１の申請書群と第２の申請書群のそれぞれについて、各クラスタに属するレコード数と審査結果が否であったレコード数とをカウントし、クラスタに属するレコード数と審査結果が否であったレコード数との、第１の申請書群と第２の申請書群との差異を、表示部に表示する。これにより、業務オペレータ１は、前年度と今年度の申請書の全体特性の変化が把握できる。 The cluster out-of-work case extraction unit 1D (or cluster analysis unit 1B) extracts cluster 1 change difference 321 from the cluster information 16 for the previous year's application form group and the cluster information 16 for the current year's application form group. Clusters with high values are extracted and displayed in parallel (Cluster 1 previous year row and Cluster 1 current year row), and the difference row is displayed. Note that portions where the difference is larger than a predetermined threshold are displayed with thick frames. The cluster 4 change difference 322 is similar. Thus, in FIG. 18, the 1/0 data creation unit 1A creates the first application form group (for example, the previous year's application form group) and the second application form group (for example, the current year's application form group). The cluster analysis unit 1B generates a 1/0 table for each of the first application group and the second application group, and calculates the number of records belonging to each cluster and the number of records for which the examination result was negative, and are counted, and the difference between the number of records belonging to the cluster and the number of records for which the examination result was negative, between the first application form group and the second application form group, is displayed on the display unit. As a result, the business operator 1 can grasp the changes in the overall characteristics of the application forms between the previous year and the current year.

クラスタ業務外れケース抽出部１Ｄ（またはクラスタ分析部１Ｂ）は、今年度新規クラスタ３２３として、今年度のクラスタ情報１６にあるが、前年度のクラスタ情報１６には、同一あるいはとても類似性の高いクラスタがないものを表示する。これは、昨年度にはなかった申請パタンが今年度新たに発生したことを示している。 The cluster out-of-work case extraction unit 1D (or the cluster analysis unit 1B) has this year's new cluster 323 in the current year's cluster information 16, but the previous year's cluster information 16 contains the same or very similar cluster Show what is missing. This indicates that an application pattern that did not exist in the previous fiscal year has newly emerged this fiscal year.

なお、図１８の例では、前年度と今年度の変化差分を抽出しているが、例えば対象データを変えて、月単位で差分を抽出したり、前年度と今年度の同月での差分を抽出したり、地域毎の差分を抽出したり、法制度の改訂前後で差分を抽出したりすることもできる。これにより、月や年毎の申請パタンの変化や、社会動向に伴う申請パタンの変化や、申請パタンの地域特性や、法制度の改訂による申請パタンの変化等を把握することができる。 In the example of FIG. 18, the change difference between the previous year and the current year is extracted. It is also possible to extract differences, extract differences for each region, and extract differences before and after the revision of the legal system. This makes it possible to grasp changes in application patterns by month or year, changes in application patterns due to social trends, regional characteristics of application patterns, changes in application patterns due to revisions of legal systems, and the like.

このように、本実施例では、名義尺度データと、順序／間隔／比尺度データが混在している申請書情報について、順序／間隔／比尺度データの値をカテゴリ値に対応付けるテーブルをあらかじめ作成しておく。申請書情報における名義尺度データを名義尺度データのカテゴリ値に対する１／０に変換し、順序／間隔／比尺度データを対応付けテーブルを使ってカテゴリ値に対する１／０に変換することで、申請書情報全体をカテゴリ値に対する１／０テーブルに変換する。さらに、変換した１／０テーブルの申請書間の距離に基づきクラスタ分析を行い、申請書クラスタを算出する。算出した各クラスタに対する所属レコード数や、所属レコード数に対する審査結果がＮＧの割合に基づき、誤りケースの可能性が高い申請書群を抽出する。上記で抽出した一部の申請書に対して検証を行う。したがって、申請書全件から一部の申請書を抽出して検証を行う場合に、業務誤りケースの検出率を高くできる。 As described above, in this embodiment, for application form information in which nominal scale data and ordinal/interval/ratio scale data are mixed, a table is created in advance that associates the values of the order/interval/ratio scale data with the category values. Keep By converting the nominal scale data in the application form information to 1/0 for the category value of the nominal scale data, and converting the order/interval/ratio scale data to 1/0 for the category value using the correspondence table, the application form Convert the entire information into a 1/0 table for category values. Furthermore, cluster analysis is performed based on the distance between the applications in the converted 1/0 table to calculate application clusters. Based on the calculated number of records belonging to each cluster and the ratio of NG examination results to the number of belonging records, a group of application forms with a high possibility of being an error case is extracted. Verification is performed for some of the application forms extracted above. Therefore, when extracting and verifying some application forms from all the application forms, the detection rate of business error cases can be increased.

１０００業務システム
１０業務支援サーバ
２０業務サーバ
３０業務クライアント端末
５ネットワーク
１０１記憶部
１１申請書情報データ
１２名義尺度データのカテゴリ定義１２
１３順序、間隔、比尺度データのカテゴリ定義
１４カテゴリ定義に基づく１／０データ
１５申請書－クラスタ対応表
１６クラスタ情報
１７主要クラスタ間距離情報
１Ａ１／０データ作成部
１Ｂクラスタ分析部
１Ｃ業務の外れケース抽出部（業務外れケース抽出部）
１Ｄクラスタの業務の外れケース抽出部（クラスタ業務外れケース抽出部）
１Ｅクラスタ情報間差分抽出部
２０業務サーバ
２０１記憶部
２０２演算部２０２ 1000 business system 10 business support server 20 business server 30 business client terminal 5 network 101 storage unit 11 application form information data 12 category definition 12 of nominal scale data
13 Category definition of order, interval, ratio scale data 14 1/0 data based on category definition 15 Application form-cluster correspondence table 16 Cluster information 17 Main inter-cluster distance information 1A 1/0 data creation unit 1B Cluster analysis unit 1C Business Out-of-work case extraction unit (out-of-work case extraction unit)
1D cluster business exception case extraction unit (cluster business exception case extraction unit)
1E inter-cluster information difference extraction unit 20 business server 201 storage unit 202 calculation unit 202

Claims

An out-of-business case extraction support system for extracting out-of-case cases of application examination work,
generating first 1/0 data for the nominal scale data based on the nominal scale data included in the application and the category values corresponding to the nominal scale data; /ratio scale data and category values corresponding to said ordinal/interval/ratio scale data; a 1/0 data creation unit that creates a 1/0 table including the first 1/0 data and the second 1/0 data;
a cluster analysis unit that performs cluster analysis based on the distance between the applications in the generated 1/0 table and calculates clusters of the applications ;
When the cluster analysis unit counts the number of records belonging to each cluster and the number of records for which the examination result was rejected for each cluster of the calculated application forms and displays them on the display unit,
The 1/0 data creation unit generates the 1/0 table for each of the first application group and the second application group,
The cluster analysis unit counts the number of records belonging to each cluster and the number of records for which the screening result was negative for each of the first application group and the second application group, and counts the records belonging to the cluster. displaying on the display unit the difference between the first application form group and the second application form group in terms of the number and the number of records for which the examination result was rejected;
An out-of-business case extraction support system characterized by:

The business exception case extraction support system according to claim 1 ,
The cluster analysis unit determines the cluster to which the input new application record belongs, and displays the number of records belonging to the corresponding cluster and the number of records for which the screening result was negative on the display unit.
An out-of-business case extraction support system characterized by:

An out-of-service case extraction support method for extracting out-of-service cases of application examination work, comprising:
a 1/0 data generating unit generating first 1/0 data for the nominal scale data based on the nominal scale data included in the application form and the category values corresponding to the nominal scale data;
The 1/0 data creation unit, based on the order/interval/ratio scale data included in the application form and the category value corresponding to the order/interval/ratio scale data, for the order/interval/ratio scale data generating the second 1/0 data of
The 1/0 data creation unit generates a 1/0 table including the first 1/0 data and the second 1/0 data for the application form,
When the cluster analysis unit performs cluster analysis based on the distance between the applications in the generated 1/0 table and calculates the cluster of the applications ,
When the cluster analysis unit counts the number of records belonging to each cluster and the number of records for which the examination result was rejected for each cluster of the calculated application forms and displays them on the display unit,
The 1/0 data creation unit generates the 1/0 table for each of the first application group and the second application group,
The cluster analysis unit counts the number of records belonging to each cluster and the number of records for which the screening result was negative for each of the first application group and the second application group, and counts the records belonging to the cluster. displaying on the display unit the difference between the first application form group and the second application form group in terms of the number and the number of records for which the examination result was rejected;
An out-of-work case extraction support method characterized by:

The work outlier case extraction support method according to claim 3 ,
The cluster analysis unit determines the cluster to which the input new application record belongs, and displays the number of records belonging to the corresponding cluster and the number of records for which the screening result was negative on the display unit.
An out-of-work case extraction support method characterized by: