JP6635966B2

JP6635966B2 - Visualization device, visualization method, and program

Info

Publication number: JP6635966B2
Application number: JP2017063573A
Authority: JP
Inventors: 節夫山田; 喜昭野田; 杉崎　正之; 正之杉崎
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2017-03-28
Filing date: 2017-03-28
Publication date: 2020-01-29
Anticipated expiration: 2037-03-28
Also published as: JP2018165942A

Description

本発明は、可視化装置、可視化方法、及びプログラムに関する。 The present invention relates to a visualization device, a visualization method, and a program.

従来、文書（テキスト）が属する分類が多値（３種類以上）の場合に、文書がどの分類に属するかを推定（Ｎ値分類の推定）し、最も可能性が高い分類結果を１つ表示する分類判定方法が知られている。例えば、パソコンの故障状況を記述した文書を故障の種類に分類する場合に、分類が「電源」、「ＨＤＤ（Hard Disc Drive）」、「ディスプレイ」、及び「ファン」であるとき、例えば、「電源を入れても画面に何も表示されない」という文書が入力されると、図１７に示すように入力された文書を取得する（ステップＳ５１）。ステップＳ５１で文書が取得されると、Ｎ値分類の推定により文書が上記複数の分類の中から「ディスプレイ」という分類に属すると推定する（ステップＳ５２）。ステップＳ５２でＮ値分類が推定されると、「ディスプレイ」という分類結果を含む結果情報を格納し（ステップＳ５３）、ディスプレイ等に表示する（ステップＳ５４）。Ｎ値分類の推定に用いられる分類器としては、ＳＶＭ（Support Vector Machine）が非常に高い分類性能を持つことが知られている（例えば、非特許文献１参照）。また、分類結果の信頼度を付与し、信頼度とともに１つまたは複数の分類結果を表示する方法もある。 Conventionally, when the classification to which a document (text) belongs is multi-valued (three or more), it is estimated which classification the document belongs to (estimation of N-value classification), and one of the most likely classification results is displayed. There are known classification determination methods. For example, when a document describing a failure state of a personal computer is classified into failure types, when the classification is “power supply”, “HDD (Hard Disc Drive)”, “display”, and “fan”, for example, “ When the document "Nothing is displayed on the screen even when the power is turned on" is input, the input document is acquired as shown in FIG. 17 (step S51). When the document is obtained in step S51, it is estimated that the document belongs to the class "display" from the plurality of classes by estimating the N-value class (step S52). When the N-value classification is estimated in step S52, the result information including the classification result of "display" is stored (step S53) and displayed on a display or the like (step S54). As a classifier used for estimating the N-value classification, it is known that an SVM (Support Vector Machine) has very high classification performance (for example, see Non-Patent Document 1). There is also a method of giving the reliability of the classification result and displaying one or more classification results together with the reliability.

高村大也著、「言語処理のための機械学習入門」コロナ社、２０１０年８月１０日、Ｐ．１１７−１２７Daiya Takamura, "Introduction to Machine Learning for Language Processing," Corona Publishing Co., August 10, 2010, p. 117-127

しかしながら、上述のような従来技術によって表示された分類結果を参考にして利用者が分類を確定しようとする際、1つの分類結果のみが表示される場合、分類結果の信頼度が分からないため、利用者が分類結果を採用してよいか判断することが困難な場合がある。また、分類結果が信頼度とともに表示される場合であっても、表示された分類結果の信頼度は分かるが、その分類結果を採用する判断基準（例えば、何パーセントの信頼度なら採用できるのか）を決めることが困難な場合がある。さらに、文書に複数の文が含まれる場合、いずれの文が分類の判定に寄与しているのか判断しかねる場合がある。 However, when the user tries to determine the classification by referring to the classification result displayed by the conventional technology as described above, if only one classification result is displayed, the reliability of the classification result is not known, In some cases, it is difficult for the user to determine whether to adopt the classification result. Even when the classification result is displayed together with the reliability, the reliability of the displayed classification result is known, but a criterion for adopting the classification result (for example, what percentage of reliability can be adopted) It can be difficult to decide. Further, when a document includes a plurality of sentences, it may not be possible to determine which sentence contributes to the classification determination.

したがって、かかる点に鑑みてなされた本発明の目的は、入力された文書の分類結果に対するユーザの判断基準を可視化することである。 Therefore, an object of the present invention made in view of such a point is to visualize a user's criterion for a classification result of an input document.

上記の課題を解決するため、本発明に係る可視化装置は、利用者の操作に基づいて文書を取得する文書取得部と、前記文書を複数の文に分割する分割部と、前記文書及び前記複数の文それぞれが、予め定められた複数の分類それぞれに属するか否かを推定する２値分類推定部と、前記文書及び前記複数の文と、前記文書及び前記複数の文それぞれが前記分類それぞれに属するか否かを示す分類結果とを含む結果情報を表示する表示部と、を備えることを特徴とする。
また、本発明に係る可視化装置は、利用者の操作に基づいて、文書、及び前記文書に対する人手付与分類を取得する文書取得部と、前記文書を複数の文に分割する分割部と、前記文書及び前記複数の文それぞれが、予め定められた複数の分類それぞれに属するか否かを推定する２値分類推定部と、前記文書及び前記複数の文、前記２値分類推定部において推定された分類結果、並びに前記人手付与分類による分類結果を含む結果情報と、該結果情報に含まれる分類結果が、前記推定された分類結果であるか、前記人手付与分類による分類結果であるかを示す情報とを表示する表示部と、を備えることを特徴とする。 In order to solve the above-described problem, a visualization device according to the present invention includes a document acquisition unit that acquires a document based on a user operation, a division unit that divides the document into a plurality of sentences, the document and the plurality of documents. each respective statements, advance the binary classifier estimation unit that estimates whether belonging to each of a plurality of classifications defined, pre-SL document and the plurality of sentences, the document and the plurality of statements, each said classification And a display unit for displaying result information including a classification result indicating whether or not the data belongs to the group.
Further, the visualization device according to the present invention, based on a user operation, a document acquisition unit that acquires a document and a manual assignment classification for the document, a division unit that divides the document into a plurality of sentences, A binary classification estimating unit for estimating whether each of the plurality of sentences belongs to each of a plurality of predetermined classifications, and a classification estimated by the document and the plurality of sentences, the binary classification estimating unit Results, and the result information including the classification result by the manual assignment classification, and the classification result included in the result information is the estimated classification result, or information indicating whether the classification result by the manual assignment classification And a display unit for displaying

また、本発明に係る可視化方法は、文書の分類を推定する可視化装置が実行する可視化方法であって、利用者の操作に基づいて文書を取得するステップと、前記文書を複数の文に分割する分割するステップと、前記文書及び前記複数の文それぞれが、予め定められた複数の分類それぞれに属するか否かを推定するステップと、前記文書及び前記複数の文と、前記文書及び前記複数の文それぞれが前記分類それぞれに属するか否かを示す分類結果とを含む結果情報を表示するステップと、を含むことを特徴とする。
また、本発明に係る可視化方法は、利用者の操作に基づいて、文書、及び前記文書に対する人手付与分類による分類結果を取得するステップと、前記文書を複数の文に分割するステップと、前記文書及び前記複数の文それぞれが、予め定められた複数の分類それぞれに属するか否かを推定するステップと、前記文書及び前記複数の文、前記推定された分類結果、並びに前記人手付与分類による分類結果含む結果情報と、該結果情報に含まれる分類結果が、前記推定された分類結果であるか、前記人手付与分類による分類結果であるかを示す情報と、を含む結果情報を表示するステップと、を含むことを特徴とする。 Further, a visualization method according to the present invention is a visualization method executed by a visualization device for estimating a classification of a document, wherein the step of acquiring a document based on a user operation and the step of dividing the document into a plurality of sentences dividing the document and each of the plurality of statements, estimating whether belonging to each of a plurality of classifications predetermined said document and said plurality of statements, the document and the plurality of sentences each of which comprising the steps of: displaying the result information including the classification result indicating whether belonging to each of the classification.
Further, the visualization method according to the present invention includes a step of acquiring a document and a classification result by a manual assignment classification for the document based on an operation of a user; a step of dividing the document into a plurality of sentences; And a step of estimating whether or not each of the plurality of sentences belongs to each of a plurality of predetermined classifications, and the document and the plurality of sentences, the estimated classification result, and the classification result by the manual assignment classification. Displaying the result information including the result information including, and the classification result included in the result information, the estimated classification result, or information indicating whether the classification result is based on the manual assignment classification, It is characterized by including.

また、本発明に係るプログラムは、コンピュータを、上記可視化装置として機能させるためのプログラム。 Further, a program according to the present invention is a program for causing a computer to function as the visualization device.

本発明によれば、入力された文書の分類結果に対するユーザの判断基準を可視化することができる。 According to the present invention, it is possible to visualize a user's criterion for a classification result of an input document.

第１の実施形態に係る可視化装置の構成例を示すブロック図である。It is a block diagram showing the example of composition of the visualization device concerning a 1st embodiment. 第２の実施形態における表示部の表示例を示す図である。It is a figure showing the example of a display of the display part in a 2nd embodiment. 第１の実施形態の結果情報の一例を示す図である。FIG. 5 is a diagram illustrating an example of result information according to the first embodiment. 第１の実施形態に係る可視化方法の一例を示すフローチャートである。5 is a flowchart illustrating an example of a visualization method according to the first embodiment. 第２の実施形態に係る可視化装置の構成例を示すブロック図である。It is a block diagram showing an example of composition of a visualization device concerning a 2nd embodiment. 第２の実施形態における表示部の表示例を示す図である。It is a figure showing the example of a display of the display part in a 2nd embodiment. 第２の実施形態の結果情報の一例を示す図である。It is a figure showing an example of the result information of a 2nd embodiment. 第２の実施形態に係る可視化方法の一例を示すフローチャートである。It is a flow chart which shows an example of the visualization method concerning a 2nd embodiment. 第３の実施形態における表示部の表示例を示す図である。It is a figure showing the example of a display of the display part in a 3rd embodiment. 第３の実施形態の結果情報の一例を示す図である。It is a figure showing an example of the result information of a 3rd embodiment. 第３の実施形態に係る可視化方法の一例を示すフローチャートである。It is a flow chart which shows an example of the visualization method concerning a 3rd embodiment. 第４の実施形態の表示部の表示例を示す図である。It is a figure showing the example of a display of the display part of a 4th embodiment. 第４の実施形態における結果情報の一例を示す図である。It is a figure showing an example of the result information in a 4th embodiment. 第４の実施形態に係る可視化方法の一例を示すフローチャートである。13 is a flowchart illustrating an example of a visualization method according to a fourth embodiment. 第５の実施形態結果情報の一例を示す図である。It is a figure showing an example of a 5th embodiment result information. 第１の実施形態の変形例に係る可視化装置の構成例を示すブロック図である。It is a block diagram showing the example of composition of the visualization device concerning the modification of a 1st embodiment. 従来技術における分類判定方法の一例を示すフローチャートである。9 is a flowchart illustrating an example of a classification determination method according to the related art.

［第１の実施形態］
まず、図１を参照して、本発明の第１の実施形態の機能構成について説明する。図１は、第１の実施形態に係る可視化装置１の機能構成図である。 [First Embodiment]
First, the functional configuration of the first embodiment of the present invention will be described with reference to FIG. FIG. 1 is a functional configuration diagram of the visualization device 1 according to the first embodiment.

図１に示すように、可視化装置１は、文書取得部１１、２値分類推定部１２、格納部１３、及び表示部１４を備える。 As shown in FIG. 1, the visualization device 1 includes a document acquisition unit 11, a binary classification estimation unit 12, a storage unit 13, and a display unit 14.

文書取得部１１は、利用者の操作に基づいて文書を取得する。文書取得部１１によって取得される文書は任意であるが、本実施形態では、一例として、利用者は、Ｗｅｂアプリケーションを用いて、図２（ａ）に示すような画面に「電源を入れても画面に何も表示されない。」という文書を入力し、送信ボタンを押下する。この場合、文書取得部１１は入力された文書を取得する。 The document acquisition unit 11 acquires a document based on a user operation. The document acquired by the document acquisition unit 11 is arbitrary, but in the present embodiment, as an example, the user uses a Web application to display “Even if the power is turned on” on a screen shown in FIG. Nothing is displayed on the screen. "And press the send button. In this case, the document acquisition unit 11 acquires the input document.

２値分類推定部１２は、文書取得部１１によって取得された文書が複数の分類それぞれに属するか否かを、学習データに基づいて公知の機械推定の手法により推定する。分類は任意のものであってよいが、本実施形態では、分類の例を「電源」、「ＨＤＤ」、「ディスプレイ」、及び「ファン」として説明する。 The binary classification estimating unit 12 estimates whether or not the document acquired by the document acquiring unit 11 belongs to each of a plurality of classifications based on learning data by a known machine estimation method. Although the classification may be arbitrary, in the present embodiment, examples of the classification will be described as “power supply”, “HDD”, “display”, and “fan”.

この場合、２値分類推定部１２は、文書取得部１１によって取得された文書が「電源」、「ＨＤＤ」、「ディスプレイ」、及び「ファン」にそれぞれ属するか否かを学習データに基づき推定する。文書が「電源を入れても画面に何も表示されない。」である場合、２値分類推定部１２は、学習データに基づいて、文書が「電源」という分類に属する（○）と推定し、「ＨＤＤ」という分類に属さない（×）と推定し、「ディスプレイ」という分類に属する（○）と推定し、「ファン」という分類に属さない（×）と推定したとする。 In this case, the binary classification estimation unit 12 estimates whether or not the document acquired by the document acquisition unit 11 belongs to “power”, “HDD”, “display”, and “fan” based on the learning data. . When the document is “Nothing is displayed on the screen even when the power is turned on”, the binary classification estimation unit 12 estimates that the document belongs to the classification of “power” based on the learning data (○), It is assumed that it is estimated that it does not belong to the classification of “HDD” (×), it is estimated that it belongs to the classification of “display” (○), and it is estimated that it does not belong to the classification of “fan” (×).

格納部１３は結果情報を格納する。図３に示すように、結果情報には、文書を一意に識別するためのＩＤ（Identification）と、文書と、分類結果とが含まれる。分類結果は、上述のように２値分類推定部１２によって推定された、該文書が各分類に属する（○）か否か（×）を示す情報である。例えば、格納部１３は、ＩＤ「Ａ」の「電源を入れても画面に何も表示されない。」という文書に対応して、電源「○」、ＨＤＤ「×」、ディスプレイ「○」、ファン「×」を分類結果として格納する。 The storage unit 13 stores the result information. As shown in FIG. 3, the result information includes an ID (Identification) for uniquely identifying the document, the document, and a classification result. The classification result is information indicating whether the document belongs to each classification (o) or not (x), estimated by the binary classification estimating unit 12 as described above. For example, the storage unit 13 stores the power “O”, HDD “X”, display “O”, and fan “ “X” is stored as the classification result.

表示部１４は、格納部１３に記憶されている結果情報を表示する。具体的には、表示部１４は、文書、及び分類結果をディスプレイ等に表示する。例えば、表示部１４は、２値分類推定部１２によって推定された分類結果を、図２（ｂ）に示すように、文書は「電源」、「ディスプレイ」という分類に属することを表示する。なお、表示部１４は、通信ネットワークを介して結果情報を他の情報処理装置に送信してもよい。 The display unit 14 displays the result information stored in the storage unit 13. Specifically, the display unit 14 displays the document and the classification result on a display or the like. For example, the display unit 14 displays the classification result estimated by the binary classification estimation unit 12 as indicating that the document belongs to the classification of “power” and “display” as shown in FIG. 2B. The display unit 14 may transmit the result information to another information processing device via a communication network.

続いて、第１の実施形態における可視化装置１の可視化方法について図４に示すシーケンス図を参照して説明する。 Subsequently, a visualization method of the visualization device 1 according to the first embodiment will be described with reference to a sequence diagram shown in FIG.

まず、文書取得部１１は、利用者の操作に基づいて文書を取得する（ステップＳ１１）。本実施形態では、文書取得部１１は、例えば、「電源を入れても画面に何も表示されない。」という文書を取得する。 First, the document acquisition unit 11 acquires a document based on a user operation (Step S11). In the present embodiment, the document acquisition unit 11 acquires, for example, a document “Nothing is displayed on the screen even when the power is turned on”.

ステップＳ１１で文書が取得されると、２値分類推定部１２が文書の２値分類を推定する（ステップＳ１２）。具体的には、２値分類推定部１２は、文書が予め定められた分類それぞれに属するか否かを推定する。本実施形態では、例えば、２値分類推定部１２は、文書が「電源」という分類に属するか否かを推定する。 When the document is obtained in step S11, the binary classification estimating unit 12 estimates the binary classification of the document (step S12). Specifically, the binary classification estimating unit 12 estimates whether the document belongs to each of the predetermined classifications. In the present embodiment, for example, the binary classification estimating unit 12 estimates whether or not the document belongs to the classification “power supply”.

ステップＳ１２で２値分類が推定されると、２値分類推定部１２は、全ての分類について推定されたか否かを判定する（ステップＳ１３）。本実施形態の例では、２値分類推定部１２は、「電源」、「ＨＤＤ」、「ディスプレイ」、及び「ファン」の全ての分類について２値分類が推定されたか否かを判定する。 When the binary classification is estimated in step S12, the binary classification estimation unit 12 determines whether or not all the classifications have been estimated (step S13). In the example of the present embodiment, the binary classification estimating unit 12 determines whether the binary classification has been estimated for all the classifications of “power supply”, “HDD”, “display”, and “fan”.

ステップＳ１３で全ての分類について推定されていない場合には、ステップＳ１２に戻って、２値分類推定部１２は、まだ推定されていない分類に文書が属するか推定する。ステップＳ１３で全ての分類について推定されていた場合には、格納部１３が分類結果を含む結果情報を格納する（ステップＳ１４）。 If all the classifications have not been estimated in step S13, the process returns to step S12, and the binary classification estimation unit 12 estimates whether the document belongs to a classification that has not been estimated. If all the classifications have been estimated in step S13, the storage unit 13 stores the result information including the classification results (step S14).

ステップＳ１４で結果情報が格納されると、表示部１４が結果情報をディスプレイ等に表示する（ステップＳ１５）。 When the result information is stored in step S14, the display unit 14 displays the result information on a display or the like (step S15).

以上のように、第１の実施形態によれば、入力された文書に対して、全ての分類についての分類結果が格納され、表示される。文書が、複数の分類に属する場合は、どちらの分類もありえることを意味し、逆にどの分類にも属さない場合は、決め手に欠けることを意味する。例えば、図３に示す分類結果の場合、「電源を入れても画面に何も表示されない」という文書は、「電源」「ディスプレイ」という分類の両方に属する可能性があることを示している。これにより、入力された文書において過不足の情報を考え、不要な情報は削除し、足りない情報は補うなどして、分類を確定することが可能となる。例えば、図３に示す分類結果の場合、足りない情報として「パソコン本体の電源ランプが点灯しない」を補えば、分類を「電源」に確定することができる。 As described above, according to the first embodiment, the classification results of all the classifications are stored and displayed for the input document. If a document belongs to a plurality of categories, it means that there is any category, and if it does not belong to any category, it means that the document is lacking. For example, in the case of the classification result shown in FIG. 3, it is indicated that a document "Nothing is displayed on the screen even when the power is turned on" may belong to both the classifications "power" and "display". As a result, it is possible to determine the classification by considering excessive or insufficient information in the input document, deleting unnecessary information, and supplementing missing information. For example, in the case of the classification result shown in FIG. 3, if "power lamp of personal computer is not lit" is supplemented as missing information, the classification can be determined to be "power".

［第２の実施形態］
続いて、本発明の第２の実施形態について図面を参照して説明する。 [Second embodiment]
Next, a second embodiment of the present invention will be described with reference to the drawings.

図５に示すように、第２の実施形態に係る可視化装置２は、第１の実施形態に係る可視化装置１と同様に、文書取得部１１、２値分類推定部１２、格納部１３、及び表示部１４を備え、さらに分割部１５を備える。なお、第１の実施形態における可視化装置１と同様の構成ブロックについては同一の参照符号を付して、適宜、説明を省略する。 As shown in FIG. 5, the visualization device 2 according to the second embodiment, like the visualization device 1 according to the first embodiment, includes a document acquisition unit 11, a binary classification estimation unit 12, a storage unit 13, and It has a display unit 14 and further has a dividing unit 15. Note that the same constituent blocks as those of the visualization device 1 according to the first embodiment are denoted by the same reference numerals, and description thereof will be omitted as appropriate.

分割部１５は、文書取得部１１によって取得された文書を分割する。分割部１５は、例えば、文書に含まれる文を１文ずつ抽出する。分割部１５は、例えば、句点「。」を検索し、文書の最初の文字から、最初の句点以前の文字列を１文として抽出することができる。また、分割部１５は、句点より後の文字から次の句点以前の文字列を１文として抽出することができる。また、分割部１５は、文書が改行されている場合、改行の前後で文書を分割してもよい。 The dividing unit 15 divides the document acquired by the document acquiring unit 11. The dividing unit 15 extracts, for example, sentences included in the document one by one. For example, the dividing unit 15 can search for a period "." And extract a character string before the first period as one sentence from the first character of the document. In addition, the dividing unit 15 can extract a character string before the next period from the character after the period as one sentence. Further, when the document has a line feed, the dividing unit 15 may divide the document before and after the line feed.

一例として、利用者は、Ｗｅｂアプリケーションを用いて、図６（ａ）に示すような画面に「動作が非常に遅く、ファンの音がしない。立ち上がらないこともある。」という文書を入力し、送信ボタンを押下する。この場合、文書取得部１１は入力された文書を取得する。そして、分割部１５は、文書取得部１１が取得した文書を、「動作が非常に遅く、ファンの音がしない。」という文と「立ち上がらないこともある。」という文とに分割する。 As an example, the user uses the Web application to input a document “The operation is very slow and the fan does not make a sound. There is a case where the fan does not start up” on a screen as shown in FIG. Press the send button. In this case, the document acquisition unit 11 acquires the input document. Then, the dividing unit 15 divides the document acquired by the document acquiring unit 11 into a sentence "The operation is extremely slow and the fan does not make a sound." And a sentence "It may not stand up."

２値分類推定部１２は、分割部１５によって分割された各文について、学習データに基づいて、公知の機械推定の手法により２値分類を推定する。上述のように、分割部１５によって、「動作が非常に遅く、ファンの音がしない。」という文と「立ち上がらないこともある。」という文に分割された場合、２値分類推定部１２は、分割部１５によって分割された文がそれぞれ「電源」、「ＨＤＤ」、「ディスプレイ」、及び「ファン」にそれぞれ属するか否かを推定する。 The binary classification estimation unit 12 estimates the binary classification of each sentence divided by the division unit 15 based on the learning data by a known machine estimation method. As described above, when the division unit 15 divides the sentence into “the operation is very slow and the fan does not make a sound” and “there is no possibility of standing up”, the binary classification estimation unit 12 It is estimated whether the sentences divided by the dividing unit 15 respectively belong to “power supply”, “HDD”, “display”, and “fan”.

具体的には、２値分類推定部１２は、「動作が非常に遅く、ファンの音がしない。」という文について、学習データに基づいて、図７に示すように、文が「電源」という分類に属さない（×）と推定し、「ＨＤＤ」という分類に属さない（×）と推定し、「ディスプレイ」という分類に属さない（×）と推定し、「ファン」という分類に属する（○）と推定したとする。 Specifically, as shown in FIG. 7, based on the learning data, the binary classification estimating unit 12 determines that the sentence “Power is very slow and the fan does not make a sound”. It is estimated that it does not belong to the classification (x), it is estimated that it does not belong to the classification of "HDD" (x), it is estimated that it does not belong to the classification of "display" (x), and it belongs to the classification of "fan" (o ).

また、２値分類推定部１２は、「立ち上がらないこともある。」という文について、学習データに基づいて、文が「電源」という分類に属する（○）と推定し、「ＨＤＤ」という分類に属さない（×）と推定し、「ディスプレイ」という分類に属さない（×）と推定し、「ファン」という分類に属さない（×）と推定したとする。 In addition, the binary classification estimating unit 12 estimates that the sentence “may not start up” belongs to the classification “power” based on the learning data, and classifies the sentence as “HDD”. It is assumed that it is estimated that it does not belong to the category “display”, that it does not belong to the category “display”, and that it does not belong to the category “fan” (×).

さらに、２値分類推定部１２は、「動作が非常に遅く、ファンの音がしない。立ち上がらないこともある。」という文書全体について、学習データに基づいて、文書が「電源」という分類に属する（○）と推定し、「ＨＤＤ」という分類に属さない（×）と推定し、「ディスプレイ」という分類に属さない（×）と推定し、「ファン」という分類に属する（○）と推定したとする。 Further, the binary classification estimating unit 12 classifies the entire document “The operation is very slow, the fan does not make a sound, and sometimes does not start up” based on the learning data, and the document belongs to the classification “Power”. (○), presumed that it does not belong to the classification of “HDD” (×), presumed that it does not belong to the classification of “display” (x), and presumed to belong to the classification of “fan” (○) And

格納部１３は、結果情報を格納する。図７に示すように、結果情報には、ＩＤと、文書と、分割された各文と、２値分類推定部１２によって推定された、各文が各分類に属するか否かを示す分類結果、及び文書全体が各分類に属するか否かを示す分類結果とが含まれる。 The storage unit 13 stores the result information. As shown in FIG. 7, the result information includes an ID, a document, each of the divided sentences, and a classification result estimated by the binary classification estimation unit 12 and indicating whether each sentence belongs to each classification. , And a classification result indicating whether the entire document belongs to each classification.

表示部１４は、格納部１３に記憶されている結果情報を表示する。具体的には、表示部１４は、文書、各文、及び分類結果をディスプレイ等に表示する。例えば、表示部１４は、２値分類推定部１２によって推定された分類結果を、図６（ｂ）に示すように、文書の全体では「ファン」、「電源」という分類に属し、「動作が非常に遅く、ファンの音がしない。」という文は「ファン」という分類に属し、「立ち上がらないこともある。」という文は「電源」という分類に属することを表示する。なお、表示部１４は、通信ネットワークを介して結果情報を他の情報処理装置に送信してもよい。 The display unit 14 displays the result information stored in the storage unit 13. Specifically, the display unit 14 displays the document, each sentence, and the classification result on a display or the like. For example, as illustrated in FIG. 6B, the display unit 14 classifies the classification result estimated by the binary classification estimation unit 12 into the “fan” and “power” classifications of the entire document, and The sentence "Very slow, no fan sound" belongs to the category of "fans", and the sentence "may not start up" indicates that it belongs to the category of "power". The display unit 14 may transmit the result information to another information processing device via a communication network.

続いて、第２の実施形態における可視化装置２の可視化方法について図８に示すシーケンス図を参照して説明する。 Next, a visualization method of the visualization device 2 according to the second embodiment will be described with reference to a sequence diagram shown in FIG.

まず、文書取得部１１は、利用者の操作に基づいて文書を取得する（ステップＳ２１）。例えば、文書取得部１１は、利用者が、Ｗｅｂアプリケーションを用いて、図６（ａ）に示したような画面に「動作が非常に遅く、ファンの音がしない。立ち上がらないこともある。」という文書を入力し、送信ボタンを押下すると、入力された文書を取得する。 First, the document acquisition unit 11 acquires a document based on a user operation (step S21). For example, the document acquisition unit 11 uses the Web application to display, on the screen shown in FIG. 6A, “the operation is very slow, there is no fan sound, and the user may not start up”. Is input and when the send button is pressed, the input document is acquired.

ステップＳ２１で文書が取得されると、分割部１５が文書を分割する（ステップＳ２２）。例えば、分割部１５は、「動作が非常に遅く、ファンの音がしない。立ち上がらないこともある。」という文書を「動作が非常に遅く、ファンの音がしない。」という文と「立ち上がらないこともある。」という文に分割する。 When the document is obtained in step S21, the dividing unit 15 divides the document (step S22). For example, the dividing unit 15 describes the text "The operation is very slow and the fan does not sound. There is a case where the fan does not start." Sometimes it is. "

ステップＳ２２で文書が分割されると、２値分類推定部１２は分割された文の２値分類を推定する（ステップＳ２３）。具体的には、２値分類推定部１２は、分割された文が予め定められた分類に属するか否かを推定する。本実施形態では、例えば、２値分類推定部１２は、文書に含まれる「動作が非常に遅く、ファンの音がしない。」という文が「電源」という分類に属するか否かを推定する。 When the document is divided in step S22, the binary classification estimating unit 12 estimates the binary classification of the divided sentence (step S23). Specifically, the binary classification estimating unit 12 estimates whether or not the divided sentence belongs to a predetermined classification. In the present embodiment, for example, the binary classification estimating unit 12 estimates whether or not the sentence “The operation is very slow and the fan does not sound” included in the document belongs to the classification of “power supply”.

ステップＳ２３で２値分類が推定されると、２値分類推定部１２は、全ての分類について推定されたか否かを判定する（ステップＳ２４）。本実施形態の例では、２値分類推定部１２は、「電源」、「ＨＤＤ」、「ディスプレイ」、及び「ファン」の全ての分類についてそれぞれ文書が属するか推定したか否かを判定する。 When the binary classification is estimated in step S23, the binary classification estimation unit 12 determines whether or not all the classifications have been estimated (step S24). In the example of the present embodiment, the binary classification estimating unit 12 determines whether or not the document belongs to each of the classifications “power”, “HDD”, “display”, and “fan”.

ステップＳ２４で全ての分類について文が推定されていない場合には、ステップＳ２３に戻って、２値分類推定部１２が、まだ推定されていない分類に文が属するか推定する。ステップＳ２４で全ての分類について文が推定された場合、２値分類推定部１２は、分割された全ての文について推定されたか否かを判定する（ステップＳ２５）。本実施形態では、分割された「動作が非常に遅く、ファンの音がしない。」という文と、「立ち上がらないこともある。」という文のいずれについても推定されたか否かを判定する。 If the sentence has not been estimated for all the classes in step S24, the process returns to step S23, and the binary classification estimating unit 12 estimates whether the sentence belongs to a class that has not been estimated yet. When sentences have been estimated for all the classes in step S24, the binary classification estimating unit 12 determines whether or not all the divided sentences have been estimated (step S25). In the present embodiment, it is determined whether or not any of the divided sentences “the operation is extremely slow and the fan does not make a sound” and the sentence “it may not start up” are both estimated.

ステップＳ２５で全ての文について判定がされていた場合、格納部１３が分類結果を含む結果情報を格納する（ステップＳ２６）。ステップＳ２５で全ての文について判定がされていなかった場合、ステップＳ２３に戻って、まだ推定されていない文が各分類に属するか否かを判定する。 If all sentences have been determined in step S25, the storage unit 13 stores result information including the classification result (step S26). If all sentences have not been determined in step S25, the process returns to step S23 to determine whether a sentence that has not been estimated belongs to each category.

ステップＳ２６で結果情報が格納されると、表示部１４が結果情報をディスプレイ等に表示する（ステップＳ２７）。例えば、表示部１４は、２値分類推定部１２によって推定された分類結果を、図６（ｂ）に示したように、文書の全体では「ファン」、「電源」という分類に属し、「動作が非常に遅く、ファンの音がしない。」という文は「ファン」という分類に属し、「立ち上がらないこともある。」という文は「電源」という分類に属することを表示する。 When the result information is stored in step S26, the display unit 14 displays the result information on a display or the like (step S27). For example, as shown in FIG. 6B, the display unit 14 classifies the classification result estimated by the binary classification estimating unit 12 into the “fan” and “power” classifications of the entire document, and Is very slow, and the fan does not sound. "The sentence" Fan "belongs to the category, and" Sometimes it does not start up. "

以上のように、第２の実施形態によれば、分割部１５が、文書を複数の文に分割する。そして、２値分類推定部１２は、複数の文それぞれが、分類それぞれに該当するか否かを推定する。そのため、入力された文書に、複数の文が含まれている場合、文ごとに分類結果が格納され、表示される。これにより、文書全体では分類の判断がつかない場合であっても、文ごとに推定された分類結果を参考に、分類を再考することが可能となる。 As described above, according to the second embodiment, the dividing unit 15 divides a document into a plurality of sentences. Then, the binary classification estimating unit 12 estimates whether or not each of the plurality of sentences corresponds to each of the classifications. Therefore, if the input document includes a plurality of sentences, the classification result is stored and displayed for each sentence. This makes it possible to reconsider the classification by referring to the classification result estimated for each sentence, even when the classification of the entire document cannot be determined.

［第３の実施形態］
続いて、本発明の第３の実施形態について図面を参照して説明する。 [Third Embodiment]
Next, a third embodiment of the present invention will be described with reference to the drawings.

図１に示すように、第３の実施形態に係る可視化装置１は、第１の実施形態と同様に、文書取得部１１、２値分類推定部１２、格納部１３、及び表示部１４を備える。なお、構成ブロックは、第１の実施形態と同じであるので、構成ブロックの説明を省略する As shown in FIG. 1, the visualization device 1 according to the third embodiment includes a document acquisition unit 11, a binary classification estimation unit 12, a storage unit 13, and a display unit 14, as in the first embodiment. . Note that the configuration blocks are the same as those in the first embodiment, and a description of the configuration blocks will be omitted.

第３の実施形態において、文書取得部１１は、文書を取得するとともに、利用者の操作に基づいて該文書に対応する分類（人手付与分類）を取得する。一例として、利用者は、Ｗｅｂアプリケーションを用いて、図９（ａ）に示すような画面に「立ち上がりが悪い。」という文書を入力し、人手付与分類としての故障分類に「ＨＤＤ」を入力する。そして、利用者は、送信ボタンを押下する。この場合、文書取得部１１は、「立ち上がりが悪い。」という文書を取得し、「ＨＤＤ」という故障分類を人手付与分類として取得する。 In the third embodiment, the document acquisition unit 11 acquires a document and acquires a classification (manually assigned classification) corresponding to the document based on a user operation. As an example, using a Web application, the user inputs a document “The startup is poor” on a screen as shown in FIG. 9A, and inputs “HDD” as a failure classification as a manually assigned classification. . Then, the user presses the transmission button. In this case, the document acquisition unit 11 acquires the document “The startup is bad.” And acquires the failure classification “HDD” as the manually assigned classification.

２値分類推定部１２は、取得された文書が各分類に属するか否かを、学習データに基づいて、公知の機械推定の手法により推定する。例えば、２値分類推定部１２は、「立ち上がりが悪い。」という文書について、学習データに基づいて、図１０に示すように、文書が「電源」という分類に属さない（×）と推定し、「ＨＤＤ」という分類に属する（○）と推定し、「ディスプレイ」という分類に属さない（×）と推定し、「ファン」という分類に属さない（×）と推定したとする。 The binary classification estimating unit 12 estimates whether or not the acquired document belongs to each classification by using a known machine estimation method based on the learning data. For example, based on the learning data, the binary classification estimating unit 12 estimates, based on the learning data, that the document does not belong to the classification of “power” (×) based on the learning data, It is assumed that it is estimated that it belongs to the category of “HDD” (（), that it does not belong to the category of “display” (×), and that it does not belong to the category of “fan” (×).

格納部１３は結果情報を格納する。図１０示すように、結果情報には、ＩＤと、文書と、２値分類推定部１２によって推定された機械推定の分類結果と、人手付与分類とが含まれる。例えば、格納部１３は、図１０示すように、ＩＤ「Ｃ」の「立ち上がりが悪い」という文書に対応して、「ＨＤＤ」という人手付与分類を格納する。さらに格納部１３は、電源「×」、ＨＤＤ「○」、ディスプレイ「×」、ファン「×」を機械推定の分類結果として格納する。 The storage unit 13 stores the result information. As shown in FIG. 10, the result information includes an ID, a document, a classification result of the machine estimation estimated by the binary classification estimation unit 12, and a manual assignment classification. For example, as shown in FIG. 10, the storage unit 13 stores a manually assigned classification “HDD” corresponding to a document “bad start” of ID “C”. Further, the storage unit 13 stores the power supply “×”, the HDD “○”, the display “×”, and the fan “×” as the classification result of the machine estimation.

表示部１４は、格納部１３に記憶されている結果情報を表示する。具体的には、表示部１４は、ディスプレイ等に文書、２値分類推定部１２によって推定された機械推定の分類結果、及び人手付与分類を表示する。例えば、表示部１４は、２値分類推定部１２によって推定された分類結果を、図９（ｂ）に示すように、文書が「ＨＤＤ」という分類に属すること、及び入力された故障分類（人手付与分類）が「ＨＤＤ」であることを表示する。なお、表示部１４は、通信ネットワークを介して結果情報を他の情報処理装置に送信してもよい。 The display unit 14 displays the result information stored in the storage unit 13. More specifically, the display unit 14 displays the classification result of the machine estimation estimated by the binary classification estimation unit 12 and the manual assignment classification on a display or the like. For example, as shown in FIG. 9B, the display unit 14 displays the classification result estimated by the binary classification estimation unit 12 as indicating that the document belongs to the classification “HDD” and the input failure classification (manual). This indicates that the “assignment classification” is “HDD”. The display unit 14 may transmit the result information to another information processing device via a communication network.

続いて、第３の実施形態における可視化装置３の可視化方法について図１１に示すシーケンス図を参照して説明する。 Subsequently, a visualization method of the visualization device 3 according to the third embodiment will be described with reference to a sequence diagram shown in FIG.

まず、文書取得部１１は、利用者の操作に基づいて文書及び人手付与分類を取得する（ステップＳ３１）。例えば、利用者の操作により、図９（ａ）に示すように、「立ち上がりが悪い。」という文書と、「ＨＤＤ」という人手付与分類とが入力された場合、入力された文書及び人手付与分類を取得する。 First, the document acquisition unit 11 acquires a document and a hand-carrying classification based on a user operation (step S31). For example, as shown in FIG. 9A, when the user inputs a document “Wrong start-up” and a manual assignment classification “HDD” as shown in FIG. 9A, the input document and the manual assignment classification are input. To get.

本実施形態で可視化装置３が行う可視化方法のステップＳ３２及びＳ３３の処理は、第１の実施形態のステップＳ１２及びＳ１３の処理と同様である。 The processes in steps S32 and S33 of the visualization method performed by the visualization device 3 in the present embodiment are the same as the processes in steps S12 and S13 in the first embodiment.

ステップＳ３３で全ての分類について文書が推定された場合には、格納部１３が結果情報を格納する（ステップＳ３４）。第３の実施形態における結果情報には、第１の実施形態における分類結果に加えて、人手付与分類が含まれる。例えば、格納部１３は、図１０に示したように、電源「×」、ＨＤＤ「○」、ディスプレイ「×」、ファン「×」という分類結果と、「ＨＤＤ」という人手分類とを含む結果情報を格納する。 If the documents are estimated for all the categories in step S33, the storage unit 13 stores the result information (step S34). The result information according to the third embodiment includes a manually assigned classification in addition to the classification result according to the first embodiment. For example, as shown in FIG. 10, the storage unit 13 stores result information including a classification result of a power supply “×”, an HDD “「 ”, a display“ × ”, a fan“ × ”, and a manual classification of“ HDD ”. Is stored.

ステップＳ３４で結果情報が格納されると、表示部１４が、結果情報をディスプレイ等に表示する（ステップＳ３５）。例えば、表示部１４は、図９（ｂ）に示すように、機械学習による判定で「ＨＤＤ」という分類に属する可能性が高いと判定されたこと、及び入力された故障分類（人手付与分類）が「ＨＤＤ」であることを表示する。 When the result information is stored in step S34, the display unit 14 displays the result information on a display or the like (step S35). For example, as illustrated in FIG. 9B, the display unit 14 determines that the possibility of belonging to the classification “HDD” is high in the determination by the machine learning, and the input failure classification (manually assigned classification). Is "HDD".

以上のように、第３の実施形態によれば、文書取得部１１は、利用者の操作に基づいて文書の人手付与分類をさらに取得する。そして、表示部１４は、２値分類推定部１２によって推定された分類結果と、人手付与分類とを結果情報として表示する。そのため、利用者は、学習データに基づいて推定された分類結果を認識するとともに、文書を取得した利用者の判断を示す人手付与分類を考慮して、文書の分類を最終的に決定することができる。 As described above, according to the third embodiment, the document acquisition unit 11 further acquires the manual assignment classification of the document based on the operation of the user. Then, the display unit 14 displays the classification result estimated by the binary classification estimation unit 12 and the manually assigned classification as result information. Therefore, the user can recognize the classification result estimated based on the learning data, and finally decide the classification of the document in consideration of the manual assignment classification indicating the judgment of the user who acquired the document. it can.

［第４の実施形態］
続いて、本発明の第４の実施形態について図面を参照して説明する。 [Fourth embodiment]
Next, a fourth embodiment of the present invention will be described with reference to the drawings.

図５に示すように、第４の実施形態に係る可視化装置２は、第２の実施形態と同様に、文書取得部１１、２値分類推定部１２、格納部１３、表示部１４、及び分割部１５を備える。なお、構成ブロックは、第2の実施形態と同じであるので、構成ブロックの説明を省略する。 As shown in FIG. 5, the visualization device 2 according to the fourth embodiment includes a document acquisition unit 11, a binary classification estimation unit 12, a storage unit 13, a display unit 14, and a division unit, as in the second embodiment. A unit 15 is provided. Note that the configuration blocks are the same as in the second embodiment, and a description of the configuration blocks will be omitted.

第４の実施形態において、文書取得部１１は、利用者の操作に基づいて文書を取得するとともに、該文書に対応する人手付与分類を取得する。一例として、利用者は、Ｗｅｂアプリケーションを用いて、図１２（ａ）に示すような画面に「立ち上がりが悪い。電源を切って立ち上げても同じ。」という文書を入力し、人手付与分類としての故障分類に「ＨＤＤ」を入力する。そして、利用者は、送信ボタンを押下する。この場合、文書取得部１１は、「立ち上がりが悪い。電源を切って再度立ち上げても同じ。」という文書を取得する。さらに、「ＨＤＤ」を人手付与分類として取得する。 In the fourth embodiment, the document acquisition unit 11 acquires a document based on a user operation, and acquires a manual assignment classification corresponding to the document. As an example, the user uses a Web application to input a document such as “The startup is poor. The same is true even when the power is turned off and started.” On the screen shown in FIG. "HDD" is input in the failure classification of "1. Then, the user presses the transmission button. In this case, the document obtaining unit 11 obtains a document “The startup is poor. The same is true even if the power is turned off and restarted.” Further, “HDD” is acquired as a manual assignment classification.

そして、分割部１５は、文書取得部１１が取得した文書を「立ち上がりが悪い。」という文と「電源を切って再度立ち上げても同じ。」という文に分割する。 Then, the dividing unit 15 divides the document acquired by the document acquiring unit 11 into a sentence “Poor start-up.” And a sentence “Even if the power is turned off and started again.”

また、２値分類推定部１２は、分割部１５によって分割された各文が各分類に属するか否かを、学習データに基づいて、公知の機械推定の手法により推定する。例えば、２値分類推定部１２は、図１３に示すように、学習データに基づいて、「立ち上がりが悪い。」という文について、文が「電源」という分類に属さない（×）と推定し、「ＨＤＤ」という分類に属する（○）と推定し、「ディスプレイ」という分類に属さない（×）と推定し、「ファン」という分類に属さない（×）と推定したとする。 Further, the binary classification estimating unit 12 estimates whether or not each sentence divided by the dividing unit 15 belongs to each classification by using a known machine estimation method based on the learning data. For example, as shown in FIG. 13, the binary classification estimating unit 12 estimates that the sentence “bad startup” based on the learning data does not belong to the classification of “power” (×), and It is assumed that it is estimated that it belongs to the category of “HDD” (（), that it does not belong to the category of “display” (×), and that it does not belong to the category of “fan” (×).

また、２値分類推定部１２は、「電源を切って再度立ち上げても同じ。」という文について、学習データに基づいて文が「電源」という分類に属さない（×）と推定し、「ＨＤＤ」に属さない（×）と推定し、「ディスプレイ」に属さない（×）と推定し、「ファン」に属さない（×）と推定したとする。また、２値分類推定部１２は、「立ち上がりが悪い。電源を切って再度立ち上げても同じ。」という文書全体について、学習データに基づいて文書全体が「電源」という分類に属さない（×）と推定し、「ＨＤＤ」に属する（○）と推定し、「ディスプレイ」に属さない（×）と推定し、「ファン」に属さない（×）と推定したとする。 In addition, the binary classification estimating unit 12 estimates that the sentence “The same applies even if the power is turned off and restarted” is based on the learning data, and that the sentence does not belong to the classification “power” (×). It is assumed that it is estimated that it does not belong to “HDD”, that it does not belong to “display” (×), and that it does not belong to “fan” (×). Further, the binary classification estimating unit 12 does not belong to the classification of “power supply” based on the learning data for the entire document “The startup is bad. The same applies even if the power is turned off and restarted.” ), It is estimated that it belongs to “HDD” (○), it is estimated that it does not belong to “display” (×), and it is estimated that it does not belong to “fan” (×).

格納部１３は、結果情報を格納する。図１３に示すように、結果情報にはＩＤと、文書と、分割された各文と、２値分類推定部１２によって推定された、各文が各分類に属するか否かを示す分類結果、及び文書全体が各分類に属するか否かを示す分類結果と、人手付与分類とが含まれる。 The storage unit 13 stores the result information. As shown in FIG. 13, the result information includes an ID, a document, each divided sentence, and a classification result estimated by the binary classification estimation unit 12 indicating whether each sentence belongs to each classification, And a classification result indicating whether or not the entire document belongs to each classification, and a manual assignment classification.

上述のように、文書取得部１１によって「立ち上がりが悪い。電源を切って再度立ち上げても同じ。」という文書と、「ＨＤＤ」という人手付与分類とが取得され、２値分類推定部１２によって推定された場合に格納部１３が格納する結果情報について説明する。格納部１３は、図１３に示すように、ＩＤ「Ｄ」の「立ち上がりが悪い。電源を切って再度立ち上げても同じ。」という文書に対応して、人手付与分類である「ＨＤＤ」、機械推定による分類結果である電源「×」、ＨＤＤ「○」、ディスプレイ「×」、及びファン「×」を格納する。また、格納部１３は、分割された「立ち上がりが悪い。」という文に対応して、電源「×」、ＨＤＤ「○」、ディスプレイ「×」、ファン「×」を格納する。さらに、格納部１３は、分割された「電源を切って再度立ち上げても同じ。」という文に対応して、電源「×」、ＨＤＤ「×」、ディスプレイ「×」、ファン「×」を格納する。 As described above, the document acquisition unit 11 acquires the document “The startup is poor. The same applies even if the power is turned off and restarted.” And the manually assigned classification “HDD”. The result information stored in the storage unit 13 when it is estimated will be described. As shown in FIG. 13, the storage unit 13 responds to the document “ID has a poor start-up. The power supply “×”, the HDD “○”, the display “×”, and the fan “×” which are the classification results based on the machine estimation are stored. In addition, the storage unit 13 stores the power supply “×”, the HDD “」 ”, the display“ × ”, and the fan“ × ”in response to the divided sentence“ The startup is bad. ” Further, the storage unit 13 stores the power supply “×”, the HDD “×”, the display “×”, and the fan “×” in response to the divided statement “the same applies even if the power is turned off and restarted”. Store.

表示部１４は、格納部１３に記憶されている結果情報を表示する。具体的には、表示部１４は、文書、各文、各文が各分類に属するか否かを示す分類結果、各文を含む文書が各分類に属するか否かを示す分類結果、及び人手付与分類をディスプレイ等に表示する。 The display unit 14 displays the result information stored in the storage unit 13. Specifically, the display unit 14 displays a document, each sentence, a classification result indicating whether each sentence belongs to each classification, a classification result indicating whether a document including each sentence belongs to each classification, and The assignment classification is displayed on a display or the like.

例えば、表示部１４は、図１２（ｂ）に示すように、文書全体が機械学習による判定で「ＨＤＤ」という分類に属する可能性が高いと判定されたこと、及び入力された故障分類（人手付与分類）が「ＨＤＤ」であることを表示する。さらに表示部１４は、分割された「立ち上がりが悪い」という文が「ＨＤＤ」という分類に属する可能性が高いと判定されたこと、「電源を切って立ち上げても同じ」という文がいずれの分類にも属さない可能性が高いと判定されたことを表示する。なお、表示部１４は、通信ネットワークを介して結果情報を他の情報処理装置に送信してもよい。 For example, as illustrated in FIG. 12B, the display unit 14 determines that the entire document is determined to be highly likely to belong to the classification “HDD” by the determination based on the machine learning, and that the input failure classification (manually This indicates that the “assignment classification” is “HDD”. Further, the display unit 14 displays that the sentence “the start is poor” is highly likely to belong to the classification “HDD”, and the sentence “the same applies even when the power is turned off” is displayed. Displays that it is determined that there is a high possibility that it does not belong to the classification. The display unit 14 may transmit the result information to another information processing device via a communication network.

続いて、第４の実施形態における可視化装置４の可視化方法について図１４に示すシーケンス図を参照して説明する。 Subsequently, a visualization method of the visualization device 4 according to the fourth embodiment will be described with reference to a sequence diagram shown in FIG.

まず、文書取得部１１は、利用者の操作に基づいて文書及び人手付与分類を取得する（ステップＳ４１）。例えば、利用者の操作により、図１２（ａ）に示したように、「立ち上がりが悪い。電源を切って立ち上げても同じ。」という文書と、「ＨＤＤ」という人手付与分類とが入力された場合、入力された文書及び人手付与分類を取得する。 First, the document obtaining unit 11 obtains a document and a manual assignment classification based on a user operation (step S41). For example, as shown in FIG. 12A, a document “The startup is poor. The same is true even after the power is turned off.” And a manually assigned classification “HDD” are input by a user operation. In this case, the input document and the manually assigned classification are acquired.

本実施形態で可視化装置３が行う可視化方法のステップＳ４２からＳ４５の処理は、第２の実施形態のステップＳ２２からＳ２５の処理と同様である。 The processing of steps S42 to S45 of the visualization method performed by the visualization device 3 in the present embodiment is the same as the processing of steps S22 to S25 of the second embodiment.

ステップＳ４５で全ての文について推定された場合には、格納部１３が結果情報を格納する（ステップＳ４６）。第４の実施形態における結果情報は、第２の実施形態における分類結果に加えて、人手付与分類が含まれる。例えば、格納部１３は、図１３に示したような、ＩＤ「Ｄ」の「立ち上がりが悪い。電源を切って再度立ち上げても同じ。」という文書に対応して、人手付与分類である「ＨＤＤ」、機械推定による分類結果である電源「×」、ＨＤＤ「○」、ディスプレイ「×」、及びファン「×」を格納する。また、格納部１３は、分割された「立ち上がりが悪い。」という文に対応して、機械推定による分類結果である電源「×」、ＨＤＤ「○」、ディスプレイ「×」、ファン「×」を格納する。さらに、格納部１３は、分割された「電源を切って再度立ち上げても同じ。」という文に対応して、機械推定による分類結果である電源「×」、ＨＤＤ「×」、ディスプレイ「×」、ファン「×」を格納する。 If all sentences have been estimated in step S45, the storage unit 13 stores the result information (step S46). The result information according to the fourth embodiment includes a manual assignment classification in addition to the classification result according to the second embodiment. For example, as shown in FIG. 13, the storage unit 13 is a manual assignment classification corresponding to the document of ID “D” “The start is poor. The same applies even if the power is turned off and restarted.” HDD, power “x”, HDD “ある”, display “x”, and fan “x”, which are classification results based on machine estimation, are stored. In addition, the storage unit 13 stores the power supply “×”, the HDD “O”, the display “×”, and the fan “×”, which are the classification results based on the machine estimation, in response to the divided sentence “The rising is bad.” Store. Further, the storage unit 13 responds to the divided sentence “Even if the power is turned off and restarted.”, The power supply “x”, the HDD “x”, and the display “x” which are the classification results based on machine estimation. ”And fan“ x ”.

ステップＳ３４で結果情報が格納されると、表示部１４が、結果情報をディスプレイ等に表示する（ステップＳ４７）。表示部１４は、例えば、図１２（ｂ）に示したように、文書全体が機械学習による判定で「ＨＤＤ」という分類に属する可能性が高いと判定されたこと、及び入力された故障分類（人手付与分類）が「ＨＤＤ」であることを表示する。さらに表示部１４は、分割された「立ち上がりが悪い」という文が「ＨＤＤ」という分類に属する可能性が高いと判定されたこと、「電源を切って立ち上げても同じ」という文がいずれの分類にも属さない可能性が高いと判定されたことを表示する。 When the result information is stored in step S34, the display unit 14 displays the result information on a display or the like (step S47). For example, as illustrated in FIG. 12B, the display unit 14 determines that the entire document is determined to be highly likely to belong to the classification “HDD” by the determination based on the machine learning, and that the input failure classification ( This indicates that “manual assignment classification” is “HDD”. Further, the display unit 14 displays that the sentence “the start is poor” is highly likely to belong to the classification “HDD”, and the sentence “the same applies even when the power is turned off” is displayed. Displays that it is determined that there is a high possibility that it does not belong to the classification.

以上のように、第４の実施形態によれば、第２の実施形態と同様に、分割部１５が、文書を複数の文に分割する。そして、２値分類推定部１２は、複数の文それぞれが、分類それぞれに該当するか否かを推定する。そのため、利用者は、文書に含まれる文それぞれが、各分類に該当するかを認識することができる。また、第４の実施形態によれば、第３の実施形態と同様に、文書取得部１１は、利用者の操作に基づいて文書の人手付与分類をさらに取得する。そして、表示部１４は、２値分類推定部１２によって推定された分類と、人手付与分類とを結果情報として表示する。そのため、利用者は、学習データに基づいて推定された分類とともに文書を取得した利用者の判断を示す人手付与分類を認識することができる。したがって利用者が文書の属する分類を最終的に決定するときに、文ごとの分類結果及び人手付与分類の両方を鑑みて総合的な決定を行うことが可能となる。 As described above, according to the fourth embodiment, similarly to the second embodiment, the dividing unit 15 divides a document into a plurality of sentences. Then, the binary classification estimating unit 12 estimates whether or not each of the plurality of sentences corresponds to each of the classifications. Therefore, the user can recognize whether each sentence included in the document corresponds to each classification. Further, according to the fourth embodiment, similarly to the third embodiment, the document acquisition unit 11 further acquires the manual assignment classification of the document based on the operation of the user. Then, the display unit 14 displays the classification estimated by the binary classification estimation unit 12 and the manually assigned classification as result information. Therefore, the user can recognize the classification estimated based on the learning data and the manual assignment classification indicating the judgment of the user who acquired the document. Therefore, when the user finally decides the classification to which the document belongs, it is possible to make a comprehensive decision in consideration of both the classification result for each sentence and the manually assigned classification.

［第５の実施形態］
続いて、本発明の第５の実施形態について図面を参照して説明する。 [Fifth Embodiment]
Next, a fifth embodiment of the present invention will be described with reference to the drawings.

図５に示すように、第５の実施形態に係る可視化装置２は、第４の実施形態と同様に、文書取得部１１、２値分類推定部１２、格納部１３、表示部１４、及び分割部１５を備える。なお、構成ブロックは、第4の実施形態と同じであるので、構成ブロックの説明を省略する。 As shown in FIG. 5, the visualization device 2 according to the fifth embodiment includes a document acquisition unit 11, a binary classification estimation unit 12, a storage unit 13, a display unit 14, and a division unit, as in the fourth embodiment. A unit 15 is provided. Note that the configuration blocks are the same as those in the fourth embodiment, and a description of the configuration blocks will be omitted.

第５の実施形態においては、文書取得部１１は、複数の文書を取得するとともに、利用者の操作に基づいて各文書に対応する人手付与分類を取得する。例えば、文書取得部１１は、「電源を入れても画面に何も表示されない」という文書と、「ディスプレイ」という人手付与分類とを取得する。また、文書取得部１１は、「動作が非常に遅く、ファンの音がしない。立ち上がらないこともある。」という文書と、「ファン」という人手付与分類とを取得する。また、文書取得部１１は、「立ち上がりが悪い。電源を切って再度立ち上げても同じ。」という文書と、「ＨＤＤ」という人手付与分類とを取得する。 In the fifth embodiment, the document acquisition unit 11 acquires a plurality of documents and acquires a manual assignment classification corresponding to each document based on a user operation. For example, the document acquisition unit 11 acquires a document “Nothing is displayed on the screen even when the power is turned on” and a manual assignment classification “Display”. In addition, the document acquisition unit 11 acquires a document “The operation is extremely slow and the fan does not make a sound. There is a case where the fan does not start.” And a manual assignment classification “Fan”. In addition, the document acquisition unit 11 acquires a document “The startup is poor. The same applies even if the power is turned off and restarted.” And a manual assignment classification “HDD”.

この場合、２値分類推定部１２及び格納部１３は、「電源を入れても画面に何も表示されない。」という文書について、第３の実施形態と同様の処理を行う。また、分割部１５、２値分類推定部１２、及び格納部１３は、「動作が非常に遅く、ファンの音がしない。立ち上がらないこともある。」という文書と「立ち上がりが悪い。電源を切って再度立ち上げても同じ。」という文書とについて第４の実施形態と同様の処理を行う。 In this case, the binary classification estimating unit 12 and the storage unit 13 perform the same processing as that of the third embodiment on the document “Nothing is displayed on the screen even when the power is turned on”. The dividing unit 15, the binary classification estimating unit 12, and the storage unit 13 have a statement that "the operation is very slow and the fan does not make a sound. There is a case where the fan does not start." The same processing as in the fourth embodiment is performed for the document "Same as above when starting up again."

表示部１４は、格納部１３によって格納されている各文書についての結果情報を合わせてディスプレイ等に表示する。なお、表示部１４は、各文書についての結果情報を合わせて、通信ネットワークを介して他の情報処理装置に送信してもよい。 The display unit 14 displays the result information about each document stored in the storage unit 13 on a display or the like. The display unit 14 may transmit the result information of each document to another information processing device via a communication network together.

第５の実施形態における可視化装置２の可視化方法は、第３の実施形態の可視化方法と第４の実施形態の可視化方法との組合せである。具体的には、文書取得部１１は、複数の文書とそれぞれの文書に対応する分類を取得する。文書に１つの文のみが含まれる場合、可視化装置５は、第３の実施形態のステップＳ３２からＳ３４の処理を行う。また、文書に複数の文が含まれる場合、可視化装置２は、第４の実施形態のステップＳ４２からＳ４６の処理を行う。そして、格納部１３は、図１５に示すように、ステップＳ３４で格納された結果情報と、ステップＳ４６で格納された結果情報とを合わせて格納する。 The visualization method of the visualization device 2 according to the fifth embodiment is a combination of the visualization method of the third embodiment and the visualization method of the fourth embodiment. Specifically, the document acquisition unit 11 acquires a plurality of documents and a classification corresponding to each document. When the document includes only one sentence, the visualization device 5 performs the processing of steps S32 to S34 of the third embodiment. When the document includes a plurality of sentences, the visualization device 2 performs the processing of steps S42 to S46 of the fourth embodiment. Then, as shown in FIG. 15, the storage unit 13 stores the result information stored in step S34 and the result information stored in step S46 together.

以上のように、第５の実施形態によれば、文書取得部１１は、複数の文書と、それぞれの文書に対応する人手付与分類を取得する。そのため、利用者は、複数の文書それぞれについて、学習データに基づいて推定された分類と人手付与分類とを認識することができる。したがって利用者が複数の文書の属する分類を最終的に決定するときに、文書及び各文書に含まれる文それぞれの分類結果及び人手付与分類を鑑みて総合的な決定を行うことが可能となる。 As described above, according to the fifth embodiment, the document acquisition unit 11 acquires a plurality of documents and a manual assignment classification corresponding to each document. Therefore, the user can recognize the classification estimated based on the learning data and the manually assigned classification for each of the plurality of documents. Therefore, when the user finally decides the classification to which a plurality of documents belong, it is possible to make a comprehensive decision in view of the classification results of the documents and the sentences included in each document and the manual assignment classification.

また、上述の実施形態では、可視化装置１，２，３，４，５の各機能は、これらの機能を実現する処理内容を記述したプログラムをコンピュータの記憶部に記憶しておき、当該コンピュータのＣＰＵによってこのプログラムを読み出して実行することで実現してもよい。また、プログラムを記録媒体に記録したり、ネットワークを通して提供したりしてもよい。 Further, in the above-described embodiment, each function of the visualization devices 1, 2, 3, 4, and 5 stores a program describing processing contents for realizing these functions in a storage unit of the computer, and stores the program in the storage unit of the computer. It may be realized by reading and executing this program by the CPU. Further, the program may be recorded on a recording medium or provided through a network.

また、第１の実施形態では、図１６に示すように、可視化装置１は、さらに音声認識部１６を備えてもよい。音声認識部１６は、利用者が発した音声について音声認識処理を行い、文書に変換する。そして、文書取得部１１は、音声認識部１６によって変換された文書を取得し、上述と同様の処理を行う。これにより、音声で表される情報についても分類を推定することができる。第２〜第５の実施形態についても同様である。 Further, in the first embodiment, as shown in FIG. 16, the visualization device 1 may further include a voice recognition unit 16. The voice recognition unit 16 performs voice recognition processing on the voice uttered by the user and converts the voice into a document. Then, the document acquisition unit 11 acquires the document converted by the speech recognition unit 16, and performs the same processing as described above. This makes it possible to estimate the classification of information represented by speech. The same applies to the second to fifth embodiments.

上述の実施形態は代表的な例として説明したが、本発明の趣旨及び範囲内で、多くの変更及び置換ができることは当業者に明らかである。したがって、本発明は、上述の実施形態によって制限するものと解するべきではなく、特許請求の範囲から逸脱することなく、種々の変形や変更が可能である。 Although the above embodiments have been described as representative examples, it will be apparent to those skilled in the art that many changes and substitutions can be made within the spirit and scope of the present invention. Therefore, the present invention should not be construed as being limited by the above-described embodiment, and various modifications and changes can be made without departing from the scope of the claims.

１，２，３，４，５可視化装置
１１文書取得部
１２２値分類推定部
１３格納部
１４表示部
１５分割部
１６音声認識部 1,2,3,4,5 Visualization device 11 Document acquisition unit 12 Binary classification estimation unit 13 Storage unit 14 Display unit 15 Division unit 16 Voice recognition unit

Claims

A document acquisition unit for acquiring a document based on a user operation;
A dividing unit that divides the document into a plurality of sentences;
A binary classification estimating unit that estimates whether the document and the plurality of sentences each belong to each of a plurality of predetermined classifications,
Before SL document and the plurality of sentences, and a display unit in which the document and each of the plurality of statements to display the result information and a classification result indicating whether belonging to each of the classification,
A visualization device comprising:

A document acquisition unit configured to acquire a document and a manual assignment classification for the document based on a user operation;
A dividing unit that divides the document into a plurality of sentences;
A binary classification estimating unit that estimates whether the document and the plurality of sentences each belong to each of a plurality of predetermined classifications,
The document and the plurality of sentences, the classification result estimated by the binary classification estimation unit, and the result information including the classification result by the manual assignment classification, and the classification result included in the result information is the estimated classification A display unit that displays the result or information indicating whether the result is a classification result according to the manual assignment classification,
A visualization device comprising:

A visualization method performed by a visualization device that estimates a classification of a document,
Obtaining a document based on a user operation;
Dividing the document into a plurality of sentences;
Estimating whether or not each of the document and the plurality of sentences belongs to each of a plurality of predetermined classifications,
Displaying result information including the document and the plurality of sentences, and a classification result indicating whether each of the document and the plurality of sentences belongs to each of the classifications,
A visualization method comprising:

Based on the operation of the user, a document, and a step of acquiring a classification result by manual assignment classification for the document,
Dividing the document into a plurality of sentences;
Estimating whether or not each of the document and the plurality of sentences belongs to each of a plurality of predetermined classifications,
The document and the plurality of sentences, the estimated classification result, and the result information including the classification result by the manual assignment classification, and whether the classification result included in the result information is the estimated classification result, Displaying result information including information indicating whether the result is a classification result by the assigned classification;
A visualization method comprising:

A program for causing a computer to function as the visualization device according to claim 1.