JP4772022B2

JP4772022B2 - Data management system and data management method

Info

Publication number: JP4772022B2
Application number: JP2007260366A
Authority: JP
Inventors: 幸雄秋貞
Original assignee: Chugoku Electric Power Co Inc
Current assignee: Chugoku Electric Power Co Inc
Priority date: 2007-10-03
Filing date: 2007-10-03
Publication date: 2011-09-14
Anticipated expiration: 2027-10-03
Also published as: JP2009093231A

Description

この発明は、電子化されたデータが記憶媒体に書き込まれて持ち出される際に、このデータを管理するデータ管理システムおよびデータ管理方法に関する。 The present invention relates to a data management system and a data management method for managing data when the digitized data is written on a storage medium and taken out.

電子化されたデータを記憶する記憶媒体として、ＵＳＢメモリ等の小型記憶媒体が広く普及している。こうした小型記憶媒体を用いると、データの持ち出しが容易であるために、データの持ち出しを管理する必要がある。特に、データが機密情報や業務情報をデータ化したものである場合には、各種の重要情報がデータに含まれているので、データの持ち出し管理は企業等において特に重要である。 As a storage medium for storing digitized data, a small storage medium such as a USB memory is widely used. When such a small storage medium is used, it is easy to take out data, so it is necessary to manage the taking out of data. In particular, when the data is confidential information or business information converted into data, various important information is included in the data. Therefore, data take-out management is particularly important in companies and the like.

このために、機密情報であることを示す機密フラグを用いて、データを管理するシステムがある（例えば、特許文献１参照。）。このシステムでは、あらかじめデータに対して、データの内容が機密情報であることを示す機密フラグを関連付けて保存している。そして、データが読み出された場合に、関連付けされた機密フラグを用いて、データが機密情報に該当するかどうかを判定し、機密情報である場合に、データを読み出した端末のＩＤ（ＩＤｅｎｔｉｆｉｃａｔｉｏｎ）を記憶して、データを管理している。
特開２００７−１１５２１号公報 For this reason, there is a system for managing data using a confidential flag indicating confidential information (see, for example, Patent Document 1). In this system, a confidential flag indicating that the content of the data is confidential information is stored in association with the data in advance. When the data is read, it is determined whether the data corresponds to the confidential information using the associated confidential flag. If the data is the confidential information, the ID (IDentification) of the terminal from which the data is read is determined. Is stored and data is managed.
JP 2007-11521 A

ところで、先に述べたシステムには次の課題がある。つまり、このシステムでは、各データに対してあらかじめ機密フラグを関連付けておく必要がある。つまり、事前にデータが機密情報かどうかを判断して、データを振り分ける必要がある。しかし、企業等で使用されるデータは更新されることがあり、こうした場合に、データが機密情報になるときや、その逆のときもある。特に、データが更新された場合に機密情報となったときに、機密フラグの変更がないと、データが機密扱いにはならない。つまり、機密フラグを事前に振り分けておくと、データの管理上、問題が発生する。 By the way, the system described above has the following problems. That is, in this system, it is necessary to associate a confidential flag with each data in advance. That is, it is necessary to determine whether the data is confidential information in advance and distribute the data. However, data used in companies and the like may be updated, and in such cases, the data may become confidential information and vice versa. In particular, if the confidential information is changed when the data is updated, the data will not be classified unless the confidential flag is changed. In other words, if the secret flag is assigned in advance, a problem occurs in data management.

この発明の目的は、前記の課題を解決し、データが機密情報かどうかを事前に振り分けることを不要にして、データの持ち出しを管理する、データ管理システムおよびデータ管理方法を提供することにある。 An object of the present invention is to provide a data management system and a data management method for solving the above-mentioned problems and managing the taking-out of data without making it necessary to sort out whether the data is confidential information in advance.

前記の課題を解決するために、請求項１の発明は、データの書き込みができる記憶媒体を利用可能であると共に組織体の各グループで使用されるパーソナルコンピュータと、該パーソナルコンピュータが取り扱うデータの中の機密情報を管理する管理サーバと、機密情報の管理に必要とするデータとして、該グループ毎に重要度が付加されたグループ別重要度データを記憶するデータベースサーバとがデータ通信可能に接続されているデータ管理システムであって、前記管理サーバは、所定の解析方法によって、収集した各文書に共通して現れる単語を低く重み付けを行い、特定の文書に現れる単語を高く重み付けを行い、該解析方法によって重み付けをした単語をキーワードとすると共に重みの大きさに応じて重要度を表す値を該キーワードに付加し、該キーワードと、該キーワードを使用するグループとを関連付けて機密データとして前記データベースサーバに記憶し、前記パーソナルコンピュータは、前記記憶媒体にデータを書き込むときに、該データから単語を抽出し、該単語と前記データベースサーバの機密データのキーワードとを対比して、該データが機密情報かどうかを判断し、該データが機密情報である場合に、該データに含まれるキーワードを使用するグループがあると、該キーワードをグループ別に分け、グループ毎に分けられた各キーワードの重要度を集計し、集計して得たグループ別の重要度と前記グループ別重要度データとを比較し、グループの重要度に比べて集計した重要度が大きい場合に、該グループの機密情報に該データが該当すると判断する、ことを特徴とするデータ管理システムである。 In order to solve the above-mentioned problems, the invention of claim 1 is characterized in that a storage medium on which data can be written can be used and a personal computer used in each group of an organization and data handled by the personal computer. A management server that manages the confidential information of the group and a database server that stores the importance data for each group to which the importance is added for each group as data necessary for the management of the confidential information is connected to be able to perform data communication. The management server is configured such that the management server weights words that appear commonly in each collected document by a predetermined analysis method, and weights words that appear in a specific document high, and the analysis method The keyword weighted by the keyword is used as a keyword, and a value representing the importance according to the weight is assigned to the keyword. Added to, and the keyword, stored in the database server as confidential data in association with each group using the keyword, the personal computer, when writing data in the storage medium, extracts words from the data The word and the keyword of the confidential data of the database server are compared to determine whether the data is confidential information. When the data is confidential information, a group that uses the keyword included in the data is If there is, the keyword is divided into groups, the importance of each keyword divided for each group is totaled, the group-by-group importance obtained by the aggregation is compared with the group-level importance data, and the group importance If time is greater importance and aggregated as compared to, it is determined that the data corresponds to the confidential information of the group, A data management system characterized and.

請求項１の発明では、機密情報を表すキーワードを機密データとして、あらかじめデータベースサーバが記憶している。こうした状態のときに、ＵＳＢメモリなどの記憶媒体にデータを書き込む際に、パーソナルコンピュータは、書き込まれるデータから単語を抽出する。この後、パーソナルコンピュータは、データから抽出した単語と機密データのキーワードとを対比して、データが機密情報かどうかを判断する。 In the invention of claim 1, the database server stores in advance keywords representing confidential information as confidential data. In such a state, when writing data to a storage medium such as a USB memory, the personal computer extracts words from the written data. Thereafter, the personal computer compares the word extracted from the data with the keyword of the confidential data to determine whether the data is confidential information.

請求項２の発明は、請求項１に記載のデータ管理システムにおいて、前記パーソナルコンピュータは、前記記憶媒体に書き込むデータが機密情報であるとき、該データのログ情報を前記データベースサーバに送り、前記データベースサーバは、前記パーソナルコンピュータからログ情報を受け取ると、該ログ情報を記録しておくことを特徴とする。 According to a second aspect of the present invention, in the data management system according to the first aspect, when the data written to the storage medium is confidential information, the personal computer sends log information of the data to the database server , and the database When the server receives log information from the personal computer, the server records the log information.

請求項３の発明は、請求項２に記載のデータ管理システムにおいて、前記ログ情報は、データを前記記憶媒体に書き込む実行ユーザと、該データに付けられている名称と、該データを書き込む日付けおよび時刻の少なくとも一方とからなることを特徴とする。 According to a third aspect of the present invention, in the data management system according to the second aspect, the log information includes an execution user who writes data to the storage medium, a name given to the data, and a date on which the data is written. And at least one of time.

請求項４の発明は、請求項２または３に記載のデータ管理システムにおいて、前記パーソナルコンピュータは、書き込むデータが機密情報であるとき、該データの持ち出し理由の入力を要求し、該持ち出し理由が特定の持ち出し理由であるとき、該データと該持ち出し理由とを前記ログ情報に付加することを特徴とする。 According to a fourth aspect of the present invention, in the data management system according to the second or third aspect , when the data to be written is confidential information, the personal computer requests input of the reason for taking out the data, and the reason for taking out is specified. The data and the reason for taking out are added to the log information.

請求項５の発明は、データの書き込みができる記憶媒体を利用可能であると共に組織体の各グループで使用されるパーソナルコンピュータと、該パーソナルコンピュータが取り扱うデータの中の機密情報を管理する管理サーバと、機密情報の管理に必要とするデータとして、該グループ毎に重要度が付加されたグループ別重要度データを記憶するデータベースサーバとがデータ通信可能に接続されているシステムのデータ管理方法であって、前記管理サーバにより、所定の解析方法によって、収集した各文書に共通して現れる単語を低く重み付けを行い、特定の文書に現れる単語を高く重み付けを行い、該解析方法によって重み付けをした単語をキーワードとすると共に重みの大きさに応じて重要度を表す値を該キーワードに付加し、該キーワードと、該キーワードを使用するグループとを関連付けて機密データとして前記データベースサーバに記憶し、前記パーソナルコンピュータにより、前記記憶媒体にデータを書き込むときに、該データから単語を抽出し、該単語と前記データベースサーバの機密データのキーワードとを対比して、該データが機密情報かどうかを判断し、該データが機密情報である場合に、該データに含まれるキーワードを使用するグループがあると、該キーワードをグループ別に分け、グループ毎に分けられた各キーワードの重要度を集計し、集計して得たグループ別の重要度と前記グループ別重要度データとを比較し、グループの重要度に比べて集計した重要度が大きい場合に、該グループの機密情報に該データが該当すると判断する、ことを特徴とするデータ管理方法である。 According to a fifth aspect of the present invention, there is provided a personal computer that can use a storage medium on which data can be written and is used in each group of an organization, and a management server that manages confidential information in data handled by the personal computer. A data management method for a system in which, as data necessary for management of confidential information, a database server that stores importance data for each group to which importance is added for each group is connected so that data communication is possible. The management server assigns a low weight to words commonly appearing in each collected document by a predetermined analysis method, assigns a high weight to words appearing in a specific document, and assigns words weighted by the analysis method to keywords. And a value indicating importance according to the size of the weight is added to the keyword, and the key Is stored in the database server as confidential data in association with a group that uses the keyword, and when the data is written to the storage medium by the personal computer, the word is extracted from the data, In contrast to the keyword of the confidential data of the database server, it is determined whether or not the data is confidential information. When the data is confidential information, if there is a group that uses the keyword included in the data, The keywords are divided into groups, the importance of each keyword divided for each group is aggregated, and the importance for each group obtained by aggregation is compared with the importance data for each group, and compared with the importance of the group. If aggregation degree of importance is high, it is determined that the data corresponds to the confidential information of the group, to characterized in that It is a data management method.

請求項１および請求項５の発明によれば、データを記録媒体に書き込む時点で、書き込みデータが機密情報であるかどうかを判断するので、従来のように、データが更新されたときに、データが機密情報から外れて管理されないといった、データの更新の影響を受けることを防ぐことができる。しかも、組織の各グループに応じて、持ち出しファイルを管理することができる。 According to the first and fifth aspects of the present invention, since it is determined whether or not the written data is confidential information when the data is written to the recording medium, the data is updated when the data is updated as in the prior art. Can be prevented from being affected by data updates, such as being unmanaged from confidential information. In addition, files taken out can be managed according to each group of the organization.

請求項２の発明によれば、記憶媒体に書き込むデータが機密情報であると、ログ情報をデータベースサーバに記憶するので、データが漏洩した場合、漏洩したデータの関連情報とログ情報とを照合することにより、情報の流出元を特定することを可能にする。 According to the invention of claim 2, when the data written to the storage medium is confidential information, the log information is stored in the database server. Therefore, when the data is leaked, the related information of the leaked data and the log information are collated. This makes it possible to identify the source of information leakage.

請求項３の発明によれば、ログ情報はデータを記憶媒体に書き込む実行ユーザ、データに付けられている名称、書き込みの日時からなるので、ログ情報のデータ量を少なくして、記憶手段の負担を軽くすることができる。 According to the invention of claim 3, since the log information includes the execution user who writes the data to the storage medium, the name attached to the data, and the date and time of writing, the data amount of the log information is reduced and the load on the storage means is reduced. Can be lightened.

請求項４の発明によれば、記憶媒体に書き込むデータの持ち出し理由が所定の持ち出し理由であるとき、このデータと持ち出し理由とをログ情報に付加して記録するので、持ち出し理由毎にデータを管理することを可能にする。また、データが組織体から漏洩した場合、漏洩した内容とログ情報のデータとを照合することにより、情報の流出元を特定することを可能にする。 According to the invention of claim 4 , when the reason for taking out the data to be written into the storage medium is a predetermined reason for taking out, the data and the reason for taking out are added to the log information and recorded, so that data is managed for each reason for taking out. Make it possible to do. In addition, when data leaks from an organization, it is possible to identify the leaked source of information by collating the leaked content with log information data.

次に、この発明の実施の形態について、図面を用いて詳しく説明する。企業内の重要情報には機密情報や業務情報などがあるが、以下の実施の形態では、企業内で扱われる機密情報を例として説明する。つまり、以下の実施の形態では、機密情報をデータ化したものを管理対象のデータとする。 Next, embodiments of the present invention will be described in detail with reference to the drawings. Important information in a company includes confidential information and business information. In the following embodiment, confidential information handled in a company will be described as an example. In other words, in the following embodiment, confidential information is converted into data as management target data.

（実施の形態１）
この実施の形態によるデータ管理システムについて、図１〜図３を用いて説明する。このデータ管理システムは、図１に示すように、企業に設けられている営業部門の各パーソナルコンピュータ（ＰＣ）１１、…、総務部門の各パーソナルコンピュータ（ＰＣ）１２が取り扱うデータの中で、データ化された機密情報を管理するものであり、各パーソナルコンピュータ１１および各パーソナルコンピュータ１２に加えて、各種データを管理する管理サーバ１と、データを記憶するデータベースサーバ２と、管理担当者によって操作される管理用パーソナルコンピュータ（ＰＣ）３とを備えている。管理サーバ１、データベースサーバ２、管理用パーソナルコンピュータ３、各パーソナルコンピュータ１１および各パーソナルコンピュータ１２は、ローカルエリアネットワークＮＷによってデータ通信可能に接続されている。 (Embodiment 1)
A data management system according to this embodiment will be described with reference to FIGS. As shown in FIG. 1, this data management system includes data among the data handled by each personal computer (PC) 11 in the sales department provided in the company, and each personal computer (PC) 12 in the general affairs department. Is managed by a person in charge of management, a management server 1 for managing various data, a database server 2 for storing data, in addition to each personal computer 11 and each personal computer 12. A management personal computer (PC) 3. The management server 1, the database server 2, the management personal computer 3, each personal computer 11, and each personal computer 12 are connected by a local area network NW so that data communication is possible.

管理サーバ１は、企業内で扱われる各種のデータをデータベースサーバ２に記憶して管理する。管理サーバ１は、パーソナルコンピュータ１１やパーソナルコンピュータ１２から、データのログ情報を受信すると、受信したログ情報をデータベースサーバ２に記憶する。 The management server 1 stores various data handled in the company in the database server 2 for management. When the management server 1 receives data log information from the personal computer 11 or the personal computer 12, the management server 1 stores the received log information in the database server 2.

また、管理サーバ１がデータベースサーバ２に記録するデータとして、社員データがある。社員データは各社員の所属などを記録したものであり、この社員データを図４に示す。この社員データには、企業内で社員を識別するための社員番号と、社員の氏名が記録されている。また、社員データには、企業内のコンピュータなどを使用する際の識別情報であるユーザＩＤ（ＩＤｅｎｔｉｆｉｃａｔｉｏｎ）と、パスワードが記録されている。さらに、社員データには、社員の企業内での所属（グループ）などが記録されている。こうした社員データは、データベースサーバ２の社員ＤＢ（データベース）に記録されている。 In addition, employee data is data that the management server 1 records in the database server 2. The employee data records the affiliation of each employee, and this employee data is shown in FIG. In the employee data, an employee number for identifying the employee in the company and the name of the employee are recorded. In the employee data, a user ID (IDentification), which is identification information when using a computer in a company, and a password are recorded. Furthermore, the employee data records employee affiliations (groups) within the company. Such employee data is recorded in an employee DB (database) of the database server 2.

管理サーバ１がデータベースサーバ２に記録するデータとして、機密データがある。機密データは文書の中で機密情報を特徴付ける単語をデータ化したものであり、管理サーバ１は機密データを次のようにして作成する。管理サーバ１は、パーソナルコンピュータ１１、１２で取り扱われる文書の中で、機密情報を含む文書を収集する。例えば、既に機密扱いなどにされている文書、図面などを含む文書、研究部門などのような特定のグループが取り扱う文書などが収集対象の文書となる。 There is confidential data as data that the management server 1 records in the database server 2. The confidential data is a word that characterizes confidential information in a document, and the management server 1 creates the confidential data as follows. The management server 1 collects documents including confidential information among documents handled by the personal computers 11 and 12. For example, documents that are already classified, such as documents that include drawings, documents that are handled by a specific group such as a research department, and the like are documents to be collected.

管理サーバ１は、機密情報を含む文書を収集すると、ＴＦ（ＴｅｒｍＦｒｅｑｕｅｎｃｙ）・ＩＤＦ（ＩｎｖｅｒｓｅＤｏｃｕｍｅｎｔＦｒｅｑｕｅｎｃｙ）法などのような解析方法を用いて、各文書に含まれる単語に対して重み付けを行う。ＴＦ・ＩＤＦ法は、収集した文書において、高い頻度で出てくる単語の重要度を下げ、低い頻度で出てくる単語の重要度を上げて、単語に対する重み付けを行う。これにより、収集した各文書に現れる、共通の単語の重要度は低く重み付けされ、特定の文書に現れる単語は高く重み付けされる。 When collecting documents including confidential information, the management server 1 weights words included in each document using an analysis method such as a TF (Term Frequency) / IDF (Inverse Document Frequency) method. The TF / IDF method weights words by reducing the importance of words appearing frequently and increasing the importance of words appearing less frequently in the collected documents. Thereby, the importance of common words appearing in each collected document is weighted low, and the words appearing in a specific document are weighted high.

管理サーバ１は、各文書中の単語に対して重み付けを行ってキーワードとし、各キーワードを重みの大きさによって分類する。そして、管理サーバ１は、分類したキーワードに対して重要度を表す値を付け、機密データとしてデータベースサーバ２の機密情報ＤＢ（データベース）に記憶する。データベースサーバ２に記憶される機密データを図５に示す。この機密データは、ＴＦ・ＩＤＦ法によって抽出されて重みが付けられたキーワードに対して、「１」〜「５」の数値が重要度として付与されている。さらに、機密データには、キーワードが所属するグループが記録されている。つまり、各グループには特有のキーワードがある。 The management server 1 weights the words in each document to obtain keywords, and classifies each keyword according to the magnitude of the weight. Then, the management server 1 attaches a value indicating the importance to the classified keyword and stores it as confidential data in the confidential information DB (database) of the database server 2. The confidential data stored in the database server 2 is shown in FIG. In the confidential data, numerical values “1” to “5” are assigned as importance levels to keywords that are extracted by the TF / IDF method and weighted. Further, the group to which the keyword belongs is recorded in the confidential data. That is, each group has a unique keyword.

なお、管理担当者が管理用パーソナルコンピュータ３を操作して、機密データのキーワードを編集することも可能である。つまり、機密データに対してキーワードの追加や重要度の変更等も行える。 It is also possible for the manager in charge to edit the keyword of the confidential data by operating the management personal computer 3. In other words, keywords can be added to the confidential data and the importance can be changed.

管理サーバ１がデータベースサーバ２に記録するデータとして、グループ別重要度データがある。グループ別重要度データは企業内の各グループの重要度を表すものであり、このグループ別重要度データを図６に示す。このグループ別重要度データには、企業内の各部門をそれぞれのグループとし、各グループの重要度を数値で表している。こうしたグループ別重要度データは、データベースサーバ２のグループ別サーバ送信判定ＤＢ（データベース）に記録されている。 As data recorded in the database server 2 by the management server 1, there is importance data for each group. The importance data for each group represents the importance of each group in the company, and this importance data for each group is shown in FIG. In the importance data for each group, each department in the company is defined as each group, and the importance of each group is expressed numerically. Such group importance data is recorded in the group server transmission determination DB (database) of the database server 2.

営業部門で使用されるパーソナルコンピュータ（ＰＣ）１１や、総務部門で使用されるパーソナルコンピュータ（ＰＣ）１２は基本的に同様であるので、パーソナルコンピュータ１１を例として説明する。パーソナルコンピュータ１１は、図２に示すように、ＵＳＢメモリ２０が利用可能なコンピュータであり、図３に示すように、データなどを伝送するバス１１Ａに接続されている処理部１１Ｂ、記憶部１１Ｃ、表示部１１Ｄ、入力部１１Ｅ、ＵＳＢインターフェース（Ｉ／Ｆ）１１Ｆ、読み取り部１１Ｇおよび通信部１１Ｈを備えている。 Since the personal computer (PC) 11 used in the sales department and the personal computer (PC) 12 used in the general affairs department are basically the same, the personal computer 11 will be described as an example. As shown in FIG. 2, the personal computer 11 is a computer that can use the USB memory 20, and as shown in FIG. 3, a processing unit 11B, a storage unit 11C, and the like connected to a bus 11A for transmitting data and the like. A display unit 11D, an input unit 11E, a USB interface (I / F) 11F, a reading unit 11G, and a communication unit 11H are provided.

入力部１１Ｅはキーボードやマウスなどの入力装置であり、入力部１１Ｅにはデータや各種の指示等が入力される。表示部１１Ｄは、ＬＣＤ（液晶ディスプレイ）などの表示装置であり、処理部１１Ｂの制御によって、入力部１１Ｅに入力されたデータ等を表示する。ＵＳＢインターフェース１１Ｆは、ＵＳＢメモリ２０を接続するためのインターフェースであり、処理部１１Ｂの制御によって、ＵＳＢメモリ２０に対してデータの読み書きを行う。読み取り部１１Ｇは、手書きの文字や印刷された文字を光学的に読み取って画像化する装置である。通信部１１Ｈは、処理部１１Ｂの制御によって、ローカルエリアネットワークＮＷを用いて、データの送受信を管理サーバ１などと行う。記憶部１１Ｃは、パーソナルコンピュータ１１のための各種のプログラムを記憶している。また、入力部１１Ｅに入力されたデータを、処理部１１Ｂの制御によって一時的に記憶する。 The input unit 11E is an input device such as a keyboard and a mouse, and data, various instructions, and the like are input to the input unit 11E. The display unit 11D is a display device such as an LCD (Liquid Crystal Display), and displays data and the like input to the input unit 11E under the control of the processing unit 11B. The USB interface 11F is an interface for connecting the USB memory 20, and reads / writes data from / to the USB memory 20 under the control of the processing unit 11B. The reading unit 11G is an apparatus that optically reads a handwritten character or a printed character to form an image. The communication unit 11H performs data transmission / reception with the management server 1 and the like using the local area network NW under the control of the processing unit 11B. The storage unit 11C stores various programs for the personal computer 11. Further, the data input to the input unit 11E is temporarily stored under the control of the processing unit 11B.

処理部１１Ｂは、パーソナルコンピュータ１１に関する各種の処理を行う。例えば、パーソナルコンピュータ１１が起動された後、処理部１１Ｂは、パーソナルコンピュータ１１を使用状態にするために、入力部１１Ｅに入力されたユーザＩＤとパスワードを記憶部１１Ｃに記憶する。 The processing unit 11 </ b> B performs various processes related to the personal computer 11. For example, after the personal computer 11 is activated, the processing unit 11B stores the user ID and password input to the input unit 11E in the storage unit 11C in order to put the personal computer 11 into a use state.

また、処理部１１Ｂは、記憶部１１Ｃが記憶している各種のプログラムを実行する。処理部１１Ｂが実行するプログラムには、ＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅａｄｅｒ）処理がある。ＯＣＲ処理は、読み取り部１１Ｇなどが読み取った画像から、文字を識別してテキストに変換する処理である。 Further, the processing unit 11B executes various programs stored in the storage unit 11C. The program executed by the processing unit 11B includes OCR (Optical Character Reader) processing. The OCR process is a process of identifying a character from an image read by the reading unit 11G or the like and converting it into text.

処理部１１Ｂが実行するプログラムには、書き込み処理がある。書き込み処理は、ＵＳＢインターフェース１１ＦにＵＳＢメモリ２０が接続されたときに、実行されるプログラムであり、この書き込み処理により、ＵＳＢメモリ２０に対するファイル形式のデータなどの書き込みが可能になる。つまり、書き込み処理は、ＵＳＢメモリ２０等の小型記憶媒体用の書き込み専用のプログラムである。書き込み処理によってＵＳＢメモリ２０に書き込みが可能なデータには、文書ファイルや画像ファイルなどがある。 The program executed by the processing unit 11B includes a writing process. The writing process is a program that is executed when the USB memory 20 is connected to the USB interface 11F. By this writing process, file format data can be written to the USB memory 20. That is, the writing process is a write-only program for a small storage medium such as the USB memory 20. Data that can be written to the USB memory 20 by the writing process includes a document file and an image file.

処理部１１Ｂが実行するプログラムには、データ判定処理がある。データ判定処理は、ＵＳＢメモリ２０のような小型記憶媒体にファイル等のデータを書き込む場合に、機密情報がデータの内容に含まれているかどうかを判定して管理するためのプログラムである。データ判定処理は書き込み処理が起動された後で行われる。このデータ判定処理を図７〜図１１に示す。なお、小型記憶媒体に対する書き込み専用のプログラムにデータ判定処理を付加してもよい。つまり、書き込み処理とデータ判定処理とを一組として、小型記憶媒体に対する書き込み専用のプログラムとしてもよい。 The program executed by the processing unit 11B includes a data determination process. The data determination process is a program for determining and managing whether or not confidential information is included in the data contents when writing data such as a file in a small storage medium such as the USB memory 20. The data determination process is performed after the writing process is activated. This data determination process is shown in FIGS. Note that data determination processing may be added to a program dedicated to writing on a small storage medium. In other words, the writing process and the data determination process may be combined into a single program for writing to a small storage medium.

処理部１１Ｂは、データ判定処理を開始すると、ファイル指定処理を行う（ステップＳ１）。ステップＳ１のファイル指定処理は図８に示す処理であり、処理部１１Ｂは、小型記憶媒体に書き込むファイル（以下、「持ち出しファイル」という）と、持ち出しファイルの書き込み先（以下、「持ち出し先」という）とを指定するための入力画面を表示して（ステップＳ２１）、持ち出しファイルと持ち出し先の入力待ちとなる（ステップＳ２２）。この実施の形態では、持ち出し先がＵＳＢメモリ２０である。ステップＳ２２の後、持ち出しファイルと持ち出し先を指定するための入力が有ると（ステップＳ２３）、処理部１１Ｂは、持ち出しファイルの書き込み指示の入力を待つ（ステップＳ２４）。書き込み指示が有ると（ステップＳ２５）、処理部１１Ｂはファイル指定処理を終了する。なお、ステップＳ２３で持ち出しファイルと持ち出し先を指定するための入力が無い場合、また、ステップＳ２５で書き込み指示が無い場合、処理部１１Ｂは処理をステップＳ２２に戻す。 When the data determination process is started, the processing unit 11B performs a file designation process (step S1). The file designation process in step S1 is the process shown in FIG. 8, and the processing unit 11B has a file to be written to the small storage medium (hereinafter referred to as “take-out file”) and a write-out destination of the take-out file (hereinafter referred to as “take-out destination”). ) Is displayed (step S21), and the input of the takeout file and the takeout destination is awaited (step S22). In this embodiment, the take-out destination is the USB memory 20. After step S22, if there is an input for designating a take-out file and a take-out destination (step S23), the processing unit 11B waits for an input of a take-out file write instruction (step S24). If there is a write instruction (step S25), the processing unit 11B ends the file designation process. If there is no input for designating the take-out file and the take-out destination in step S23, or if there is no write instruction in step S25, the processing unit 11B returns the process to step S22.

ステップＳ１のファイル指定処理が終了すると、処理部１１Ｂは、持ち出しファイルからテキストを抽出して（ステップＳ２）、テキストの有無を判断する（ステップＳ３）。持ち出しファイルにテキストが有ると、処理部１１Ｂは、機密情報を表すキーワードをパターンマッチングで調べる（ステップＳ４）。ステップＳ４のパターンマッチングは、持ち出しファイルのテキストと、機密データ（図５）のキーワードとを比較して、持ち出しファイルに含まれているキーワードを見つけ出す処理である。また、処理部１１Ｂは、ファイルを持ち出す実行ユーザ、実行日時、持ち出しファイルのファイル名をログ情報とする。実行ユーザは、例えば営業部門の担当者がパーソナルコンピュータ１１の操作を開始するときに、パーソナルコンピュータ１１に入力する、社員データ（図４）のユーザＩＤとパスワードからなる情報であり、記憶部１１Ｃが記憶している情報である。 When the file designating process in step S1 ends, the processing unit 11B extracts text from the taken-out file (step S2) and determines whether there is text (step S3). If there is text in the taken-out file, the processing unit 11B checks a keyword representing confidential information by pattern matching (step S4). The pattern matching in step S4 is a process of finding out a keyword included in the take-out file by comparing the text of the take-out file with the keyword of the confidential data (FIG. 5). In addition, the processing unit 11B uses the execution user who takes out the file, the execution date and time, and the file name of the taken-out file as log information. The execution user is information including the user ID and password of employee data (FIG. 4) input to the personal computer 11 when a person in charge of the sales department starts operation of the personal computer 11, for example. It is memorized information.

この後、処理部１１Ｂは、ステップＳ４のパターンマッチングの結果から、持ち出しファイルが機密情報に該当するかどうかを判断する（ステップＳ６）。持ち出しファイルが機密情報であると、処理部１１Ｂは、グループ化処理を行う（ステップＳ７）。ステップＳ７のグループ化処理は図９に示す処理であり、処理部１１Ｂは、機密情報の重要度をグループ別に集計する（ステップＳ４１）。例えば図１２に示すように、テキストが「交通費」、「見積金額」といったキーワードを含む場合、処理部１１Ｂは、機密データ（図５）を用いて、グループ別にキーワードを分ける。この後、処理部１１Ｂは、各キーワードの重要度を集計し、グループ別の重要度を算出する。 Thereafter, the processing unit 11B determines whether the taken-out file corresponds to confidential information from the result of pattern matching in step S4 (step S6). If the taken-out file is confidential information, the processing unit 11B performs a grouping process (step S7). The grouping process in step S7 is the process shown in FIG. 9, and the processing unit 11B counts the importance of the confidential information for each group (step S41). For example, as illustrated in FIG. 12, when the text includes keywords such as “transportation cost” and “estimated amount”, the processing unit 11 </ b> B divides the keywords into groups using confidential data (FIG. 5). Thereafter, the processing unit 11B aggregates the importance of each keyword and calculates the importance for each group.

ステップＳ４１が終了すると、処理部１１Ｂは、集計によって得たグループ別の重要度と、グループ別重要度データ（図６）とを比較して（ステップＳ４２）、集計によって得たグループ別の重要度が大きいかどうかを判断する（ステップＳ４３）。集計によって得たグループ別の重要度がグループ別重要度データ（図６）に比べて大きい場合、持ち出しファイルが機密情報であり、かつ、特定グループの機密情報に該当するとして、処理部１１Ｂは、持ち出しファイルの中身であるファイル本体と、検出した機密情報を表すキーワードとをログ情報に付加して、ステップＳ５のログ情報を変更し（ステップＳ４４）、グループ化処理を終了する。なお、ステップＳ４３で、集計によって得たグループ別の重要度がグループ別重要度データ（図６）に比べて小さい場合、処理部１１ＢはステップＳ５のログ情報を変更しないで、グループ化処理を終了する。 When step S41 is completed, the processing unit 11B compares the importance for each group obtained by the aggregation and the importance data for each group (FIG. 6) (step S42), and the importance for each group obtained by the aggregation. Whether or not is large is determined (step S43). When the importance by group obtained by the aggregation is larger than the importance data by group (FIG. 6), the processing unit 11B assumes that the taken-out file is confidential information and corresponds to the confidential information of a specific group. The file body, which is the content of the take-out file, and a keyword representing the detected confidential information are added to the log information, the log information in step S5 is changed (step S44), and the grouping process ends. In step S43, if the importance by group obtained by the aggregation is smaller than the importance data by group (FIG. 6), the processing unit 11B ends the grouping process without changing the log information in step S5. To do.

ステップＳ７のグループ化処理が終了すると、処理部１１Ｂは、持ち出し理由指定処理を行う（ステップＳ８）。ステップＳ８の持ち出し理由指定処理は図１０に示す処理であり、処理部１１Ｂは、表示部１１Ｄを制御して、持ち出し理由選択画面を表示する（ステップＳ６１）。この持ち出し理由選択画面には、各持ち出し理由が表示される。持ち出し理由としては、例えば「委託先へ提供」や「社外に持ち出す」などがある。ステップＳ６１の後、処理部１１Ｂは、持ち出し理由の選択入力を待つ（ステップＳ６２）。選択入力が無ければ（ステップＳ６３）、処理部１１Ｂは処理をステップＳ６２に戻す。 When the grouping process in step S7 ends, the processing unit 11B performs a take-out reason designation process (step S8). The take-out reason designation process in step S8 is the process shown in FIG. 10, and the processing unit 11B controls the display unit 11D to display a take-out reason selection screen (step S61). Each take-out reason is displayed on the take-out reason selection screen. The reason for taking out is, for example, “providing to a contractor” or “taking outside”. After step S61, the processing unit 11B waits for selection input of the take-out reason (step S62). If there is no selection input (step S63), the processing unit 11B returns the process to step S62.

ステップＳ６３で持ち出し理由の選択入力が有ると、処理部１１Ｂは、選択された持ち出し理由の中で、あらかじめ指定されている特定の持ち出し理由（以下、「特定理由」という）が有るかどうかを判断する（ステップＳ６４）。選択された持ち出し理由に特定理由が有ると、処理部１１Ｂは、選択された持ち出し理由と、持ち出しファイルの中身であるファイル本体と、検出した機密情報を表すキーワードとをログ情報に付加して、ログ情報を変更し（ステップＳ６５）、持ち出し理由指定処理を終了する。もし、ステップＳ６４で、選択された持ち出し理由に特定理由が無いと、処理部１１Ｂは、ログ情報を変更しないでグループ化処理を終了する。また、ログ情報がファイル本体と、検出した機密情報を表すキーワードとを既に含むとき、処理部１１Ｂは、このログ情報に対して、選択された持ち出し理由だけを付加してログ情報を変更する。 If there is a selection input for the reason for take-out in step S63, the processing unit 11B determines whether there is a specific take-out reason (hereinafter referred to as “specific reason”) designated in advance among the selected reasons for take-out. (Step S64). If there is a specific reason for the selected take-out reason, the processing unit 11B adds the selected take-out reason, the file body that is the content of the take-out file, and a keyword representing the detected confidential information to the log information, The log information is changed (step S65), and the take-out reason designation process is terminated. If there is no specific reason for the selected take-out reason in step S64, the processing unit 11B ends the grouping process without changing the log information. Further, when the log information already includes a file body and a keyword representing the detected confidential information, the processing unit 11B changes the log information by adding only the selected take-out reason to the log information.

ステップＳ８が終了すると、処理部１１Ｂは、通信部１１Ｈを制御して、ログ情報を管理サーバ１に送信する（ステップＳ９）。この後、処理部１１Ｂは、ＵＳＢインターフェース１１Ｆに接続されているＵＳＢメモリ２０、つまり小型記憶媒体に持ち出しファイルを書き込み（ステップＳ１０）、データ判定処理を終了する。 When step S8 ends, the processing unit 11B controls the communication unit 11H to transmit log information to the management server 1 (step S9). Thereafter, the processing unit 11B writes the taken-out file in the USB memory 20 connected to the USB interface 11F, that is, the small storage medium (step S10), and ends the data determination process.

一方、ステップＳ３で、持ち出しファイルにテキストが無いと、処理部１１Ｂは、持ち出しファイルに対してＯＣＲ処理を行う（ステップＳ１１）。つまり、処理部１１Ｂは、持ち出しファイルから文字を識別して文書に変換する処理を行う。この後、処理部１１Ｂは、テキストを抽出したかどうかを判断する（ステップＳ１２）。テキストを抽出すると、処理部１１Ｂは、ステップＳ４以降の処理を行う。 On the other hand, if there is no text in the taken-out file in step S3, the processing unit 11B performs OCR processing on the taken-out file (step S11). That is, the processing unit 11B performs processing for identifying characters from the taken-out file and converting them into a document. Thereafter, the processing unit 11B determines whether or not text has been extracted (step S12). When the text is extracted, the processing unit 11B performs the processing after step S4.

また、テキストを抽出しないと、処理部１１Ｂは、ファイル数判別処理を行う（ステップＳ１３）。ステップＳ１３の持ち出しファイル数判別処理は図１１に示す処理であり、処理部１１Ｂは、ステップＳ１で指定された持ち出しファイルの数を調べ（ステップＳ８１）、持ち出しファイルが複数かどうかを判断する（ステップＳ８２）。持ち出しファイルが複数有ると、処理部１１Ｂは、表示部１１Ｄを制御して、持ち出し不可のエラーメッセージを表示し（ステップＳ８３）、ファイル数判別処理を終了する。ステップＳ８２で、持ち出しファイルが１つであると、処理部１１Ｂは、ファイルを持ち出す実行ユーザ、実行日時、持ち出しファイルのファイル名をログ情報とする（ステップＳ８４）。この後、処理部１１Ｂは、通信部１１Ｈを制御して、ログ情報を管理サーバ１に送信し（ステップＳ８５）、ＵＳＢインターフェース１１Ｆに接続されているＵＳＢメモリ２０に持ち出しファイルを書き込み（ステップＳ８６）、ファイル数判別処理を終了する。 If no text is extracted, the processing unit 11B performs a file number determination process (step S13). The taken-out file number determination process in step S13 is the process shown in FIG. 11, and the processing unit 11B checks the number of taken-out files specified in step S1 (step S81) and determines whether there are a plurality of taken-out files (step S81). S82). When there are a plurality of files to be taken out, the processing unit 11B controls the display unit 11D to display an error message indicating that the file cannot be taken out (step S83), and ends the file number determination process. If there is only one file to be taken out in step S82, the processing unit 11B uses the execution user who takes out the file, the execution date and time, and the file name of the taken-out file as log information (step S84). Thereafter, the processing unit 11B controls the communication unit 11H to transmit log information to the management server 1 (step S85), and writes a take-out file to the USB memory 20 connected to the USB interface 11F (step S86). Then, the file number determination process is terminated.

このようにして、処理部１１Ｂは、図７〜図１１に示すデータ判定処理を行う。そして、データ判定処理が終わると、続いて処理部１１Ｂは書き込み処理を終了する。 In this way, the processing unit 11B performs the data determination process shown in FIGS. When the data determination process ends, the processing unit 11B then ends the write process.

次に、この実施の形態のデータ管理システムを用いたデータ管理方法について説明する。管理サーバ１は、文書の中で機密情報を特徴付ける単語をデータ化して機密データ（図５）を作成し、このデータをデータベースサーバ２に記憶する。このとき、管理サーバ１は、パーソナルコンピュータ１１、１２で取り扱われる文書の中で、機密情報を含む文書を収集して、機密データ（図５）を作成する。また、管理担当者が管理用パーソナルコンピュータ３を操作して、機密データ（図５）を作成してもよい。さらに、データベースサーバ２が記憶している機密データ（図５）は、管理用パーソナルコンピュータ３からの修正が可能である。 Next, a data management method using the data management system of this embodiment will be described. The management server 1 creates confidential data (FIG. 5) by converting words characterizing confidential information in the document into data, and stores this data in the database server 2. At this time, the management server 1 collects documents including confidential information among documents handled by the personal computers 11 and 12, and creates confidential data (FIG. 5). Further, the person in charge of management may operate the management personal computer 3 to create confidential data (FIG. 5). Furthermore, the confidential data (FIG. 5) stored in the database server 2 can be corrected from the management personal computer 3.

こうした状態の場合に、例えば営業の担当者がパーソナルコンピュータ１１を使用するとき、パーソナルコンピュータ１１を起動し、社員データ（図４）に登録されているユーザＩＤとパスワードをパーソナルコンピュータ１１に入力して、パーソナルコンピュータ１１を使用状態にする。このとき、パーソナルコンピュータ１１は、入力されたユーザＩＤとパスワードを記憶する。こうして、担当者は、パーソナルコンピュータ１１を使用する。この後、ＵＳＢメモリ２０を使用するために、担当者がＵＳＢメモリ２０をパーソナルコンピュータ１１に差し込む。さらに、パーソナルコンピュータ１１に記憶されているファイルをＵＳＢメモリ２０に書き込もうとすると、書き込み専用のプログラムが実行されて、書き込み処理が行われ、続いて、データ判定処理が行われる。 In such a state, for example, when a sales representative uses the personal computer 11, the personal computer 11 is activated and the user ID and password registered in the employee data (FIG. 4) are input to the personal computer 11. Then, the personal computer 11 is put into use. At this time, the personal computer 11 stores the input user ID and password. Thus, the person in charge uses the personal computer 11. Thereafter, the person in charge inserts the USB memory 20 into the personal computer 11 in order to use the USB memory 20. Furthermore, when a file stored in the personal computer 11 is to be written to the USB memory 20, a write-only program is executed, a writing process is performed, and then a data determination process is performed.

この後、担当者により、ファイルが指定されて、書き込みの指示がパーソナルコンピュータ１１に入力された場合に、持ち出しファイルにテキストがあるとき、パーソナルコンピュータ１１は、持ち出しファイルの内容が機密情報であるかどうかに応じて、また、持ち出しファイルの重要度および持ち出し理由に応じて、ＵＳＢメモリ２０にファイルを書き込む際に、次のログ情報をデータベースサーバ２に送信する。
ａ．持ち出しファイルが機密情報でない場合、実行ユーザ、実行日時、ファイル名からなるログ情報を送信する。
ｂ．持ち出しファイルが機密情報である場合、あらかじめ設定された各グループの重要度に比べて集計された重要度が低いとき、実行ユーザ、実行日時、ファイル名からなるログ情報を送信する。
ｃ．持ち出しファイルが機密情報である場合、あらかじめ設定された各グループの重要度に比べて集計された重要度が高いとき、実行ユーザ、実行日時、ファイル名、ファイル本体、検出したキーワードからなるログ情報を送信する。
ｅ．持ち出しファイルが機密情報である場合、持ち出し理由に特定理由が含まれているとき、実行ユーザ、実行日時、ファイル名、ファイル本体、持ち出し理由からなるログ情報を送信する。
ｆ．持ち出しファイルが機密情報である場合、あらかじめ設定された各グループの重要度に比べて集計された重要度が高く、しかも、持ち出し理由に特定理由が含まれているとき、実行ユーザ、実行日時、ファイル名、ファイル本体、検出したキーワード、持ち出し理由からなるログ情報を送信する。 Thereafter, when a file is designated by the person in charge and a write instruction is input to the personal computer 11, if there is text in the take-out file, the personal computer 11 determines whether the content of the take-out file is confidential information. Depending on whether or not the file is written in the USB memory 20 according to the importance of the file to be taken out and the reason for taking it out, the next log information is transmitted to the database server 2.
a. If the taken-out file is not confidential information, log information including the execution user, execution date and time, and file name is transmitted.
b. When the taken-out file is confidential information, log information including an execution user, an execution date and time, and a file name is transmitted when the importance calculated is lower than the importance of each group set in advance.
c. When the taken-out file is confidential information, log information consisting of the execution user, execution date / time, file name, file body, and detected keyword is displayed when the importance is higher than the pre-set importance of each group. Send.
e. When the taken-out file is confidential information, when a specific reason is included in the taken-out reason, log information including the execution user, the execution date and time, the file name, the file body, and the reason for taking out is transmitted.
f. If the file to be taken out is confidential information, the importance calculated is higher than the importance of each group set in advance, and if the reason for taking out includes a specific reason, the execution user, execution date and time, file Send log information consisting of name, file body, detected keyword, and reason for taking out.

管理サーバ１は、こうしたログ情報を受信すると、このログ情報をデータベースサーバ２に記録して、持ち出しファイルの情報を管理する。もし、機密情報の流出が発生すると、管理担当者は、管理用パーソナルコンピュータ３を操作して、データベースサーバ２に記録したログ情報や社員データ（図４）を参照することにより、また、ファイル本体や持ち出し理由などにより、情報の流出元などを探すことが可能である。 When the management server 1 receives such log information, the management server 1 records the log information in the database server 2 and manages information on the taken-out file. If confidential information is leaked, the person in charge of management operates the management personal computer 3 to refer to log information and employee data (FIG. 4) recorded in the database server 2, and also to the file body. It is possible to find out the source of information leaks depending on the reason for taking it out.

こうして、この実施の形態によれば、ＵＳＢメモリ２０等の小型記憶媒体にデータを書き込むときに、機密かどうかを判断するので、従来のようにデータの更新の影響を受けることがない。また、この実施の形態によれば、データを小型記憶媒体に書き込む際に、このデータが機密情報であるかどうかを判断するので、従来のように、ファイルの更新時に機密フラグの更新等を行うことを不要にすることができる。さらに、持ち出しファイルが機密情報である場合、重要度や持ち出し理由に応じて、ファイル本体をログ情報としてデータベースサーバ２に記録するかどうかを判断するので、データベースサーバ２に保存するログ情報のデータ量を小さくすることができ、データベースサーバ２の負担を軽くすることが可能である。しかも、グループに応じて、また、持ち出し理由に応じて、持ち出しファイルを管理することができる。例えば、漏洩した内容とファイル本体とを照合することにより、情報の流出元を特定することを可能にする。 Thus, according to this embodiment, when data is written to a small storage medium such as the USB memory 20, it is determined whether or not it is confidential, so that it is not affected by data update as in the prior art. Also, according to this embodiment, when data is written to a small storage medium, it is determined whether or not the data is confidential information, so that the confidential flag is updated when the file is updated as in the prior art. Can be made unnecessary. Further, when the taken-out file is confidential information, it is determined whether or not the file body is recorded in the database server 2 as log information according to the importance or the reason for taking out, so the data amount of the log information stored in the database server 2 And the burden on the database server 2 can be reduced. In addition, it is possible to manage take-out files according to the group and according to the reason for take-out. For example, it is possible to identify the information leakage source by collating the leaked content with the file body.

（実施の形態２）
この実施の形態では、実施の形態１のデータ判定処理に代わって、簡略化したものを用いる。なお、この実施の形態では、先の実施の形態と同一もしくは同一と見なされる構成要素には、それと同じ参照符号を付けて、その説明を省略する。この実施の形態では、図１３のデータ判定処理を用いる。つまり、このデータ判定処理は、実施の形態１のデータ判定処理において、ステップＳ７のグループ化処理と、ステップＳ８の持ち出し理由指定処理を省略する。そして、これらの処理の代わりに、処理部１１Ｂは、ステップＳ６の後、持ち出しファイルの中身であるファイル本体と、検出した機密情報を表すキーワードとをログ情報に付加して、ログ情報を変更し（ステップＳ７Ａ）、次のステップＳ９の処理を行う。 (Embodiment 2)
In this embodiment, instead of the data determination process of the first embodiment, a simplified one is used. In this embodiment, components that are the same as or the same as those in the previous embodiment are given the same reference numerals, and descriptions thereof are omitted. In this embodiment, the data determination process of FIG. 13 is used. That is, this data determination process omits the grouping process in step S7 and the take-out reason designation process in step S8 in the data determination process of the first embodiment. Instead of these processes, the processing unit 11B changes the log information after step S6 by adding the file body that is the contents of the taken-out file and a keyword representing the detected confidential information to the log information. (Step S7A), the next step S9 is performed.

この実施の形態によれば、小型記憶媒体に書き込むときに、機密かどうかを判断するので、従来のようにデータの更新の影響を受けることがない。また、この実施の形態によれば、ファイルを小型記憶媒体に書き込む際に、このファイルが機密情報であるかどうかを判断するので、従来のように、ファイルの更新時に機密フラグの更新等を行うことを不要にすることができる。さらに、この実施の形態によれば、ファイル本体をログ情報としてデータベースサーバ２に保存するので、漏洩した内容とファイル本体とを照合することにより、情報の流出元を特定することを可能にする。 According to this embodiment, when writing to a small storage medium, since it is determined whether or not it is confidential, there is no influence of data update as in the conventional case. Also, according to this embodiment, when writing a file to a small storage medium, it is determined whether or not the file is confidential information. Therefore, the confidential flag is updated at the time of updating the file as in the prior art. Can be made unnecessary. Furthermore, according to this embodiment, since the file body is stored in the database server 2 as log information, it is possible to identify the leaked information source by collating the leaked content with the file body.

（実施の形態３）
この実施の形態では、実施の形態１のデータ判定処理を簡略化したものを用いる。なお、この実施の形態では、先の実施の形態と同一もしくは同一と見なされる構成要素には、それと同じ参照符号を付けて、その説明を省略する。この実施の形態では、実施の形態１のデータ判定処理において、ステップＳ７のグループ化処理と、ステップＳ８の持ち出し理由指定処理のどちらか一方を省略する。 (Embodiment 3)
In this embodiment, a simplified version of the data determination process of the first embodiment is used. In this embodiment, components that are the same as or the same as those in the previous embodiment are given the same reference numerals, and descriptions thereof are omitted. In this embodiment, in the data determination process of the first embodiment, either the grouping process in step S7 or the take-out reason specifying process in step S8 is omitted.

こうした実施の形態によれば、重要度に応じて、または、持ち出し理由により、ファイル本体をログ情報としてデータベースサーバ２に記録するかどうかを判断するので、データベースサーバ２に保存するログ情報のデータ量を小さくすることができる。 According to such an embodiment, since it is determined whether to record the file body as log information in the database server 2 according to the importance level or for the reason of taking out, the data amount of the log information stored in the database server 2 Can be reduced.

以上、この発明の各実施の形態を詳述してきたが、具体的な構成は各実施の形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計の変更等があっても、この発明に含まれる。例えば、各実施の形態では、小型記憶媒体としてＵＳＢメモリ２０を例としたが、小型記憶媒体はこれに限定されることなく、例えば、ＵＳＢ接続が可能な大容量記憶装置などであってもよい。 As mentioned above, although each embodiment of this invention has been described in detail, the specific configuration is not limited to each embodiment, and even if there is a design change or the like without departing from the gist of this invention, It is included in this invention. For example, in each embodiment, the USB memory 20 is taken as an example of the small storage medium. However, the small storage medium is not limited to this, and may be, for example, a mass storage device capable of USB connection. .

この発明の実施の形態１によるデータ管理システムを示す構成図である。It is a block diagram which shows the data management system by Embodiment 1 of this invention. パーソナルコンピュータとＵＳＢメモリを示す斜視図である。It is a perspective view which shows a personal computer and a USB memory. 各部門で用いられているパーソナルコンピュータの一例を示す構成図である。It is a block diagram which shows an example of the personal computer used in each division. データベースサーバが記憶する社員データの一例を示す図である。It is a figure which shows an example of the employee data which a database server memorize | stores. データベースサーバが記憶する機密データの一例を示す図である。It is a figure which shows an example of the confidential data which a database server memorize | stores. データベースサーバが記憶するグループ別重要度データの一例を示す図である。It is a figure which shows an example of the importance data classified by group which a database server memorize | stores. データ判定処理を示すフローチャートである。It is a flowchart which shows a data determination process. ファイル指定処理を示すフローチャートである。It is a flowchart which shows a file specification process. グループ化処理を示すフローチャートである。It is a flowchart which shows a grouping process. 持ち出し理由指定処理を示すフローチャートである。It is a flowchart which shows a take-out reason designation | designated process. ファイル数判別処理を示すフローチャートである。It is a flowchart which shows a file number discrimination | determination process. 機密情報のグループ化を説明する図である。It is a figure explaining grouping of confidential information. 実施の形態２によるデータ判定処理を示すフローチャートである。10 is a flowchart illustrating data determination processing according to the second embodiment.

Explanation of symbols

１管理サーバ
２データベースサーバ（記憶手段）
１１、１２パーソナルコンピュータ
２０ＵＳＢメモリ（記憶媒体） 1 management server 2 database server (storage means)
11, 12 Personal computer 20 USB memory (storage medium)

Claims

Necessary for managing confidential information, a personal computer that can use a storage medium that can write data, and that is used in each group of the organization, a management server that manages confidential information in the data handled by the personal computer, and As a data management system, a data base system that stores importance data for each group to which importance is added for each group is connected to the database server so that data communication is possible.
The management server weights words that appear in common in each collected document by a predetermined analysis method, weights words that appear in a specific document high, and uses the words weighted by the analysis method as keywords. And adding a value representing importance in accordance with the size of the weight to the keyword, associating the keyword with a group using the keyword and storing it as confidential data in the database server,
The personal computer, when writing data in the storage medium, extracts words from the data, and comparing the keyword of the sensitive data of the database server and said word, said data to determine whether confidential information When the data is confidential information, if there is a group that uses the keywords included in the data, the keywords are divided into groups, and the importance of each keyword divided for each group is tabulated and tabulated. Comparing the obtained importance by group and the importance data by group, and determining that the data corresponds to the confidential information of the group when the importance calculated is larger than the importance of the group;
A data management system characterized by that.

When the data to be written to the storage medium is confidential information, the personal computer sends log information of the data to the database server ,
The data management system according to claim 1, wherein when the database server receives log information from the personal computer, the database server records the log information.

3. The log information includes an execution user who writes data to the storage medium, a name given to the data, and at least one of a date and time at which the data is written. Data management system.

The personal computer requests input of the reason for taking out the data when the data to be written is confidential information, and adds the data and the reason for taking out to the log information when the reason for taking out is a specific reason for taking out. 4. The data management system according to claim 2 or 3 , wherein:

Necessary for managing confidential information, a personal computer that can use a storage medium that can write data, and that is used in each group of the organization, a management server that manages confidential information in the data handled by the personal computer, and A data management method for a system in which a database server that stores importance data for each group to which importance is added for each group is connected so as to be capable of data communication,
The management server assigns a low weight to words that appear in common in each collected document by a predetermined analysis method, weights a word that appears in a specific document high, and uses the words weighted by the analysis method as keywords. And adding a value representing importance in accordance with the size of the weight to the keyword, associating the keyword with a group using the keyword and storing it as confidential data in the database server,
When writing data to the storage medium by the personal computer, a word is extracted from the data, and the word is compared with a keyword of confidential data of the database server to determine whether the data is confidential information. When the data is confidential information, if there is a group that uses the keywords included in the data, the keywords are divided into groups, and the importance of each keyword divided for each group is tabulated and tabulated. Comparing the obtained importance by group and the importance data by group, and determining that the data corresponds to the confidential information of the group when the importance calculated is larger than the importance of the group;
A data management method characterized by the above.