JP5501896B2 - Pattern extraction apparatus, pattern extraction method, and pattern extraction program

Info

Publication number: JP5501896B2
Application number: JP2010180662A
Authority: JP
Inventors: 精一紺谷; 明通田中; 匡内山
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2010-08-12
Filing date: 2010-08-12
Publication date: 2014-05-28
Anticipated expiration: 2030-08-12
Also published as: JP2012038272A

Description

The present invention relates to supervised machine learning, and in particular to a pattern extraction method for classifying data such as text and images.

Conventionally, linear discriminant analysis has been used as a pattern extraction method, as described in Non-Patent Document 1. Given n data points x_i ∈ R^m and their labels y_i ∈ {1, …, c}, linear discriminant analysis extracts patterns z_i ∈ R^(c−1), c < m, that are easy to classify, as follows.

The vectors w_i are chosen to maximize J(w), the ratio of the between-class variance to the within-class variance.

They are obtained by solving the following generalized eigenvalue problem.

The w_i are the eigenvectors belonging to the c−1 largest eigenvalues.
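
For orientation, the textbook form of Fisher's criterion and the associated generalized eigenvalue problem can be written as below. The symbols S_B (between-class scatter matrix) and S_W (within-class scatter matrix), and the index l for the eigenvectors, are assumptions taken from the standard formulation rather than from this document's own equations.

```latex
J(w) = \frac{w^{\mathsf T} S_B\, w}{w^{\mathsf T} S_W\, w},
\qquad
S_B\, w_l = \lambda_l\, S_W\, w_l \quad (l = 1, \dots, c-1),
\qquad
z_i = (w_1, \dots, w_{c-1})^{\mathsf T} x_i
```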

For example, given the class 1 and class 2 data shown in FIG. 9, the vector w is obtained as the straight line labeled w in FIG. 9. In this example the number of classes is c = 2, so a 1 (= c−1)-dimensional feature is obtained. The extracted pattern is shown in FIG. 10, where class 1 and class 2 are well separated along the x-axis. (In FIG. 10 the y-axis values are offset so that the results for class 1 and class 2 are easy to see.)
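
A minimal sketch of the two-class case may help here. For c = 2 the leading eigenvector of the standard problem is proportional to S_W^(-1)(m2 − m1), where m1 and m2 are the class means. The function names and the toy data are illustrative assumptions and are not taken from FIG. 9.

```python
def fisher_direction_2d(class1, class2):
    """Return the unnormalized Fisher direction w = S_W^{-1} (m2 - m1) for 2-D points."""
    def mean(points):
        n = len(points)
        return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)

    def scatter(points, m):
        # Entries of the symmetric 2x2 scatter matrix: sum of (x - m)(x - m)^T.
        sxx = sum((p[0] - m[0]) ** 2 for p in points)
        sxy = sum((p[0] - m[0]) * (p[1] - m[1]) for p in points)
        syy = sum((p[1] - m[1]) ** 2 for p in points)
        return sxx, sxy, syy

    m1, m2 = mean(class1), mean(class2)
    a1, b1, d1 = scatter(class1, m1)
    a2, b2, d2 = scatter(class2, m2)
    a, b, d = a1 + a2, b1 + b2, d1 + d2  # S_W = [[a, b], [b, d]]
    det = a * d - b * b
    if det == 0:
        raise ValueError("within-class scatter matrix is singular")
    dx, dy = m2[0] - m1[0], m2[1] - m1[1]
    # Closed-form 2x2 inverse applied to the difference of the class means.
    return ((d * dx - b * dy) / det, (-b * dx + a * dy) / det)


def project(points, w):
    """Project each 2-D point onto w, giving the 1-D feature z."""
    return [p[0] * w[0] + p[1] * w[1] for p in points]


# Toy data (hypothetical): two unit squares shifted along the x-axis.
class1 = [(0.0, 0.0), (1.0, 1.0), (0.0, 1.0), (1.0, 0.0)]
class2 = [(4.0, 0.0), (5.0, 1.0), (4.0, 1.0), (5.0, 0.0)]
w = fisher_direction_2d(class1, class2)
z1, z2 = project(class1, w), project(class2, w)
```

With these toy points the two sets of projected values do not overlap. If the class means coincide, m2 − m1 is zero and the direction degenerates, which is the failure mode discussed for FIG. 11.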

Non-Patent Document 1: C. M. Bishop (translated by Hiroshi Motoda, Takio Kurita, Tomoyuki Higuchi, Yuji Matsumoto, and Noboru Murata), "Pattern Recognition and Machine Learning, Vol. 1" (パターン認識と機械学習 上), Springer Japan, December 2007, pp. 177-190.

However, the linear discriminant analysis described above has the following three problems.
1. Only features of dimension c−1 (the number of classes minus one) can be obtained.
2. Because the data of each class is assumed to follow a Gaussian distribution, the method does not fit multimodal data.
3. Only features that separate the class means can be obtained.

In particular, given the class 1 and class 2 data shown in FIG. 11, the vector w is obtained as the straight line labeled w in FIG. 11, and the extracted pattern is as shown in FIG. 12. Because the class 1 and class 2 data in FIG. 11 have nearly equal means and the class 1 data is multimodal, class 1 and class 2 cannot be separated along the x-axis. (In FIG. 12 the y-axis values are offset so that the results for class 1 and class 2 are easy to see.)

The present invention solves the above problems. Its main object is to provide a pattern extraction apparatus that obtains features with a number of dimensions equal to or greater than the number of classes and that can classify even multimodal data.

One aspect of the pattern extraction apparatus of the present invention is a pattern extraction apparatus that extracts, from input data, a pattern for performing class classification, comprising: intra-class similarity calculation means for calculating the intra-class similarity of the input data; inter-class distance calculation means for calculating the inter-class distance of the input data; compression distortion calculation means for calculating the compression distortion, which is the difference between the input data and the extracted pattern; projection matrix calculation means for obtaining, from the data and the compressed representation, a projection matrix that reduces the compression distortion, and for normalizing the projection matrix; and compressed representation calculation means for calculating, from the data and the projection matrix, a compressed representation that reduces the compression distortion and increases the intra-class similarity and the inter-class distance; wherein the pattern for performing class classification is extracted based on the projection matrix and the compressed representation.

One aspect of the pattern extraction method of the present invention is a pattern extraction method for extracting, from input data, a pattern for performing class classification, comprising: an intra-class similarity calculation step in which intra-class similarity calculation means calculates the intra-class similarity of the input data; an inter-class distance calculation step in which inter-class distance calculation means calculates the inter-class distance of the input data; a step in which compression distortion calculation means calculates the compression distortion, which is the difference between the input data and the extracted pattern; a projection matrix calculation step in which projection matrix calculation means obtains, from the data and the compressed representation, a projection matrix that reduces the compression distortion, and normalizes the projection matrix; and a compressed representation calculation step in which compressed representation calculation means calculates, from the data and the projection matrix, a compressed representation that reduces the compression distortion and increases the intra-class similarity and the inter-class distance; wherein the pattern for performing class classification is extracted based on the projection matrix and the compressed representation.

The present invention can also be configured as a program that causes a computer to function as each of the means constituting the pattern extraction apparatus. The program may be provided over a network or stored on a recording medium.

According to the present invention, it is possible to provide a pattern extraction apparatus that obtains features with a number of dimensions equal to or greater than the number of classes and that can classify even multimodal data.

FIG. 1 is a block diagram of the pattern extraction apparatus in the embodiment.
FIG. 2 is a flowchart showing the processing steps of the pattern extraction apparatus in the embodiment.
FIG. 3 is a hardware configuration diagram of the pattern extraction apparatus of the embodiment.
FIG. 4 is a diagram showing the data in a specific example of the embodiment.
FIG. 5 is a diagram showing the data and W in the specific example of the embodiment (t = 1).
FIG. 6 is a diagram showing the data and W in the specific example of the embodiment (t = 2).
FIG. 7 is a diagram showing the data and W in the specific example of the embodiment (t = 3).
FIG. 8 is a diagram showing the data in the specific example of the embodiment.
FIG. 9 is a diagram showing the data and w in the conventional pattern extraction method.
FIG. 10 is a diagram showing the data in the conventional pattern extraction method.
FIG. 11 is a diagram showing the data and w in the conventional pattern extraction method.
FIG. 12 is a diagram showing the data in the conventional pattern extraction method.

The present invention extracts, from input data, a pattern for performing class classification. When extracting the pattern, it corrects the pattern so that data within the same class are placed close together and data of different classes are placed far apart. As a result, the extracted patterns become easy to separate between classes.

In addition, reducing the difference (compression distortion) between the given data and the extracted pattern serves to retain as much of the information in the original data as possible. As a result, features with more dimensions than the number of classes can be extracted effectively.

[Embodiment]
≪Configuration example≫
The pattern extraction apparatus 1 of this embodiment is described with reference to FIGS. 1 to 3.

The pattern extraction apparatus 1 is used for supervised machine learning and classifies data such as text and images.

As shown in FIG. 3, the pattern extraction apparatus 1 is implemented on a computer and has the hardware resources of an ordinary computer: a ROM 11, a RAM 12, a CPU 13, a communication interface (I/F) 14, a hard disk drive 15, a storage medium reader 16, a storage medium 17, and so on.

Through the cooperation of these hardware resources with software resources, the pattern extraction apparatus 1 comprises, as shown in FIG. 1, data input means 2, management means 3, intra-class similarity calculation means 4, inter-class distance calculation means 5, compression distortion calculation means 6, projection matrix calculation means 7, compressed representation calculation means 8, and result output means 9.

The pattern extraction apparatus 1 of this embodiment receives the data X = (x_1, …, x_n) through the data input means 2. The management means 3 calculates the mean μ of all the data (x_1, …, x_n) and subtracts μ from every data point.

The intra-class similarity calculation means 4 then calculates the similarity of data within each class, and the inter-class distance calculation means 5 calculates the distance between data of different classes. The compression distortion calculation means 6 calculates the difference (compression distortion) between the given data and the extracted pattern.

The projection matrix calculation means 7 obtains, from the data and the compressed representation H, a projection matrix W that reduces the compression distortion. The compressed representation calculation means 8 obtains, from the data X and the projection matrix W, a compressed representation H that reduces the compression distortion and increases the intra-class similarity and the inter-class distance.

The pattern extraction apparatus 1 of this embodiment takes as input the data X = (x_1, …, x_n), x_i ∈ R^m; the labels y ∈ {1, …, c} (c is the number of classes); the number of feature dimensions k to extract (0 < k < m); the weight α ≥ 0 placed on intra-class similarity; the weight β ≥ 0 placed on inter-class distance; the distance scale σ > 0; and the number of iterations T > 0. It outputs the projection matrix W, the mean μ of all the data, and the compressed representation H.
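
The input constraints listed above can be collected in a small parameter record, sketched below. The class name ExtractionParams and its validate method are hypothetical. In the usage line, k = 1, T = 3 and m = 2 follow the specific example given later, while the values of alpha, beta and sigma are placeholders only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExtractionParams:
    k: int        # number of feature dimensions to extract, 0 < k < m
    alpha: float  # weight on intra-class similarity, alpha >= 0
    beta: float   # weight on inter-class distance, beta >= 0
    sigma: float  # distance scale, sigma > 0
    T: int        # number of iterations, T > 0

    def validate(self, m):
        """Raise ValueError if a constraint stated in the text is violated.

        m is the dimension of each data point x_i.
        """
        if not 0 < self.k < m:
            raise ValueError("k must satisfy 0 < k < m")
        if self.alpha < 0 or self.beta < 0:
            raise ValueError("alpha and beta must be >= 0")
        if self.sigma <= 0:
            raise ValueError("sigma must be > 0")
        if self.T <= 0:
            raise ValueError("T must be > 0")


# alpha, beta and sigma are placeholder values, not taken from the document.
params = ExtractionParams(k=1, alpha=1.0, beta=1.0, sigma=1.0, T=3)
params.validate(m=2)
```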

FIG. 2 is a flowchart showing the processing steps (S1 to S6) of the pattern extraction apparatus 1 in this embodiment. The pattern extraction method of the present invention consists of the following steps.
S1: Input the data X, the labels y, the number of feature dimensions k to extract, the weight α on intra-class similarity, the weight β on inter-class distance, the distance scale σ, and the number of iterations T.
S2: Initialize the intra-class similarity L^W, the inter-class distance L^B, and the compressed representation H.
(for t = 1 to T do)
S3: Determine whether t ≤ T.
S4: Calculate the projection matrix W.
S5: Calculate the compressed representation H.
(end for)
S6: Output the projection matrix W, the mean μ, and the compressed representation H.
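
The control flow of steps S3 to S6 can be sketched as a plain alternating loop. The function name extract_pattern and the callables update_W and update_H are hypothetical stand-ins for the projection matrix calculation (S4) and the compressed representation calculation (S5); the sketch shows only the iteration structure, not the calculations themselves.

```python
def extract_pattern(H0, T, update_W, update_H):
    """Alternate S4 and S5 for t = 1..T, then return (W, H) as in S6.

    H0 is the initial compressed representation from S2.
    update_W(H) returns a new W given H; update_H(W) returns a new H given W.
    """
    if T <= 0:
        raise ValueError("T must be > 0")
    W, H = None, H0
    t = 1
    while t <= T:        # S3: continue while t <= T
        W = update_W(H)  # S4: projection matrix from the current H
        H = update_H(W)  # S5: compressed representation from the new W
        t += 1
    return W, H          # S6: the last W and the last H


# Toy scalar stand-ins, used only to show the order of the calls.
W_final, H_final = extract_pattern(0, 3, lambda H: H + 1, lambda W: 2 * W)
```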

Each of the steps (S1 to S6) is described in detail below.

S1: (Data input)
The data input means 2 receives, from a network, a file, or the like, the data X = (x_1, …, x_n), the labels y ∈ {1, …, c} (c is the number of classes), and the parameters: the number of feature dimensions k to extract (0 < k < m), the weight α ≥ 0 on intra-class similarity, the weight β ≥ 0 on inter-class distance, the distance scale σ > 0, and the number of iterations T > 0.

S2: (Initialization)
[Management means 3]
The mean μ of all the data (x_1, …, x_n) is calculated by equation (9) below.

As shown in equation (10) below, the mean μ is subtracted from every data point (x_1, …, x_n).
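
A minimal sketch of this centering step follows, assuming that the mean is the ordinary arithmetic mean μ = (1/n) Σ x_i. The function name center is hypothetical, and the data is held as a list of n points rather than as an m × n matrix.

```python
def center(X):
    """Return (mu, centered) for X, a list of n data points of dimension m.

    mu is the arithmetic mean of the points; centered holds x_i - mu.
    """
    n, m = len(X), len(X[0])
    mu = [sum(x[d] for x in X) / n for d in range(m)]
    centered = [[x[d] - mu[d] for d in range(m)] for x in X]
    return mu, centered


mu, Xc = center([[1.0, 2.0], [3.0, 6.0]])
```

The mean is kept as an output because a new data point has to be shifted by the same μ before it is compressed.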

The projection matrix W is given size m × k and the compressed representation H size k × n. Here m and n are the numbers of rows and columns of the data X, and k is the number of feature dimensions specified at input.

The initialization routines of the intra-class similarity calculation means 4 and the inter-class distance calculation means 5 are called.

[Intra-class similarity calculation means 4]
The intra-class similarity matrix L^W is calculated by equation (11) below. In equation (11), D^W is the degree (frequency) matrix of the intra-class data and S^W is the intra-class inner product matrix. Here s_ij^W in equation (12) below denotes the (i, j) element of S^W, and d_ij^W in equation (13) below denotes the (i, j) element of D^W.

[Inter-class distance calculation means 5]
The inter-class similarity matrix L^B is calculated by equation (14) below. In equation (14), D^B is the degree (frequency) matrix of the inter-class data and S^B is the inter-class inner product matrix. Here s_ij^B in equation (15) below denotes the (i, j) element of S^B, and d_ij^B in equation (16) below denotes the (i, j) element of D^B.
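
Equations (11) to (16) are not reproduced in this text, so the following is only one plausible reading of the names L, D and S: a graph Laplacian L = D − S, where S holds pairwise weights for same-class pairs (for L^W) or different-class pairs (for L^B) and D is the diagonal matrix of row sums of S. The Gaussian weight exp(−‖x_i − x_j‖² / σ²) and the function name class_graph_matrices are assumptions made purely for illustration.

```python
import math


def class_graph_matrices(X, y, sigma, same_class):
    """Return (S, D, L) with L = D - S for a list of points X and labels y.

    same_class=True links pairs with equal labels (intra-class);
    same_class=False links pairs with different labels (inter-class).
    The Gaussian weight below is an assumed form, not the document's formula.
    """
    n = len(X)
    S = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j and (y[i] == y[j]) == same_class:
                d2 = sum((a - b) ** 2 for a, b in zip(X[i], X[j]))
                S[i][j] = math.exp(-d2 / sigma ** 2)
    # D is diagonal; each diagonal entry is the row sum of S.
    D = [[sum(S[i]) if i == j else 0.0 for j in range(n)] for i in range(n)]
    L = [[D[i][j] - S[i][j] for j in range(n)] for i in range(n)]
    return S, D, L


# Toy data (hypothetical): two coincident class-1 points and one class-2 point.
S_w, D_w, L_w = class_graph_matrices([[0.0], [0.0], [5.0]], [1, 1, 2], 1.0, True)
```

Under this reading every row of L sums to zero, which is the defining property of a graph Laplacian.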

[Management means 3]
The compressed representation H is initialized with random numbers.

S3: (Test of t ≤ T)
[Management means 3]
It is determined whether t ≤ T. That is, if the processing of S4 and S5 has been performed the specified number of iterations T, the process proceeds to S6; otherwise it proceeds to S4.

S4: (Calculation of the projection matrix W)
[Projection matrix calculation means 7]
Equation (17) below, which consists of the compression distortion J^C(W, H), the sum of intra-class distances J^W(H), and the sum of inter-class distances J^B(H), is optimized with respect to the projection matrix W. That is, differentiating equation (17) gives equation (18) below, and W is chosen so that equation (18) equals zero.

The compression distortion calculation means 6 is called to calculate ∂J^C/∂W in equation (18), the term that reduces the difference between the data and the extracted pattern.

[Compression distortion calculation means 6]
For the projection matrix calculation, the compression distortion J^C(W, H) of equation (19) below is differentiated, and the term ∂J^C/∂W, which reduces the difference between the data and the extracted pattern, is calculated by equation (20) below and output to the projection matrix calculation means 7.

[Projection matrix calculation means 7]
Equation (21) below follows from equations (18) and (20). The system of linear equations in W^T, the transpose of the projection matrix, given by equation (21) is solved by Householder transformation, Gaussian elimination, or the like to obtain the projection matrix W.

Next, each column vector of the projection matrix W is normalized by equation (22) below.
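
Equations (19) to (22) are not reproduced in this text. The sketch below assumes that the compression distortion is the squared Frobenius norm J^C(W, H) = ‖X − WH‖², in which case setting ∂J^C/∂W to zero gives the linear system (H H^T) W^T = H X^T in W^T, and that the normalization scales each column of W to unit Euclidean length. Both assumptions, and the function names, are illustrative only. The system is solved by Gaussian elimination, one of the methods named in the text.

```python
import math


def solve_linear(A, B):
    """Solve A Z = B by Gauss-Jordan elimination with partial pivoting.

    A is k x k and B is k x m (lists of rows); returns Z as a k x m list.
    """
    k = len(A)
    M = [list(A[i]) + list(B[i]) for i in range(k)]  # augmented matrix
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("singular system")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(k):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * b for a, b in zip(M[r], M[col])]
    return [[M[i][j] / M[i][i] for j in range(k, len(M[i]))] for i in range(k)]


def update_projection(X, H):
    """Return W (m x k) minimizing ||X - W H||^2, with unit-norm columns.

    X is m x n and H is k x n, both given as lists of rows.
    """
    m, n, k = len(X), len(X[0]), len(H)
    HHt = [[sum(H[a][j] * H[b][j] for j in range(n)) for b in range(k)]
           for a in range(k)]
    HXt = [[sum(H[a][j] * X[i][j] for j in range(n)) for i in range(m)]
           for a in range(k)]
    Wt = solve_linear(HHt, HXt)                       # W^T, size k x m
    W = [[Wt[a][i] for a in range(k)] for i in range(m)]
    for a in range(k):                                # normalize each column
        norm = math.sqrt(sum(W[i][a] ** 2 for i in range(m)))
        if norm > 0:
            for i in range(m):
                W[i][a] /= norm
    return W


# Toy data with m = 2, n = 3, k = 1: X equals (2, 1)^T times H exactly.
W = update_projection([[2.0, 4.0, 6.0], [1.0, 2.0, 3.0]], [[1.0, 2.0, 3.0]])
```

Because J^W(H) and J^B(H) do not depend on W, only the compression distortion term contributes to this update, which matches the text's derivation of equation (21) from equations (18) and (20).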

S5: (Calculation of the compressed representation H)
[Compressed representation calculation means 8]
Equation (23) below, which consists of the compression distortion J^C(W, H), the sum of intra-class distances J^W(H), and the sum of inter-class distances J^B(H), is optimized with respect to the compressed representation H. That is, differentiating equation (23) gives equation (24) below, and H is chosen so that equation (24) equals zero.

The compression distortion calculation means 6, the intra-class similarity calculation means 4, and the inter-class distance calculation means 5 are called to calculate the following terms of equation (24): ∂J^C/∂H, which reduces the difference between the data and the extracted pattern; ∂J^W/∂H, which reduces the sum of intra-class distances J^W, that is, increases the similarity of data within a class; and ∂J^B/∂H, which increases the distance between data of different classes.

[Compression distortion calculation means 6]
For the calculation of the compressed representation H, the compression distortion J^C(W, H) of equation (19) is differentiated, and the term ∂J^C/∂H, which reduces the difference between the data and the extracted pattern, is calculated by equation (25) below and output to the compressed representation calculation means 8.

[Intra-class similarity calculation means 4]
For the calculation of the compressed representation H, the sum of intra-class distances J^W(H) of equation (26) below is differentiated, and the term ∂J^W/∂H, which increases the similarity of data within a class, is calculated by equation (27) below and output to the compressed representation calculation means 8.

[Inter-class distance calculation means 5]
For the calculation of the compressed representation H, the sum of inter-class distances J^B(H) of equation (28) below is differentiated, and the term ∂J^B/∂H, which increases the distance between data of different classes, is calculated by equation (29) below and output to the compressed representation calculation means 8.

[Compressed representation calculation means 8]
Equation (30) below follows from equations (24), (25), (27), and (29). The system of linear equations in vec(H) given by equation (30) is solved by Householder transformation, Gaussian elimination, or the like to obtain the compressed representation H. Here A in equation (30) is given by equation (31) below and C by equation (32) below.

vec(·) denotes the column vector obtained by stacking the column vectors of a matrix vertically, as shown in equations (33) and (34) below.
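
The vec(·) operation described here is a column-major flattening. A minimal sketch follows; the helper unvec, which reshapes a solved vec(H) back into a matrix, is an added convenience and is not mentioned in the text.

```python
def vec(M):
    """Stack the columns of M (a list of rows) into one flat list."""
    rows, cols = len(M), len(M[0])
    return [M[i][j] for j in range(cols) for i in range(rows)]


def unvec(v, rows, cols):
    """Inverse of vec: rebuild a rows x cols matrix from a column-major list."""
    return [[v[j * rows + i] for j in range(cols)] for i in range(rows)]


flat = vec([[1, 2], [3, 4]])  # columns (1, 3) and (2, 4) stacked in order
```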

After the processing steps S4 and S5 have been repeated T times, the process proceeds to S6.

S6: (Output of results)
[Result output means 9]
The most recently normalized projection matrix W, the mean μ, and the most recently obtained compressed representation H are output to a network, a file, or the like.

[Specific example]
Next, the pattern extraction method of this embodiment is described using a specific example.

S1: (Data input)
The data input means 2 receives the data X shown in FIG. 4 and the labels y. The dimension of the data X is m = 2 and the number of data points is n = 200. The other parameters (the number of feature dimensions k specified at input, the weight α on intra-class similarity, the weight β on inter-class distance, the distance scale σ, and the number of iterations T) are input as given by equations (35) to (39) below.

S2: (Initialization)
The management means 3 calculates the mean μ of the data (x_1, …, x_n) and subtracts μ from every data point (x_1, …, x_n). In this example, the mean μ is given by equation (40) below.

The management means 3 sets the size of the projection matrix W to 2 (m) × 1 (k) and the size of the compressed representation H to 1 (k) × 200 (n). It then calls the initialization routines of the intra-class similarity calculation means 4 and the inter-class distance calculation means 5 to have them calculate the intra-class similarity matrix L^W and the inter-class distance matrix L^B. It also initializes the compressed representation H with random numbers.

S3: (Test of t ≤ T)
The management means 3 enters the loop (S3 to S5). For t = 1, since t (= 1) < T (= 3), the process proceeds to S4 and the management means 3 calls the projection matrix calculation means 7.

S4: (Calculation of the projection matrix W)
The projection matrix calculation means 7 calls the compression distortion calculation means 6 to evaluate equation (20), obtains the projection matrix W, and normalizes it. The result is given by equation (41) below, and this projection matrix W is shown in FIG. 5.

S5: (Calculation of the compressed representation H)
The management means 3 calls the compressed representation calculation means 8. The compressed representation calculation means 8 calls the compression distortion calculation means 6 to evaluate equation (25), the intra-class similarity calculation means 4 to evaluate equation (27), and the inter-class distance calculation means 5 to evaluate equation (29), and then obtains the compressed representation H.

S3: For t = 2, since t (= 2) < T (= 3), the process proceeds to S4 and S5, and the management means 3 calls the projection matrix calculation means 7 and the compressed representation calculation means 8.

S4, S5: The projection matrix W and the compressed representation H are calculated in the same way as for t = 1. The resulting projection matrix W takes the value given by equation (42) below and is shown in FIG. 6.

S3: For t = 3, since t (= 3) ≤ T (= 3), the management means 3 calls the projection matrix calculation means 7 and the compressed representation calculation means 8, and the process proceeds to S4 and S5.

S4, S5: The projection matrix calculation means 7 and the compressed representation calculation means 8 calculate the projection matrix W and the compressed representation H in the same way as for t = 1 and t = 2. The resulting projection matrix W takes the value given by equation (43) below and is shown in FIG. 7.

S3: For t = 4, since t (= 4) > T (= 3), the management means 3 ends the loop and the process proceeds to S6.

S6: The result output means 9 outputs the projection matrix W normalized at t = 3, the mean μ, and the compressed representation H obtained at t = 3 to a network or a file.

FIG. 8 shows the compressed representation H obtained as a result of the above processing (S1 to S6). Class 1 and class 2 are separated along the x-axis. (The y-axis values are offset so that the results for class 1 and class 2 are easy to see.)

For new data x, the compressed representation is obtained by solving equation (44) below for h.
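
Equation (44) is not reproduced in this text. As a hedged illustration only, the sketch below assumes that h is chosen to minimize the compression distortion ‖(x − μ) − W h‖² for the new point, and it handles the k = 1 case of the specific example, where W is a single column w. The function name compress_new_point is hypothetical.

```python
def compress_new_point(x, mu, w):
    """Return the scalar h minimizing ||(x - mu) - w * h||^2 (k = 1 only).

    x is the new data point, mu the stored mean, w the single column of W.
    """
    centered = [a - b for a, b in zip(x, mu)]
    ww = sum(c * c for c in w)
    if ww == 0:
        raise ValueError("projection vector must be nonzero")
    # Least-squares solution of w * h = x - mu.
    return sum(c * d for c, d in zip(w, centered)) / ww


h = compress_new_point([3.0, 4.0], [1.0, 1.0], [1.0, 0.0])
```

When w has unit length, as it does after the normalization in S4, this reduces to the inner product of w with the centered point.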

As described above, in this embodiment the intra-class similarity calculation means 4 and the inter-class distance calculation means 5 correct the pattern so that data within the same class are placed close together and data of different classes are placed far apart. This makes the extracted patterns easy to separate between classes.

The compression distortion calculation means 6 reduces the difference (compression distortion) between the given data and the extracted pattern, which serves to retain as much of the information in the original data as possible. This makes it possible to extract features with more dimensions than the number of classes effectively.

As a result, the number of feature dimensions to extract can be varied through the parameter k ≤ m, and because the compression distortion calculation means 6 keeps the difference between the extracted pattern and the original data X small, effective features are obtained.

Furthermore, the data of each class is not assumed to follow a Gaussian distribution, so classification is possible even when the data of a class is multimodal.

Moreover, the method does not depend on the class means, so classes can be separated even when their means are equal.

The present invention can also be configured as a program that causes a computer to function as some or all of the means 2 to 9 of the pattern extraction apparatus 1. In that case, the computer executes all or some of the steps S1 to S6.

The program can be provided over a network, for example through a web site or by e-mail. The program can also be recorded on a recording medium such as a CD-ROM, DVD-ROM, CD-R, CD-RW, DVD-R, DVD-RW, MO, HDD, or Blu-ray Disc (registered trademark) for storage and distribution. The recording medium is read using a recording medium drive, and the program code itself carries out the processing of the embodiment, so the recording medium also implements the present invention.

DESCRIPTION OF SYMBOLS
1 … Pattern extraction apparatus
2 … Data input means
3 … Management means
4 … Intra-class similarity calculation means
5 … Inter-class distance calculation means
6 … Compression distortion calculation means
7 … Projection matrix calculation means
8 … Compressed representation calculation means
9 … Result output means

Claims

1. A pattern extraction apparatus for extracting, from input data, a pattern for performing class classification, comprising:
intra-class similarity calculation means for calculating an intra-class similarity of the input data;
inter-class distance calculation means for calculating an inter-class distance of the input data;
compression distortion calculation means for calculating a compression distortion that is the difference between the input data and a pattern that is a compressed representation of the data;
projection matrix calculation means for obtaining, from the data and the compressed representation of the data, a projection matrix that reduces the compression distortion, and for normalizing the projection matrix; and
compressed representation calculation means for calculating, from the data and the projection matrix, a compressed representation of the data that reduces the compression distortion and increases the intra-class similarity and the inter-class distance,
wherein the apparatus extracts, as the pattern for performing class classification from the input data, the compressed representation that reduces the compression distortion and increases the intra-class similarity and the inter-class distance.

The pattern extraction apparatus according to claim 1, wherein:
the intraclass similarity calculation means calculates a derivative of the sum of the intraclass distances;
the interclass distance calculation means calculates a derivative of the sum of the interclass distances;
the compression distortion calculation means calculates a derivative of the compression distortion;
the projection matrix calculation means obtains an expression that optimizes, with respect to the projection matrix, an expression including the compression distortion, the sum of the intraclass distances, and the sum of the interclass distances, derives from it, based on the derivative of the compression distortion, simultaneous linear equations for the projection matrix in terms of the data and the compressed representation, and obtains the projection matrix from the simultaneous linear equations; and
the compressed representation calculation means obtains an expression that optimizes, with respect to the compressed representation, the expression including the compression distortion, the sum of the intraclass distances, and the sum of the interclass distances, derives from it, based on the derivative of the compression distortion, the derivative of the sum of the intraclass distances, and the derivative of the sum of the interclass distances, simultaneous linear equations for the compressed representation in terms of the data and the projection matrix, and obtains the compressed representation from the simultaneous linear equations.

The pattern extraction apparatus according to claim 2, wherein the calculation of the derivative of the sum of the intraclass distances by the intraclass similarity calculation means, the calculation of the derivative of the sum of the interclass distances by the interclass distance calculation means, the calculation of the derivative of the compression distortion by the compression distortion calculation means, the processing of the projection matrix calculation means, and the processing of the compressed representation calculation means are repeated a plurality of times, and the pattern for classification is extracted based on the projection matrix normalized last and the compressed representation obtained last.
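
The repeated alternation described in this claim can be sketched as follows. The objective used here is an assumption made for illustration, not the expression of the specification: E(W, Z) = ||X - WZ||^2 + alpha * (sum of squared intraclass distances of Z) - beta * (sum of squared interclass distances of Z), with sums taken over ordered pairs of columns. The names `pair_laplacian`, `alternate`, `alpha`, and `beta`, the random initialization, and the use of QR orthonormalization as the normalization of the projection matrix are likewise hypothetical. Setting each derivative to zero gives one linear system for W and one for Z.

```python
import numpy as np

def pair_laplacian(mask):
    """Graph Laplacian L = D - A of a symmetric boolean pair-selection matrix.

    The sum over ordered pairs (i, j) with mask[i, j] set of
    ||z_i - z_j||^2 equals 2 * trace(Z L Z^T).
    """
    A = mask.astype(float)
    return np.diag(A.sum(axis=1)) - A

def alternate(X, y, k, alpha, beta, n_iter=20, seed=0):
    """Alternate W-steps and Z-steps on the assumed objective.

    X: (m, n) data, one sample per column. y: (n,) labels.
    Returns W (m, k) with orthonormal columns and Z (k, n).
    beta must be small enough that I + 2G below stays positive definite.
    """
    m, n = X.shape
    same = (y[:, None] == y[None, :]) & ~np.eye(n, dtype=bool)
    diff = y[:, None] != y[None, :]
    G = alpha * pair_laplacian(same) - beta * pair_laplacian(diff)
    rng = np.random.default_rng(seed)
    Z = rng.standard_normal((k, m)) @ X      # assumed initialization
    for _ in range(n_iter):
        # W-step: dE/dW = 0  ->  (Z Z^T) W^T = Z X^T, then normalize W.
        W = np.linalg.solve(Z @ Z.T, Z @ X.T).T
        W, _ = np.linalg.qr(W)               # orthonormal columns (assumption)
        # Z-step: dE/dZ = 0 with W^T W = I  ->  (I + 2G) Z^T = X^T W.
        Z = np.linalg.solve(np.eye(n) + 2.0 * G, X.T @ W).T
    return W, Z

# Hypothetical data: two classes of 20 samples each in 5 dimensions.
rng = np.random.default_rng(1)
y = np.repeat([0, 1], 20)
X = rng.standard_normal((5, 40)) + 2.0 * y
n = X.shape[1]
W, Z = alternate(X, y, k=2, alpha=0.1 / n, beta=0.1 / n)
print(W.shape, Z.shape)
```

Because the largest eigenvalue of a graph Laplacian on n vertices is at most n, choosing beta below 1/(2n) keeps the Z-step system positive definite in this sketch. The last W and the last Z are returned, mirroring the claim's use of the projection matrix normalized last and the compressed representation obtained last.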

The pattern extraction apparatus according to claim 2, wherein the projection matrix calculation means and the compressed representation calculation means solve the simultaneous linear equations for the projection matrix and the simultaneous linear equations for the compressed representation by Householder transformation or Gaussian elimination.
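
The two solution techniques named in this claim can be sketched for a generic system AX = B. The function names `solve_gaussian` and `solve_householder` and the sample matrices are assumptions for illustration; the claim does not specify pivoting or any other implementation detail, and both sketches assume A is square and nonsingular.

```python
import numpy as np

def solve_gaussian(A, B):
    """Solve A X = B by Gaussian elimination with partial pivoting."""
    A = np.array(A, dtype=float)
    B = np.array(B, dtype=float)
    n = A.shape[0]
    M = np.hstack([A, B.reshape(n, -1)])     # augmented matrix [A | B]
    for k in range(n):
        p = k + np.argmax(np.abs(M[k:, k]))  # row of the largest pivot
        if M[p, k] == 0.0:
            raise np.linalg.LinAlgError("matrix is singular")
        M[[k, p]] = M[[p, k]]                # swap rows k and p
        M[k + 1:] -= np.outer(M[k + 1:, k] / M[k, k], M[k])
    X = np.zeros((n, M.shape[1] - n))
    for k in range(n - 1, -1, -1):           # back substitution
        X[k] = (M[k, n:] - M[k, k + 1:n] @ X[k + 1:]) / M[k, k]
    return X.reshape(B.shape)

def solve_householder(A, B):
    """Solve A X = B by reducing A to upper triangular form
    with Householder reflections, applied to B at the same time."""
    R = np.array(A, dtype=float)
    B = np.array(B, dtype=float)
    n = R.shape[0]
    C = B.reshape(n, -1).copy()
    for k in range(n - 1):
        v = R[k:, k].copy()
        v[0] += np.copysign(np.linalg.norm(v), v[0])
        norm_v = np.linalg.norm(v)
        if norm_v == 0.0:
            continue                         # column is already zero
        v /= norm_v
        R[k:] -= 2.0 * np.outer(v, v @ R[k:])   # apply H = I - 2 v v^T
        C[k:] -= 2.0 * np.outer(v, v @ C[k:])
    X = np.zeros_like(C)
    for k in range(n - 1, -1, -1):           # back substitution on R X = C
        X[k] = (C[k] - R[k, k + 1:] @ X[k + 1:]) / R[k, k]
    return X.reshape(B.shape)

# Hypothetical well-conditioned system with three right-hand sides.
rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4)) + 4.0 * np.eye(4)
B = rng.standard_normal((4, 3))
X_gauss = solve_gaussian(A, B)
X_house = solve_householder(A, B)
print(np.allclose(A @ X_gauss, B), np.allclose(A @ X_house, B))
```

In practice a library routine such as `numpy.linalg.solve` would normally be used instead of hand-written elimination; the explicit loops are shown only to make the two techniques visible.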

A pattern extraction method for extracting, from input data, a pattern for classification, comprising:
an intraclass similarity calculation step in which intraclass similarity calculation means calculates an intraclass similarity of the input data;
an interclass distance calculation step in which interclass distance calculation means calculates an interclass distance of the input data;
a compression distortion calculation step of calculating a compression distortion, which is the difference between the input data and a pattern that is a compressed representation of the data;
a projection matrix calculation step in which projection matrix calculation means obtains, from the data and the compressed representation of the data, a projection matrix that reduces the compression distortion, and normalizes the projection matrix; and
a compressed representation calculation step in which compressed representation calculation means calculates, from the data and the projection matrix, a compressed representation of the data such that the compression distortion is reduced and the intraclass similarity and the interclass distance are increased,
wherein the method extracts, from the input data, the pattern for classification, namely the compressed representation that reduces the compression distortion and increases the intraclass similarity and the interclass distance.

The pattern extraction method according to claim 5, wherein:
the projection matrix calculation step includes:
the compression distortion calculation means calculating a derivative of the compression distortion; and
the projection matrix calculation means obtaining an expression that optimizes, with respect to the projection matrix, an expression including the compression distortion, the sum of the intraclass distances, and the sum of the interclass distances, deriving from it, based on the derivative of the compression distortion, simultaneous linear equations for the projection matrix in terms of the data and the compressed representation, and obtaining the projection matrix from the simultaneous linear equations; and
the compressed representation calculation step includes:
the intraclass similarity calculation means calculating a derivative of the sum of the intraclass distances;
the interclass distance calculation means calculating a derivative of the sum of the interclass distances;
the compression distortion calculation means calculating a derivative of the compression distortion; and
the compressed representation calculation means obtaining an expression that optimizes, with respect to the compressed representation, the expression including the compression distortion, the sum of the intraclass distances, and the sum of the interclass distances, deriving from it, based on the derivative of the compression distortion, the derivative of the sum of the intraclass distances, and the derivative of the sum of the interclass distances, simultaneous linear equations for the compressed representation in terms of the data and the projection matrix, and obtaining the compressed representation from the simultaneous linear equations.

The pattern extraction method according to claim 5 or 6, wherein the projection matrix calculation step and the compressed representation calculation step are repeated a plurality of times, and the pattern for classification is extracted based on the projection matrix normalized last and the compressed representation calculated last.

The pattern extraction method according to claim 6, wherein, in the projection matrix calculation step and the compressed representation calculation step, the simultaneous linear equations for the projection matrix and the simultaneous linear equations for the compressed representation are solved by Householder transformation or Gaussian elimination.

A program for causing a computer to function as each of the means constituting the pattern extraction apparatus according to any one of claims 1 to 4.