JPH0326877B2

JPH0326877B2 -

Info

Publication number: JPH0326877B2
Application number: JP59123175A
Authority: JP
Inventors: Tozen Hai; Yukikazu Kaburayama; Eiichiro Yamamoto
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1984-06-15
Filing date: 1984-06-15
Publication date: 1991-04-12
Also published as: JPS6121583A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は認識速度の向上を図つたパターン認識
装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a pattern recognition device that improves recognition speed.

パターン認識装置たとえば文字認識装置におい
ては、通常、カテゴリ（文字の種別）毎に標準文
字パターンから抽出した特徴を特徴辞書として備
え、入力文字パターンすなわち被認識文字パター
ンから抽出した特徴と最も高い類似度が得られる
カテゴリを前記特徴辞書の中から選び出し、これ
をもつて被認識文字パターンのカテゴリとして決
定している。 Pattern Recognition Device For example, a character recognition device usually has a feature dictionary containing features extracted from standard character patterns for each category (type of character), and selects features that have the highest degree of similarity to the features extracted from the input character pattern, that is, the character pattern to be recognized. A category from which the character pattern can be obtained is selected from the feature dictionary, and this is determined as the category of the character pattern to be recognized.

また前記の特徴として、例えば漢字等において
は、投影の形状・閉ループの数・分離ストローク
の数など複数種の特徴が用いられる。 Furthermore, as the above-mentioned characteristics, for example, in Chinese characters, a plurality of types of characteristics are used, such as the shape of projection, the number of closed loops, and the number of separation strokes.

ところで、前記最も高い類似度が得られるカテ
ゴリを選び出すために、被認識文字パターンから
抽出した特徴と特徴辞書に記憶されるすべてのカ
テゴリ毎の特徴との類似度を計算をおこなうとす
ると、例えば特徴の種類が10種、またカテゴリが
2000種の場合には、類似度の計算を20000回もお
こなう必要がある。 By the way, in order to select the category with the highest degree of similarity, if we calculate the degree of similarity between the features extracted from the character pattern to be recognized and the features for each category stored in the feature dictionary, for example, the feature There are 10 types and categories.
In the case of 2,000 species, it is necessary to perform similarity calculations 20,000 times.

このため、類似度計算の回数を減らすために、
認識を一次認識と二次認識との２段回に分け、一
次認識では、一部の特徴のみを用いた粗い認識に
よつて、類似度の高い複数のカテゴリを侯補とし
て選択し、二次認識では、一次認識で選択された
複数のカテゴリの中から、すべての特徴を用いた
精密な認識によつて、入力パターンと最も類似度
の高いカテゴリを選ぶという方式のパターン認識
装置が提案されている。 Therefore, in order to reduce the number of similarity calculations,
Recognition is divided into two stages: primary recognition and secondary recognition. In the primary recognition, multiple categories with high similarity are selected as candidates through rough recognition using only some features, and the secondary recognition For recognition, a pattern recognition device has been proposed that selects the category with the highest degree of similarity to the input pattern from among multiple categories selected in primary recognition through precise recognition using all features. There is.

この際、一次認識における複数のカテゴリの選
択を能率的におこなうことによつて、認識速度を
更に向上することが望まれる。 At this time, it is desired to further improve recognition speed by efficiently selecting a plurality of categories in primary recognition.

[Conventional technology]

第２図は入力パターンから抽出した特徴と特徴
辞書に記憶する特徴との類似度計算の回数の減少
を図つたパターン認識装置の従来例である。 FIG. 2 shows a conventional example of a pattern recognition device that aims to reduce the number of similarity calculations between features extracted from an input pattern and features stored in a feature dictionary.

図において、１はカテゴリ毎に標準文字パター
ンから抽出した所定の複数種類の特徴を特徴辞書
として備え記憶する特徴辞書、２は入力パターン
から所定の前記と同じ複数種類の特徴を抽出する
特徴抽出部、３は特徴抽出部２によつて抽出され
た入力パターンの特徴の中の所定の一部の特徴
と、特徴辞書１に記憶されるカテゴリ毎の標準特
徴の中の前記と同じ一部の特徴との類似度を求め
る第一の認識部（一次認識）、４は第一の認識部
３によつて得られた類似度の順に上位所定位まで
のカテゴリを選択する選択部、５は選択部４によ
つて選択された各カテゴリの標準特徴を特徴辞書
１の中から取出して記憶するバツフア、６は特徴
抽出部２によつて抽出された入力パターンの特徴
と、バツフア５に記憶される所定数のカテゴリ毎
の標準特徴との類似度を求め、類似度が最も高い
カテゴリを入力パターンのカテゴリとして決定す
る第二の認識部（二次認識）である。 In the figure, 1 is a feature dictionary that stores a plurality of predetermined features extracted from standard character patterns for each category as a feature dictionary, and 2 is a feature extraction unit that extracts the same plurality of predetermined features from an input pattern. , 3 are a predetermined part of the features of the input pattern extracted by the feature extractor 2 and the same part of the standard features for each category stored in the feature dictionary 1. 4 is a selection unit that selects categories up to a predetermined top rank in the order of similarity obtained by the first recognition unit 3; 5 is a selection unit 4 extracts and stores the standard features of each category selected from the feature dictionary 1; 6 extracts and stores the standard features of each category selected by the feature dictionary 1; This is a second recognition unit (secondary recognition) that calculates the degree of similarity with the standard feature for each category of numbers, and determines the category with the highest degree of similarity as the category of the input pattern.

上記構成のものにおいては、たとえば、特徴辞
書１に記憶するカテゴリの数を2000種類、特徴辞
書１に記憶するカテゴリ毎の標準特徴の種類、す
なわち特徴抽出部２において抽出する特徴の種類
を10種類、第一の認識部３において用いる特徴の
数を１種類、第二の認識部６において用いる特徴
の数を10種類、選択部４において選択するカテゴ
リを上位20位までとすると、第一の認識部３では
類似度の計算を2000回、また第二の認識部６では
類似度の計算を（20×10）回、合わせて2200回の
類似度計算をおこなうことになり、前記20000回
に比べると計算回数を大幅に減少できることが分
かる。 In the above configuration, for example, the number of categories stored in the feature dictionary 1 is 2000, and the types of standard features for each category stored in the feature dictionary 1, that is, the types of features extracted in the feature extraction unit 2, are 10 types. , the number of features used in the first recognition unit 3 is 1 type, the number of features used in the second recognition unit 6 is 10 types, and the categories selected in the selection unit 4 are the top 20, then the first recognition Part 3 calculates the similarity 2000 times, and the second recognition part 6 calculates the similarity (20×10) times, for a total of 2200 times, compared to the 20000 times described above. It can be seen that the number of calculations can be significantly reduced.

[Problem that the invention seeks to solve]

しかし前記構成のものにおいては、選択部４に
おいては、2000種類のカテゴリに対して第一の認
識部で求められた類似度の中から、ソーテイング
処理等によつて上位20位までを選びだす必要があ
り、そのための処理に長時間を要するという問題
点がある。 However, in the configuration described above, the selection unit 4 needs to select the top 20 by sorting processing etc. from among the similarities determined by the first recognition unit for 2000 categories. There is a problem that the processing for this requires a long time.

[Means for solving problems]

上記の如く問題点は本発明によりカテゴリ毎の
標準特徴を記憶する第一の特徴辞書と、カテゴリ
毎にそれぞれのカテゴリとそれに類似するカテゴ
リを含むカテゴリ群及びこのカテゴリ群の特徴の
平均値を記憶する第二の特徴辞書と、入力パター
ンの特徴を抽出する特徴抽出部と、前記抽出され
た入力パターンの特徴と第二の特徴辞書に記憶さ
れる特徴の平均値との類似度によつて入力パター
ンの属するカテゴリ群を決定する第一の認識部
と、前記抽出された入力パターンの特徴と第一の
特徴辞書に記憶される標準特徴のうち前記第一の
認識部によつて決定されたカテゴリ群に含まれる
カテゴリの標準特徴との類似度によつて前記入力
パターンのカテゴリを決定する第二の認識部とを
備えてなることを特徴とするパターン認識装置に
よつて解決される。 As mentioned above, the problem is that the present invention stores a first feature dictionary that stores standard features for each category, a category group that includes each category and categories similar to it, and the average value of the features of this category group. a second feature dictionary that extracts the features of the input pattern; and a feature extractor that extracts the features of the input pattern; a first recognition unit that determines the category group to which the pattern belongs; and a category determined by the first recognition unit among the extracted features of the input pattern and standard features stored in the first feature dictionary. The problem is solved by a pattern recognition device characterized by comprising: a second recognition unit that determines the category of the input pattern based on the degree of similarity with the standard feature of the category included in the group.

[Effect]

すなわち本発明では、カテゴリ毎の標準特徴を
記憶する第一の特徴辞書のほかに、カテゴリ毎に
それぞれのカテゴリに類似するカテゴリとこれら
カテゴリ群毎の平均特徴とを記憶する第二の特徴
辞書を設け、入力パターンの特徴と第二の特徴辞
書に記憶される平均特徴との類似度によつて、入
力パターンの属するカテゴリ群を決定することに
よつて、たとえば、第二の特徴辞書に記憶するカ
テゴリ群毎のカテゴリの数を10個とすれば、第一
の認識部で求められた類似度の中から、上位２位
までのカテゴリ群を選びだすことによつて、入力
パターンに類似する上位約20位までのカテゴリが
選択でき、前記従来例に比しソーテイングに要す
る時間を大幅に減少することができる。 That is, in the present invention, in addition to a first feature dictionary that stores standard features for each category, a second feature dictionary that stores categories similar to each category and average features for each category group is provided for each category. For example, by determining the category group to which the input pattern belongs based on the degree of similarity between the features of the input pattern and the average feature stored in the second feature dictionary, the category group is stored in the second feature dictionary. Assuming that the number of categories in each category group is 10, the top two categories that are similar to the input pattern are selected from the similarity determined by the first recognition unit. Up to about 20 categories can be selected, and the time required for sorting can be significantly reduced compared to the conventional example.

〔Example〕

次に本発明の要旨を第１図に示す実施例によつ
て具体的に説明する。 Next, the gist of the present invention will be specifically explained using an embodiment shown in FIG.

第１図は本発明一実施例の構成を示すブロツク
図であり、第２図と共通する符号は同一の対象物
を指すほか、７はカテゴリ毎にそれぞれのカテゴ
リに類似するカテゴリとこれらカテゴリ群毎の平
均特徴とを記憶する第二の特徴辞書である。 FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention. Reference numerals common to those in FIG. 2 refer to the same objects, and 7 indicates a category similar to each category and a group of these categories. This is a second feature dictionary that stores average features for each.

また３′は特徴抽出部２によつて抽出された入
力パターンの特徴と第二の特徴辞書に７記憶され
る平均特徴との類似度によつて前記入力パターン
の属するカテゴリ群を決定する第一の認識部、ま
た４′は第一の認識部３′によつて得られた類似度
の順に上位所定位までのカテゴリ群を選択する選
択部、５′は選択部４′によつて選択されたカテゴ
リ群に含まれる各カテゴリの標準特徴を第一の特
徴辞書１の中から取出して記憶するバツフアであ
る。 Further, 3' is a first part that determines the category group to which the input pattern belongs based on the degree of similarity between the features of the input pattern extracted by the feature extractor 2 and the average features stored in the second feature dictionary. 4' is a selection unit that selects categories up to a predetermined top rank in the order of similarity obtained by the first recognition unit 3', and 5' is a selection unit that selects categories selected by the selection unit 4'. This is a buffer that extracts standard features of each category included in a category group from the first feature dictionary 1 and stores them.

なお第二の特徴辞書をさらに説明すると、認識
すべきカテゴリをC₁，C₂，……C_oとしたとき各
カテゴリについてそのカテゴリの類似カテゴリを
調べる。即ちカテゴリCiのｋ個の類似カテゴリを
Ci(1)、Ci(2)、……Ci(k)としこれを総べてのｉにつ
いて決定する。 To further explain the second feature dictionary, when the categories to be recognized are C ₁ , C ₂ , . . . C _o , similar categories to that category are checked for each category. That is, k similar categories of category Ci are
Let Ci(1), Ci(2), . . . Ci(k) be determined for all i.

ついでカテゴリCiについての大分類辞書にカテ
ゴリCiとカテゴリCi(1)、Ci(2)、……Ci(k)の特徴の
平均値を用いる。 Next, the average value of the features of category Ci and categories Ci(1), Ci(2), . . . Ci(k) is used in the major classification dictionary for category Ci.

このことはCiとその類似カテゴリを全部平均し
たような仮のカテゴリ例えばカテゴリGiとよぶ
ことにすると、カテゴリGiの辞書を作成するこ
とに相当する。 This corresponds to creating a dictionary of category Gi, which is a temporary category that is the average of all Ci and its similar categories, for example, category Gi.

かかる大分類辞書を第３図に示す。この大分類
辞書が本発明での第二の特徴辞書である。 Such a major classification dictionary is shown in FIG. This major classification dictionary is the second feature dictionary in the present invention.

以上のような構成において、第二の特徴辞書７
に記憶するカテゴリ群毎のカテゴリの数を10個と
するほか、前記実施例と同様に、同一の特徴辞書
１に記憶するカテゴリの数を2000種類、第一の特
徴辞書１に記憶するカテゴリ毎の標準特徴の種
類、すなわち特徴抽出部２において抽出する特徴
の種類を10種類、第一の認識部３′において用い
る特徴の数を１種類、第二の認識部６において用
いる特徴の数を10種類とすると、第一の認識部
３′では類似度の計算を従来例と同様に2000回お
こない、この結果第３図の例でカテゴリCi即ち
Gi辞書が選ばれたら、これはGi辞書作成に関与
したCiの類似のカテゴリとCiの計（ｋ＋１）個、
即ち10個のカテゴリが選択されたことになる。こ
れは一つの大分類辞書の選択によつて10個のカテ
ゴリの選択を行なつたのと等価の効果を示す。こ
のようにして選び出されたGi辞書はその中から
上位２位までのカテゴリ群を選びだすことによつ
て、入力パターンに類似する上位約20位までのカ
テゴリが選択でき、このあと第二の認識部６にお
ける約（20×10）回の類似度計算によつて入力パ
ターンのカテゴリを決定することができる。 In the above configuration, the second feature dictionary 7
In addition to setting the number of categories stored in each category group to 10, as in the above embodiment, the number of categories stored in the same feature dictionary 1 is set to 2000 types, and the number of categories stored in the first feature dictionary 1 is set to 10. The types of standard features, that is, the types of features extracted in the feature extraction unit 2 are 10 types, the number of features used in the first recognition unit 3' is 1 type, and the number of features used in the second recognition unit 6 is 10 types. The first recognition unit 3' calculates the similarity 2000 times in the same way as in the conventional example, and as a result, in the example of Fig. 3, the category Ci, that is,
Once a Gi dictionary is selected, it is a total of (k+1) similar categories and Cis that were involved in creating the Gi dictionary,
In other words, 10 categories have been selected. This shows an effect equivalent to selecting 10 categories by selecting one major classification dictionary. By selecting the top two categories from the Gi dictionary selected in this way, it is possible to select the top 20 categories that are similar to the input pattern. The category of the input pattern can be determined by performing similarity calculations approximately (20×10) times in the recognition unit 6.

すなわち類似度計算の回数は従来例とほぼ同数
であるが、前記従来例に比し、選択部４′におい
てソーテイングに要する時間を大幅に減少するこ
とができる。 That is, although the number of similarity calculations is approximately the same as in the conventional example, the time required for sorting in the selection section 4' can be significantly reduced compared to the conventional example.

〔Effect of the invention〕

以上説明したように、本発明によればパターン
認識装置の認識速度を大幅に向上することができ
る。 As described above, according to the present invention, the recognition speed of a pattern recognition device can be significantly improved.

[Brief explanation of drawings]

第１図は本発明一実施例の構成図、第２図は従
来例の構成図を示す。第３図は本発明における第
２の特徴辞書を示す。図中、１は第一の特徴辞書、２は特徴抽出部、
３′は第一の認識部、６は第二の認識部、７は第
二の特徴辞書である。 FIG. 1 shows a configuration diagram of an embodiment of the present invention, and FIG. 2 shows a configuration diagram of a conventional example. FIG. 3 shows a second feature dictionary in the present invention. In the figure, 1 is the first feature dictionary, 2 is the feature extraction unit,
3' is a first recognition section, 6 is a second recognition section, and 7 is a second feature dictionary.

Claims

[Claims]

1. A first feature dictionary that stores standard features for each category; a second feature dictionary that stores, for each category, a category group including each category and categories similar thereto, and the average value of the features of this category group; a feature extraction unit that extracts features of the input pattern, and a category group to which the input pattern belongs is determined based on the degree of similarity between the extracted features of the input pattern and an average value of features stored in a second feature dictionary. a first recognition unit, and standard features of categories included in the category group determined by the first recognition unit among the standard features stored in the extracted input pattern features and the first feature dictionary; a second recognition unit that determines a category of the input pattern based on the degree of similarity between the input pattern and the input pattern.