JP4148966B2

JP4148966B2 - Pattern matching apparatus, program for realizing the same, and recording medium

Info

Publication number: JP4148966B2
Application number: JP2005307248A
Authority: JP
Inventors: 至幸小山
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2005-10-21
Filing date: 2005-10-21
Publication date: 2008-09-10
Anticipated expiration: 2025-10-21
Also published as: JP2007115112A

Description

本発明は、読み取った文字画像を文字として認識するパターン照合装置及びそれを実現するためのプログラム、記録媒体に関する。 The present invention relates to a pattern matching device that recognizes a read character image as a character, a program for realizing the same, and a recording medium.

従来、漢字を含む文字を対象とする文字認識において、様々な識別法が提案されているが、漢字は１字１字のパターンが複雑で、字種も多く、更に印刷文字でも明朝体やゴシック体などがあり、手書き文字まで含めると機械で認識させることが非常に難しい。このため、個々の装置において、マッチング法や構造解析などの種々の方法の組み合わせによって識別精度向上及び処理速度の高速化のための工夫が試みられている。 Conventionally, various recognition methods have been proposed for character recognition including characters including kanji, but kanji has a complicated pattern of one character and many types of characters, and even printed characters can be used in the Mincho style. There are Gothic fonts, etc., and it is very difficult to make it recognized by a machine if even handwritten characters are included. For this reason, in each device, attempts have been made to improve the identification accuracy and increase the processing speed by combining various methods such as a matching method and structural analysis.

従来の文字画像の識別法には、一般にパターン照合法と構造化解析法とがある。印刷文字の識別にはパターン照合法が用いられることが多い。パターン照合法は、画像から抽出した文字パターンを、辞書が持つ標準パターンとマッチングさせ、文字を認識する。このマッチングには複合類似度が用いられることが多い。 Conventional character image identification methods generally include a pattern matching method and a structured analysis method. A pattern matching method is often used for identifying printed characters. In the pattern matching method, a character pattern extracted from an image is matched with a standard pattern held in a dictionary to recognize a character. For this matching, a composite similarity is often used.

また、複合類似度のマッチングには計算時間が掛かるため、まずは単純類似度を計算し、単純類似度の値が高い上位の候補に対してのみ複合類似度を計算する手法が知られている。 In addition, since it takes a long time to calculate the composite similarity, a method is known in which simple similarity is calculated first, and composite similarity is calculated only for higher candidates having a high simple similarity value.

図１１は、従来のパターン照合法を説明するフローチャートである。まず、ステップＳ５０において、入力された画像から文字の外接枠に沿って文字画像を切り出す。図１２が切り出された文字画像の一例を示す図である。文字画像「あ」に外接するように四角形の枠で切り出されている。 FIG. 11 is a flowchart for explaining a conventional pattern matching method. First, in step S50, a character image is cut out along the circumscribed frame of characters from the input image. FIG. 12 is a diagram illustrating an example of a character image cut out. A square frame is cut out so as to circumscribe the character image “A”.

次に、ステップＳ５１へ進んで、切り出された文字画像からその文字の特徴を抽出する。この特徴抽出の手法としてはメッシュ特徴がよく知られている。メッシュ特徴とは、切り出された文字画像をメッシュに分割し、各メッシュにおける画素数を数値化して正規化を行う処理である。図１３は、図１２の文字画像をメッシュ特徴で分割した図である。図１３では１文字画像を縦横にそれぞれ８等分して６４メッシュに区切っている。図１４は、図１３の各メッシュにおける画素数を数値化して正規化した特徴データである。それぞれのメッシュの中で画像の特徴量（０〜１２８）を抽出し、８×８のマトリクスで表している。 Next, it progresses to step S51 and the characteristic of the character is extracted from the cut-out character image. A mesh feature is well known as a feature extraction method. The mesh feature is a process of dividing the cut character image into meshes, and normalizing the number of pixels in each mesh. FIG. 13 is a diagram in which the character image of FIG. 12 is divided by mesh features. In FIG. 13, a single character image is divided into 8 meshes vertically and horizontally and divided into 64 meshes. FIG. 14 shows characteristic data obtained by normalizing the number of pixels in each mesh of FIG. Image features (0 to 128) are extracted from each mesh and represented by an 8 × 8 matrix.

次に、ステップＳ５２へ進んで、入力された文字画像の特徴データと、辞書が有する標準パターンである特徴データとの単純類似度を計算する。入力された文字画像の特徴データをＸ、辞書の或るカテゴリの特徴データをＹとすると、単純類似度Ｓ（Ｘ，Ｙ）は次の式で表される。カテゴリは、１文字のデータに相当する。但し、同じ文字でもフォントによって形状がかなり違うものは別々のカテゴリとなる。

Next, the process proceeds to step S52, and the simple similarity between the feature data of the input character image and the feature data which is a standard pattern included in the dictionary is calculated. If the feature data of the input character image is X and the feature data of a certain category in the dictionary is Y, the simple similarity S (X, Y) is expressed by the following equation. The category corresponds to one character data. However, even the same characters with different shapes depending on the font are in different categories.

分母の値は、入力された文字画像の特徴データ及び辞書が有する特徴データを正規化しておくと一定になるので分子の計算だけで処理できる。分子の値は、８×８のメッシュ特徴を用いた場合は、数２のように計算される。

Since the denominator value becomes constant when the feature data of the input character image and the feature data of the dictionary are normalized, the denominator value can be processed only by calculating the numerator. The value of the numerator is calculated as shown in Equation 2 when an 8 × 8 mesh feature is used.

ステップＳ５２では、上記の計算を辞書が有する全てのカテゴリに対して行う。なお、入力された文字画像の特徴データと辞書が有する特徴データとの近さを表す指標としては、類似度の他にシティーブロック距離やユークリッド距離などを用いてもよい。 In step S52, the above calculation is performed for all categories of the dictionary. In addition to the similarity, a city block distance, an Euclidean distance, or the like may be used as an index representing the proximity between the input character image feature data and the feature data of the dictionary.

次に、ステップＳ５３へ進んで、全てのカテゴリに対して計算された単純類似度の値の中から上位ｎ個（ｎは自然数）のカテゴリを抽出する。そして、ステップＳ５４へ進んで、抽出されたｎ個のカテゴリに対して複合類似度を計算する。次に、ステップＳ５５へ進んで、計算されたｎ個の複合類似度の中から上位ｍ個（ｍは自然数）の候補文字を抽出する。そして、ステップＳ５６へ進んで、抽出されたｍ個の候補文字の文字コードを認識結果として出力する。 Next, proceeding to step S53, the top n categories (n is a natural number) are extracted from the values of simple similarity calculated for all categories. Then, the process proceeds to step S54, and the composite similarity is calculated for the n extracted categories. Next, the process proceeds to step S55, and the top m (m is a natural number) candidate characters are extracted from the calculated n composite similarities. Then, the process proceeds to step S56, and the extracted character codes of the m candidate characters are output as recognition results.

上記のパターン照合法においては、ステップＳ５２の単純類似度の計算処理が、全体の処理量の大部分を占める。そこで処理速度の更なる高速化のための工夫が試みられている。 In the pattern matching method, the simple similarity calculation process in step S52 occupies most of the entire processing amount. Thus, attempts have been made to further increase the processing speed.

例えば特許文献１には、辞書が有するカテゴリを類似カテゴリグループに分け、各グループを代表する特徴データと入力された文字画像の特徴データとのマッチングを図１１のステップＳ５２の前に行い、マッチングした上位グループのみステップＳ５２以降の処理を行うことが開示されている。 For example, in Patent Document 1, the categories of the dictionary are divided into similar category groups, and the feature data representing each group and the feature data of the input character image are matched before step S52 in FIG. It is disclosed that only the upper group performs the processing after step S52.

また特許文献２には、入力された文字画像の特徴データを各行に区切り、各行の最大の値をマークし、また辞書が有する特徴データに対しても予め各行単位に最大の値をとる可能性がある場所全てにマークしておき、マークした箇所が一致するか否かを照合し、一致するときのみ図１１のステップＳ５２以降の処理を行うことが開示されている。
特開昭６３−２６３５９０号公報特許第２９３８２７６号公報 In Patent Document 2, the feature data of the input character image is divided into lines, the maximum value of each line is marked, and the feature data of the dictionary may have a maximum value for each line in advance. It is disclosed that all locations are marked, collated whether the marked locations match, and the processing after step S52 in FIG. 11 is performed only when they match.
JP-A 63-263590 Japanese Patent No. 2938276

しかしながら、特許文献１では類似カテゴリグループへ分ける際に理想的な分割を行うのが難しいという問題がある。また入力された文字画像の特徴データとグループの代表ベクトルとのマッチングになるので、辞書データとしては本来近い特徴データを有するカテゴリを含むグループではなく、別のグループが近くなることが多発する。このため複数のグループを候補として残すことになり、あまり候補が絞れないことがある。 However, Patent Document 1 has a problem that it is difficult to perform ideal division when dividing into similar category groups. In addition, since the feature data of the input character image is matched with the representative vector of the group, the dictionary data frequently comes close to another group instead of a group including a category having inherently similar feature data. For this reason, a plurality of groups are left as candidates, and the candidates may not be narrowed down.

また特許文献２では、特徴データの各行に対して最大の値の場所を求めているが、最大の値の場所は非常にぶれやすいため、予め用意しておく辞書データを作成するのが難しく、漏れが発生する可能性が高い。また、漏れを防ぐために広範囲に最大の値を取り得るマークをつければ、大多数のものと一致してしまい、あまり高速化できない。 Further, in Patent Document 2, the location of the maximum value is obtained for each row of the feature data. However, since the location of the maximum value is very easy to shake, it is difficult to create dictionary data prepared in advance. There is a high possibility of leakage. If a mark that can take the maximum value in a wide range is provided in order to prevent leakage, it matches with the majority, and the speed cannot be increased very much.

更に、特許文献１及び２では、辞書の特徴データ以外に高速化に使うデータを予め用意しておかねばならず、データ容量の増加に繋がるという問題もある。 Further, in Patent Documents 1 and 2, there is a problem that data used for speeding up must be prepared in advance in addition to dictionary feature data, leading to an increase in data capacity.

本発明は、文字認識のマッチングの計算量を減らして高速化するとともに、正確に文字認識するパターン照合装置を提供することを目的とする。また、そのパターン照合装置を実現するためのプログラムとプログラムを記録した記録媒体とを提供することも目的とする。 It is an object of the present invention to provide a pattern matching apparatus that accurately recognizes characters while reducing the amount of calculation for matching for character recognition to increase the speed. Another object of the present invention is to provide a program for realizing the pattern matching device and a recording medium on which the program is recorded.

上記目的を達成するために本発明は、入力された文字画像の特徴データと、辞書が有する複数カテゴリの特徴データとを比較して文字認識するパターン照合装置であって、前記辞書が有する複数カテゴリの特徴データから、各カテゴリの各要素に対して、第１の閾値以下であるか否かを示す２値の値である辞書照合データを求める手段と、前記入力された文字画像の特徴データから、各要素に対して、第２の閾値以上であるか否かを示す２値の値である入力照合データを求める手段と、前記辞書照合データと前記入力照合データとを照合し、一致する要素の個数が所定値以下のカテゴリのみ辞書とのマッチング計算を行う手段とを備えたことを特徴とする。 In order to achieve the above object, the present invention provides a pattern matching apparatus that recognizes characters by comparing feature data of an input character image with feature data of a plurality of categories included in the dictionary, the plurality of categories included in the dictionary. Means for obtaining dictionary collation data which is a binary value indicating whether or not each element of each category is equal to or less than a first threshold value from the feature data of the category, and from the feature data of the input character image , A means for obtaining input collation data that is a binary value indicating whether or not each element is equal to or greater than a second threshold, and collating the dictionary collation data with the input collation data, and matching elements Means for performing matching calculation with a dictionary only for a category whose number is equal to or less than a predetermined value.

この構成によると、辞書照合データと入力照合データとを照合し、全く一致しない若しくは一致する個数が少ない場合のみ、類似度の計算を行うことで高速化を図ることができる。入力された文字画像の特徴データの各要素に対して第２の閾値以上であるか否かを求めることは、閾値をある程度大きくした場合に文字部分であるかどうかを示す意味がある。また辞書が有する複数カテゴリの特徴データの各要素に対して第１の閾値以下であるか否かを求めることは、閾値をある程度小さくした場合に背景部分であるかどうかを示す意味がある。辞書の特徴データは数多くのパターンを平均化したものであるので、閾値を小さくした場合に、その閾値以下であるということは絶対的に背景部分であることを意味し、該当文字が入力された場合にはその箇所は文字部分にはならないことを利用している。また本来背景部分になる要素に文字部分が重なる程に入力された文字画像の特徴データが変形している場合には、辞書とのマッチングが行われないことになる。しかし、この場合マッチングを行ったとしても類似度は数２の計算で求められるので、入力された文字画像の特徴データの大きい部分に相当するｘ_iと対応する辞書の要素ｙ_iの積が小さいことから結果的に類似度の値も小さくなり、マッチング計算を全て行ったとしても最終的に候補になる可能性は低くなる。 According to this configuration, the dictionary collation data and the input collation data are collated, and only when there is no coincidence or the number of coincidence is small, the speed can be increased by calculating the similarity. Obtaining whether or not each element of the feature data of the input character image is greater than or equal to the second threshold has a meaning of indicating whether or not it is a character portion when the threshold is increased to some extent. Further, obtaining whether or not each element of the feature data of a plurality of categories included in the dictionary is equal to or less than the first threshold has a meaning indicating whether or not it is a background portion when the threshold is reduced to some extent. Since the feature data of the dictionary is an average of many patterns, when the threshold value is reduced, being below that threshold means that it is absolutely the background part, and the corresponding character has been entered. In some cases, it is used that the part does not become a character part. In addition, when the feature data of the character image input so that the character part overlaps the element that originally becomes the background part, the matching with the dictionary is not performed. However, even if matching is performed in this case, the degree of similarity can be obtained by the calculation of Formula 2, so that the product of x _i corresponding to a large portion of the feature data of the input character image and the corresponding element y _i of the dictionary is small. As a result, the value of the similarity also becomes small, and even if all the matching calculations are performed, the possibility of finally becoming a candidate is low.

また本発明においては、辞書照合データを予め保持しておく必要はなく、装置の起動時に一度だけ値を求める処理を行えばよいので、辞書の特徴データ以外の余分なデータを持たなくてもよい。 Further, in the present invention, it is not necessary to store dictionary collation data in advance, and it is only necessary to obtain a value once when the apparatus is started up, so there is no need to have extra data other than dictionary feature data. .

入力された文字画像の特徴データより求めた入力照合データと、辞書の特徴データより求めた辞書照合データとの照合によりマッチングを行うか否かを判定できるので、高速な演算が可能である。また認識結果の正確性も増す。更に入力照合データと辞書照合データとの照合をまとまった単位の論理積で行えば判定に要する演算量を減らすことが可能である。また辞書の特徴データから求める処理を装置の起動時に行うことで、予め辞書照合データを持っておく必要がなく、データ容量の削減を図ることが可能である。 Since it is possible to determine whether or not to perform matching by collating input collation data obtained from input character image feature data and dictionary collation data obtained from dictionary feature data, high-speed calculation is possible. Also, the accuracy of the recognition result is increased. Furthermore, if the collation between the input collation data and the dictionary collation data is performed by a logical product of a unit, it is possible to reduce the amount of calculation required for the determination. Further, by performing the processing to be obtained from the feature data of the dictionary at the time of starting the apparatus, it is not necessary to have dictionary collation data in advance, and the data capacity can be reduced.

図１は、パターン照合装置の主要な構成を示すブロック図である。パターン照合装置１０は入力装置１１と出力装置１２とに接続されている。入力装置１１は、イメージスキャナなどからなり文字画像を読み取りパターン照合装置１０へ読み取った文字画像を送る。出力装置１２は、液晶表示装置などからなりパターン照合装置１０で認識した文字が出力される。 FIG. 1 is a block diagram showing the main configuration of the pattern matching apparatus. The pattern matching device 10 is connected to an input device 11 and an output device 12. The input device 11 is composed of an image scanner or the like, and reads the character image and sends the read character image to the pattern matching device 10. The output device 12 is composed of a liquid crystal display device or the like and outputs characters recognized by the pattern matching device 10.

パターン照合装置１０は、本装置全体を制御する制御部１３と、入力装置１１から送られてきた画像を１文字分の文字画像に切り出す切り出し部１４と、予め定められた方法で、切り出された文字画像の特徴量を抽出し、後述の特徴データを作成する特徴抽出部１５と、単純類似度を計算する前にマッチングの計算を行う候補を絞る大分類部１６と、辞書２１を構成する単純類似度用辞書特徴データ１７及び複合類似度用辞書特徴データ１８と、入力された文字画像の特徴データと単純類似度用辞書特徴データ１７や複合類似度用辞書特徴データ１８との類似度を計算するマッチング部１９と、大分類処理を行う際に用いる辞書照合データを単純類似度用辞書特徴データ１７から作成する辞書照合データ作成部２０とを備えている。 The pattern matching apparatus 10 is cut out by a predetermined method, a control unit 13 that controls the entire apparatus, a cutout unit 14 that cuts out an image sent from the input device 11 into a character image for one character, and a predetermined method. A feature extraction unit 15 that extracts feature values of a character image and creates feature data to be described later, a large classification unit 16 that narrows down candidates for matching calculation before calculating a simple similarity, and a simple that constitutes the dictionary 21 Similarity degree dictionary feature data 17 and composite similarity dictionary feature data 18, and similarity between the input character image feature data and simple similarity dictionary feature data 17 and composite similarity dictionary feature data 18 are calculated. And a dictionary matching data creating unit 20 for creating dictionary matching data used when performing the large classification process from the dictionary feature data 17 for simple similarity.

図２は、パターン照合法を説明するフローチャートである。まず、ステップＳ１０において、切り出し部１４は入力装置１１に入力された画像を受け取り、画像中の文字の外接枠に沿って文字画像を切り出す。切り出された文字画像の一例は図１２と同様である。 FIG. 2 is a flowchart for explaining the pattern matching method. First, in step S10, the cutout unit 14 receives an image input to the input device 11, and cuts out a character image along a circumscribed frame of characters in the image. An example of the clipped character image is the same as in FIG.

次に、ステップＳ１１へ進んで、切り出された文字画像からその文字の特徴を抽出する。この特徴抽出の手法としてはメッシュ特徴を用いることができる。メッシュ特徴とは、切り出された文字画像をメッシュに分割し、各メッシュにおける画素数を数値化して正規化を行う処理である。文字画像をメッシュ特徴で分割した一例は図１３と同様である。図１３では１文字画像を縦横にそれぞれ８等分して６４メッシュに区切っている。各メッシュにおける画素数を数値化して正規化した特徴データは図１４と同様である。それぞれのメッシュの中で画像の特徴量を抽出し、８×８のマトリクスで表している。 Next, it progresses to step S11 and the characteristic of the character is extracted from the cut-out character image. A mesh feature can be used as the feature extraction method. The mesh feature is a process of dividing the cut character image into meshes, and normalizing the number of pixels in each mesh. An example in which the character image is divided by the mesh feature is the same as in FIG. In FIG. 13, a single character image is divided into 8 meshes vertically and horizontally and divided into 64 meshes. The characteristic data obtained by normalizing the number of pixels in each mesh by numerical values is the same as that shown in FIG. Image features are extracted from each mesh and represented by an 8 × 8 matrix.

次に、ステップＳ１２へ進んで、大分類処理を行う。ステップＳ１３で単純類似度を計算する前にマッチングの計算を行う候補を絞る処理である。この処理の詳細については後述する。 Next, it progresses to step S12 and performs a large classification process. This is a process for narrowing down candidates for matching calculation before calculating the simple similarity in step S13. Details of this processing will be described later.

次に、ステップＳ１３へ進んで、ステップＳ１２で絞られた候補に対して単純類似度の計算を行う。単純類似度は、入力された文字画像の特徴データと、単純類似度用辞書特徴データ１７とから計算する。入力された文字画像の特徴データをＸ、辞書の或るカテゴリの特徴データをＹとすると、単純類似度Ｓ（Ｘ，Ｙ）は数１で表される。カテゴリは、１文字のデータに相当する。但し、同じ文字でもフォントによって形状がかなり違うものは別々のカテゴリとなる。 Next, the process proceeds to step S13, and simple similarity is calculated for the candidates narrowed down in step S12. The simple similarity is calculated from the input character image feature data and the simple similarity dictionary feature data 17. When the feature data of the input character image is X and the feature data of a certain category of the dictionary is Y, the simple similarity S (X, Y) is expressed by the following equation (1). The category corresponds to one character data. However, even the same characters with different shapes depending on the font are in different categories.

分母の値は、入力された文字画像の特徴データ及び辞書が有する特徴データを正規化しておくと一定になるので分子の計算だけで処理できる。分子の値は、８×８のメッシュ特徴を用いた場合は、数２のように計算される。なお、入力された文字画像の特徴データと辞書が有する特徴データとの近さを表す指標としては、類似度の他にシティーブロック距離やユークリッド距離などを用いてもよい。 Since the denominator value becomes constant when the feature data of the input character image and the feature data of the dictionary are normalized, the denominator value can be processed only by calculating the numerator. The value of the numerator is calculated as shown in Equation 2 when an 8 × 8 mesh feature is used. In addition to the similarity, a city block distance, an Euclidean distance, or the like may be used as an index representing the proximity between the input character image feature data and the feature data of the dictionary.

次に、ステップＳ１４へ進んで、全てのカテゴリに対して計算された単純類似度の値の中から上位ｎ個（ｎは自然数）のカテゴリを抽出する。そして、ステップＳ１５へ進んで、抽出されたｎ個のカテゴリに対して複合類似度を計算する。次に、ステップＳ１６へ進んで、計算されたｎ個の複合類似度の中から上位ｍ個（ｍは自然数）の候補文字を抽出する。そして、ステップＳ１７へ進んで、抽出されたｍ個の候補文字の文字コードを認識結果として出力する。以上が１文字分の文字認識処理であり、通常は切り出す文字画像がなくなるまで繰り返される。 In step S14, the top n categories (n is a natural number) are extracted from the values of simple similarity calculated for all categories. Then, the process proceeds to step S15, and the composite similarity is calculated for the n extracted categories. Next, the process proceeds to step S16, and the top m (m is a natural number) candidate characters are extracted from the calculated n composite similarities. In step S17, the extracted character codes of the m candidate characters are output as recognition results. The above is the character recognition process for one character, and is normally repeated until there is no character image to be cut out.

図３は、辞書照合データ作成部２０の動作を示すフローチャートである。まず、ステップＳ２０において、辞書２１から単純類似度用辞書特徴データ１７を取り出す。 FIG. 3 is a flowchart showing the operation of the dictionary collation data creation unit 20. First, in step S <b> 20, simple similarity dictionary feature data 17 is extracted from the dictionary 21.

次に、ステップＳ２１へ進んで、辞書照合データ作成部２０内に設けられたカウンタ（不図示）の値ｉ（ｉは自然数）を初期値である１にセットする。次に、ステップＳ２２へ進んで、特徴データの６４個の中のｉ番目（カウンタの値）の要素の値が閾値β（βは自然数）以下であるか否かを判定する。 Next, the process proceeds to step S21, and a value i (i is a natural number) of a counter (not shown) provided in the dictionary collation data creation unit 20 is set to 1 which is an initial value. Next, the process proceeds to step S22, and it is determined whether or not the value of the i-th (counter value) element among the 64 pieces of feature data is equal to or less than a threshold value β (β is a natural number).

ステップＳ２２において、特徴データのｉ番目の要素の値が閾値β以下の場合は、ステップＳ２３へ進んで該当するｂｉｔを１にセットする。一方、ステップＳ２２において、特徴データのｉ番目の要素の値が閾値βより大きい場合は、ステップＳ２４へ進んで該当するｂｉｔを０にセットする。そして、ステップＳ２３又はステップＳ２４からステップＳ２５へ進んでカウンタの値ｉをインクリメントする。 In step S22, when the value of the i-th element of the feature data is equal to or less than the threshold value β, the process proceeds to step S23 and the corresponding bit is set to 1. On the other hand, if the value of the i-th element of the feature data is larger than the threshold value β in step S22, the process proceeds to step S24 and the corresponding bit is set to 0. Then, the process proceeds from step S23 or step S24 to step S25 to increment the counter value i.

次に、ステップＳ２６へ進んで、カウンタの値ｉが６４より大きいか否かを判定する。ステップＳ２６において、カウンタの値ｉが６４より大きい場合は、特徴データの全要素に対するチェックが終わったと判断して、ステップＳ２７へ進む。一方、ステップＳ２６において、カウンタの値ｉが６４以下の場合は、ステップＳ２２に戻る。 Next, the process proceeds to step S26, where it is determined whether or not the counter value i is larger than 64. In step S26, if the counter value i is larger than 64, it is determined that all the elements of the feature data have been checked, and the process proceeds to step S27. On the other hand, if the counter value i is 64 or less in step S26, the process returns to step S22.

ステップＳ２７では、全てのカテゴリに対して処理が終わったか否かを判定する。ステップＳ２７において、全てのカテゴリに対して処理が終わっている場合は、辞書照合データの作成を終了する。一方、全てのカテゴリに対して処理が終わっていない場合は、ステップＳ２０に戻る。 In step S27, it is determined whether or not processing has been completed for all categories. In step S27, if the processing has been completed for all categories, the creation of dictionary collation data is terminated. On the other hand, if the processing has not been completed for all categories, the process returns to step S20.

例えば、β＝２とした場合の辞書照合データの例を説明する。図４（ａ）は、文字「あ」の辞書の特徴データを示したものであり、６４ｂｉｔの辞書照合データ”1100011100000000000000011000000000000000000001000000000000000000”が作られる。図４（ｂ）は、イメージをわかりやすくするため各要素に対して１になった部分を黒、０になった部分を白で表したものである。 For example, an example of dictionary collation data when β = 2 will be described. FIG. 4A shows characteristic data of the dictionary of the character “A”, and 64-bit dictionary collation data “1100011100000000000000011000000000000000000001000000000000000000” is created. In FIG. 4B, for easy understanding of the image, the portion that is 1 for each element is represented by black, and the portion that is 0 is represented by white.

また、図５（ａ）は、文字「い」の辞書の特徴データを示したものであり、６４ｂｉｔの辞書照合データ”0011100100111000001110000011110000001100000011000000110000001111”が作られる。図５（ｂ）は、イメージをわかりやすくするため各要素に対して１になった部分を黒、０になった部分を白で表したものである。 FIG. 5A shows the characteristic data of the dictionary of the character “I”, and 64-bit dictionary collation data “0011100100111000001110000011110000001100000011000000110000001111” is created. FIG. 5 (b) shows a portion that becomes 1 for each element in black and a portion that becomes 0 in white for easy understanding of the image.

また、図６（ａ）は、文字「会」の辞書の特徴データを示したものであり、６４ｂｉｔの辞書照合データ” 1100011110000001000000000000000000000000000000001000000100000000”が作られる。図６（ｂ）は、イメージをわかりやすくするため各要素に対して１になった部分を黒、０になった部分を白で表したものである。 FIG. 6A shows the characteristic data of the dictionary of the character “kai”, and 64-bit dictionary collation data “1100011110000001000000000000000000000000000000001000000100000000” is created. FIG. 6 (b) shows a portion that becomes 1 for each element in black and a portion that becomes 0 in white for easy understanding of the image.

このように、全てのカテゴリについて照合用の６４ｂｉｔデータが作られる。なお、辞書照合データの作成処理は、パターン照合装置１０の起動時に１度だけ行えばよい。またＲＯＭ（不図示）等の容量に余裕があるならば、起動時に作成するのではなく、予め作成したデータを保存しておいてもよい。 In this way, 64-bit data for collation is created for all categories. Note that the dictionary collation data creation process need only be performed once when the pattern collation apparatus 10 is activated. Further, if there is a sufficient capacity of a ROM (not shown) or the like, data created in advance may be stored instead of being created at startup.

図７は、図２のステップＳ１２の大分類処理を説明するフローチャートである。まず、ステップＳ３０において、入力された文字画像の特徴データから閾値αを用いて入力照合データを作成する。この処理の詳細については後述する。 FIG. 7 is a flowchart for explaining the large classification processing in step S12 of FIG. First, in step S30, input collation data is created from the input character image feature data using the threshold value α. Details of this processing will be described later.

次に、ステップＳ３１へ進んで、ステップＳ３０で作成された入力照合データと、図３のフローで作成された辞書照合データとの照合を行う。この照合は、６４ｂｉｔデータの論理積によって行えばよい。処理装置の能力によって一度に扱えるｂｉｔ数は異なるので、６４ｂｉｔの処理能力があれば一度に処理すればよいし、３２ｂｉｔの処理能力があれば上位３２ｂｉｔと下位３２ｂｉｔの２つに分けて処理すればよい。同様に、１６ｂｉｔの処理能力であれば４つに分けて処理すればよい。 Next, the process proceeds to step S31, where the input collation data created in step S30 is collated with the dictionary collation data created in the flow of FIG. This collation may be performed by a logical product of 64-bit data. The number of bits that can be handled at a time differs depending on the capacity of the processing device. Therefore, if there is a 64-bit processing capacity, it may be processed at one time. Good. Similarly, if the processing capability is 16 bits, the processing may be divided into four.

次に、ステップＳ３２へ進んで、ステップＳ３１での照合の結果、一致する要素の数がｐ個（ｐは自然数）以下であるか否かを判定する。具体的には、ステップＳ３１で照合結果が６４ｂｉｔのｂｉｔ列になっているので、その中で１になっているｂｉｔの数がｐ個以下かどうか判定すればよい。１になっているかどうかの判定は各ビット毎にチェックを行ってもよいし、例えば８ｂｉｔ単位に分割し、予め８ｂｉｔの取りうる値で１になっている数を返すテーブルを用いて計算してもよい。図８が８ｂｉｔの値に対して１になっている数を返すテーブルの例である。また特にｐ＝０の場合においてはステップＳ３１の処理において結果が０であるか否かで判断できるので、一致している数を数える必要がなくより高速に処理が行える。 Next, the process proceeds to step S32, and it is determined whether or not the number of matching elements is equal to or less than p (p is a natural number) as a result of the collation in step S31. Specifically, since the collation result is a 64-bit bit string in step S31, it may be determined whether or not the number of bits that are 1 is less than or equal to p. Whether or not it is 1 may be checked for each bit. For example, it is calculated using a table that divides into 8 bits and returns a number that is 1 in 8 bits. Also good. FIG. 8 is an example of a table that returns a number that is 1 for an 8-bit value. In particular, when p = 0, it can be determined whether or not the result is 0 in the process of step S31, so that it is not necessary to count the number of coincidence, and the process can be performed at a higher speed.

ステップＳ３２において、一致する要素の数がｐ個以下である場合は、ステップＳ３３へ進んで、当該カテゴリを候補対象としてチェックする。一方、ステップＳ３２において、一致する要素の数がｐ個より多い場合は、ステップＳ３４へ進んで、当該カテゴリを候補対象としない。 If the number of matching elements is not more than p in step S32, the process proceeds to step S33, and the category is checked as a candidate target. On the other hand, if the number of matching elements is larger than p in step S32, the process proceeds to step S34, and the category is not set as a candidate target.

ステップＳ３３又はステップＳ３４からステップＳ３５へ進んで、全てのカテゴリに対して照合が終わったか否かを判定する。ステップＳ３５において、全てのカテゴリに対して照合が終わっている場合は、ステップＳ３６へ進む。一方、全てのカテゴリに対して照合が終わっていない場合は、ステップＳ３１に戻る。 It progresses to step S35 from step S33 or step S34, and it is determined whether collation was completed with respect to all the categories. If it is determined in step S35 that all categories have been verified, the process proceeds to step S36. On the other hand, if collation has not been completed for all categories, the process returns to step S31.

ステップＳ３６では、ステップＳ３３で候補対象としてチェックしたカテゴリの数がｋ個（ｋは自然数）以下であるか否かを判定する。これは、入力される文字画像の特徴データによっては、閾値αの設定値では候補が残りすぎる場合があり、それをチェックする処理である。 In step S36, it is determined whether or not the number of categories checked as candidate targets in step S33 is equal to or less than k (k is a natural number). This is a process of checking if there are too many candidates remaining with the set value of the threshold value α depending on the feature data of the input character image.

ステップＳ３６において、候補対象としてチェックしたカテゴリの数がｋ個以下の場合は、大分類処理を終了する。一方、ステップＳ３６において、候補対象としてチェックしたカテゴリの数がｋ個より多い場合は、ステップＳ３７へ進んで閾値αの値を変更する。具体的には、閾値αの値を低くすることにより、文字とみなされる要素を増やす。つまり、ステップＳ３０で作成される６４ｂｉｔの入力照合データにおいて１になるｂｉｔを増やし、ステップＳ３１の照合結果でｐ個以下でない可能性を増やす。 In step S36, if the number of categories checked as candidate targets is k or less, the large classification process is terminated. On the other hand, if the number of categories checked as candidate targets is greater than k in step S36, the process proceeds to step S37 to change the value of the threshold α. Specifically, by reducing the value of the threshold value α, elements that are regarded as characters are increased. That is, the number of 1 bits in the 64-bit input collation data created in step S30 is increased, and the possibility that the collation result in step S31 is not less than p is increased.

図９は、図７のステップＳ３０の入力照合データの作成処理を説明するフローチャートである。まず、ステップＳ４０において、カウンタ（不図示）の値ｉを初期値である１にセットする。次に、ステップＳ４１へ進んで、入力された文字画像の特徴データの６４個の中のｉ番目（カウンタの値）の要素の値が閾値α（αは自然数）以上であるか否かを判定する。 FIG. 9 is a flowchart for explaining the input collation data creation process in step S30 of FIG. First, in step S40, a value i of a counter (not shown) is set to 1 which is an initial value. Next, the process proceeds to step S41, where it is determined whether or not the value of the i-th (counter value) element among the 64 feature data of the input character image is greater than or equal to a threshold value α (α is a natural number). To do.

ステップＳ４１において、特徴データのｉ番目の要素の値が閾値α以上の場合は、ステップＳ４２へ進んで該当するｂｉｔを１にセットする。一方、ステップＳ４１において、特徴データのｉ番目の要素の値が閾値αより小さい場合は、ステップＳ４３へ進んで該当するｂｉｔを０にセットする。そして、ステップＳ４２又はステップＳ４３からステップＳ４４へ進んでカウンタの値ｉをインクリメントする。 If the value of the i-th element of the feature data is greater than or equal to the threshold value α in step S41, the process proceeds to step S42 and the corresponding bit is set to 1. On the other hand, if the value of the i-th element of the feature data is smaller than the threshold value α in step S41, the process proceeds to step S43 and the corresponding bit is set to 0. Then, the process proceeds from step S42 or step S43 to step S44 to increment the counter value i.

次に、ステップＳ４５へ進んで、カウンタの値ｉが６４より大きいか否かを判定する。ステップＳ４５において、カウンタの値ｉが６４より大きい場合は、入力された文字画像の特徴データの全要素に対するチェックが終わったと判断して、入力照合データの作成を終了する。一方、ステップＳ４５において、カウンタの値ｉが６４以下の場合は、ステップＳ４１に戻る。 Next, the process proceeds to step S45, where it is determined whether or not the counter value i is larger than 64. In step S45, if the counter value i is larger than 64, it is determined that all elements of the feature data of the input character image have been checked, and the creation of the input collation data is terminated. On the other hand, if the counter value i is 64 or less in step S45, the process returns to step S41.

例えば、α＝３０、ｐ＝０とした場合の大分類処理を説明する。図７のステップＳ３０の処理により入力された文字画像の特徴データから６４ｂｉｔの入力照合データ、例えば”0001000001111110001000000011111001101001100110011011001100000110”が作られる。図１０は、この入力照合データのイメージをわかりやすくするため各要素に対して１になった部分を黒、０になった部分を白で表したものである。 For example, a large classification process when α = 30 and p = 0 will be described. 64-bit input collation data, for example, “0001000001111110001000000011111001101001100110011011001100000110” is generated from the feature data of the character image input by the process of step S30 in FIG. FIG. 10 shows the portion of the input verification data in black and the portion of 0 in white for easy understanding of the input collation data image.

図７のステップＳ３１において、例えば図４の「あ」との照合結果は”0000000000000000000000000000000000000000000000000000000000000000”となり、一致する数が０（ｐ＝０）であり、候補対象になることが分かる。また図５の「い」との照合結果は”0001000000111000001000000011110000001000000010000000000000000110”となり、一致する数が０ではないので、候補対象にならないことがわかる。また図６の「会」との照合結果は”0000000000000000000000000000000000000000000000001000000100000000”となり、一致する数が０ではないので、候補対象にならないことがわかる。 In step S31 of FIG. 7, for example, the collation result with “A” in FIG. Also, the collation result with “I” in FIG. 5 is “0001000000111000001000000011110000001000000010000000000000000110”, and since the number of matches is not 0, it can be seen that the candidate is not a candidate. In addition, the collation result with “Meeting” in FIG. 6 is “0000000000000000000000000000000000000000000000001000000100000000”, and since the number of matches is not 0, it can be seen that it is not a candidate target.

このように大分類処理を行い、辞書の特徴データをうまく利用することでマッチする可能性がある文字は落とさずに候補を絞り高速化できる。 By performing large classification processing in this way and using the feature data of the dictionary well, candidates can be narrowed down and speeded up without dropping characters that may be matched.

なお、本実施形態で用いた数値は特定の値ではなく、適宜設定すればよい。また図７のステップＳ３６やステップＳ３７で候補が多すぎる場合の処理を行っているが、逆に候補が少なすぎる場合のチェックを行い、αの値を増やすことで候補を増やすことも可能である。さらに入力された文字画像の特徴データの閾値αを変更させるだけでなく、辞書が有する特徴データに対しての閾値βを複数用意しておき、条件に合わせて照合するデータを変えることも可能である。また図７のステップＳ３６やステップＳ３７の処理を行わず、残った候補がいくつであってもかまわないように変更してもよい。 The numerical values used in the present embodiment are not specific values and may be set as appropriate. Further, although processing is performed when there are too many candidates in step S36 and step S37 in FIG. 7, it is also possible to increase the number of candidates by checking if there are too few candidates and increasing the value of α. . In addition to changing the threshold value α of the feature data of the input character image, it is possible to prepare a plurality of threshold values β for the feature data of the dictionary and change the data to be collated according to the conditions. is there. Further, the process of step S36 and step S37 of FIG. 7 may be omitted, and the number of remaining candidates may be changed so that it does not matter.

なお、上記のパターン照合装置１０には、上記の動作を実行するためのパターン照合プログラムが搭載されている。また、そのパターン照合プログラムはＲＯＭ、ＨＤＤ、ＣＤ、ＤＶＤ等の記録媒体に記録されて用いられる。 The pattern matching apparatus 10 is equipped with a pattern matching program for executing the above-described operation. The pattern matching program is recorded on a recording medium such as a ROM, HDD, CD, or DVD.

本発明のパターン照合装置の主要な構成を示すブロック図である。It is a block diagram which shows the main structures of the pattern collation apparatus of this invention. パターン照合法を説明するフローチャートである。It is a flowchart explaining a pattern matching method. 本発明の辞書照合データ作成部の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the dictionary collation data preparation part of this invention. （ａ）は、文字「あ」の辞書の特徴データを示したものであり、（ｂ）は、図４（ａ）をイメージで表したものである。(A) shows the characteristic data of the dictionary of the character “A”, and (b) shows an image of FIG. 4 (a). （ａ）は、文字「い」の辞書の特徴データを示したものであり、（ｂ）は、図５（ａ）をイメージで表したものである。(A) shows the characteristic data of the dictionary of the character “I”, and (b) shows the image of FIG. 5 (a). （ａ）は、文字「会」の辞書の特徴データを示したものであり、（ｂ）は、図６（ａ）をイメージで表したものである。(A) shows the characteristic data of the dictionary of the character “kai”, and (b) shows the image of FIG. 6 (a). 図２のステップＳ１２の大分類処理を説明するフローチャートである。It is a flowchart explaining the large classification process of step S12 of FIG. ８ｂｉｔの値に対して１になっている数を返すテーブルの例である。It is an example of the table which returns the number which is 1 with respect to the value of 8 bits. 図７のステップＳ３０の入力照合データの作成処理を説明するフローチャートである。It is a flowchart explaining the creation process of the input collation data of step S30 of FIG. 本発明の入力照合データの一例をイメージで表したものである。An example of the input collation data of the present invention is represented by an image. 従来のパターン照合法を説明するフローチャートである。It is a flowchart explaining the conventional pattern matching method. 従来の切り出された文字画像の一例を示す図である。It is a figure which shows an example of the conventional character image cut out. 図１２の文字画像をメッシュ特徴で分割した図である。It is the figure which divided | segmented the character image of FIG. 12 by the mesh characteristic. 図１３の各メッシュにおける画素数を数値化して正規化した特徴データである。14 is characteristic data obtained by normalizing the number of pixels in each mesh in FIG.

Explanation of symbols

１０パターン照合装置
２１辞書 10 Pattern matching device 21 Dictionary

Claims

A pattern matching device that recognizes characters by comparing feature data of an input character image and feature data of a plurality of categories of a dictionary,
Means for obtaining dictionary collation data which is a binary value indicating whether or not the value is equal to or less than a first threshold value for each element of each category from the feature data of a plurality of categories included in the dictionary; Means for obtaining input collation data, which is a binary value indicating whether or not each element is greater than or equal to a second threshold value from the feature data of the character image; and the dictionary collation data and the input collation data. A pattern matching apparatus comprising: means for matching and performing a matching calculation with a dictionary only for a category in which the number of matching elements is equal to or less than a predetermined value.

2. The pattern matching apparatus according to claim 1, wherein the dictionary matching data creation process is performed only once when the apparatus is activated.

3. The pattern matching apparatus according to claim 1, wherein the dictionary matching data is matched with the input matching data, and matching calculation with the dictionary is performed only for a category having zero matching elements.

The pattern matching apparatus according to claim 1, wherein when the dictionary matching data and the input matching data are matched, a logical product operation is performed collectively in a certain unit.

In a pattern matching program that recognizes characters by comparing feature data of an input character image with feature data of a plurality of categories that the dictionary has by controlling the computer,
From the feature data of a plurality of categories possessed by the dictionary, dictionary collation data which is a binary value indicating whether or not each element of each category is equal to or less than a first threshold is obtained, and the input character image The input collation data which is a binary value indicating whether or not each element is equal to or greater than the second threshold is obtained from the feature data, and the dictionary collation data and the input collation data are collated and matched. A pattern matching program that performs matching calculation with a dictionary only for a category in which the number of elements to be performed is a predetermined value or less.

A recording medium on which the pattern matching program according to claim 5 is recorded.