JP5729348B2

JP5729348B2 - Character recognition device and character recognition method

Info

Publication number: JP5729348B2
Application number: JP2012098539A
Authority: JP
Inventors: 室崎　隆
Original assignee: Denso Corp
Current assignee: Denso Corp
Priority date: 2012-04-24
Filing date: 2012-04-24
Publication date: 2015-06-03
Anticipated expiration: 2032-04-24
Also published as: JP2013228781A

Description

本発明は、文字認識装置および文字認識方法に関するものである。 The present invention relates to a character recognition device and a character recognition method.

従来、特許文献１には、実際に誤読の起こる可能性のあるモデル組に関する情報を記憶しておく文字認識方法が記載されている。 Conventionally, Patent Document 1 describes a character recognition method for storing information on a model set that may actually cause misreading.

この従来技術では、整合処理によって得られた二つの候補モデルがＭｉ１、Ｍｉ２であった場合（Ｍｉ１、Ｍｉ２のいずれが第１位、第２位であるかを問わない）、テーブル要素Ａｉが参照されることになる。 In this prior art, when the two candidate models obtained by the matching process are Mi1 and Mi2 (regardless of which of the two ranks first and which second), the table element Ai is referenced.

そして、認識処理対象の文字画像と各候補モデル（ここではＭｉ１、Ｍｉ２）との相違度Ｄｉ１、Ｄｉ２を、対応した再評価係数ｋｉ１、ｋｉ２を乗じて再評価し、ｋｉ１×Ｄｉ１≦ｋｉ２×Ｄｉ２のときはモデルＭｉ１の文字カテゴリを最終結果とし、ｋｉ１×Ｄｉ１＞ｋｉ２×Ｄｉ２のときはモデルＭｉ２の文字カテゴリを最終結果とする。 Then, the dissimilarities Di1 and Di2 between the character image to be recognized and each candidate model (here, Mi1 and Mi2) are re-evaluated by multiplying them by the corresponding re-evaluation coefficients ki1 and ki2. When ki1 × Di1 ≦ ki2 × Di2, the character category of model Mi1 is the final result; when ki1 × Di1 > ki2 × Di2, the character category of model Mi2 is the final result.
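The prior-art decision rule above can be sketched in a few lines. This is only an illustration of the stated inequality; the function name `reevaluate` and the numeric values are assumptions and do not come from Patent Document 1.

```python
def reevaluate(d1: float, d2: float, k1: float, k2: float) -> int:
    """Return 1 if model Mi1 is the final result, 2 if model Mi2 is.

    d1, d2: dissimilarities Di1, Di2 (smaller means more similar).
    k1, k2: re-evaluation coefficients ki1, ki2 looked up from table element Ai.
    """
    # Mi1 wins when ki1 * Di1 <= ki2 * Di2, otherwise Mi2 wins.
    return 1 if k1 * d1 <= k2 * d2 else 2


# Made-up values: Mi1 has the smaller raw dissimilarity,
# but its coefficient penalizes it enough for Mi2 to win.
winner = reevaluate(d1=0.30, d2=0.32, k1=1.2, k2=1.0)
```

The sketch makes the criticism in the following paragraph concrete: the outcome depends entirely on the coefficients, and nothing here says how they should be chosen.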

特開平８−１９４７７８号公報 Patent Document 1: JP-A-8-194778

しかしながら、上記従来技術によると、候補モデルとの相違度を係数と掛け合わせて再評価しているに過ぎず、実質的に文字認識をしていない。また、評価係数の求め方が不明であるし、そもそも誤認識しない評価係数を予め決定することは困難であると考えられる。 However, the above prior art merely re-evaluates the dissimilarity to each candidate model by multiplying it by a coefficient and does not substantially perform character recognition. In addition, how the evaluation coefficients are obtained is unclear, and in the first place it is considered difficult to determine in advance evaluation coefficients that never cause misrecognition.

本発明は上記点に鑑みて、文字の認識精度を向上することを目的とする。 The present invention has been made in view of the above points, and an object thereof is to improve the recognition accuracy of characters.

上記目的を達成するため、請求項１に記載の発明では、文字が記された被検査物１を撮影した検査画像を取得する画像取得手段１１と、
検査画像の文字領域から求めた特徴量に基づいて、文字領域に写っている文字を識別するサポートベクターマシン１２９と、
複数のモデル画像の文字部分相互間の相違領域Ａ１を細分化した第１セルＢ１の輝度値を特徴量としてサポートベクターマシン１２９の学習データを生成する学習データ生成手段Ｓ１２０とを備えることを特徴とする。
To achieve the above object, the invention according to claim 1 comprises: image acquisition means 11 that acquires an inspection image obtained by photographing an inspection object 1 on which characters are written;
a support vector machine 129 that identifies the character appearing in a character area of the inspection image based on feature amounts obtained from that area; and
learning data generation means S120 that generates learning data for the support vector machine 129, using as feature amounts the luminance values of first cells B1 obtained by subdividing the difference area A1 between the character portions of a plurality of model images.

これによると、複数文字相互間の相違を顕在化させることができるので、文字の誤認識を抑制して認識精度を向上させることができる。 According to this, since a difference between a plurality of characters can be made apparent, erroneous recognition of characters can be suppressed and recognition accuracy can be improved.

なお、この欄および特許請求の範囲で記載した各手段の括弧内の符号は、後述する実施形態に記載の具体的手段との対応関係を示すものである。 The reference numerals in parentheses attached to each means in this section and in the claims indicate the correspondence with the specific means described in the embodiment below.

第１実施形態における文字認識装置の全体構成図である。 FIG. 1 is an overall configuration diagram of the character recognition device in the first embodiment.
図１の処理部を示すブロック図である。 FIG. 2 is a block diagram showing the processing unit of FIG. 1.
サポートベクターマシンの概念図である。 FIG. 3 is a conceptual diagram of a support vector machine.
サポートベクターマシンの学習処理を示すフローチャートである。 FIG. 4 is a flowchart showing the learning process of the support vector machine.
サポートベクターマシンの学習処理においてパターンマッチングを行った結果の例を示すグラフである。 FIG. 5 is a graph showing an example of the result of pattern matching performed in the learning process of the support vector machine.
サポートベクターマシンの学習処理において作成された類似グループの例を示す図表である。 FIG. 6 is a chart showing an example of the similar groups created in the learning process of the support vector machine.
サポートベクターマシンの学習データ生成処理を示すフローチャートである。 FIG. 7 is a flowchart showing the learning data generation process of the support vector machine.
サポートベクターマシンの学習データ生成処理において画像処理を行った結果の例を示す図である。 FIG. 8 is a diagram showing an example of the result of image processing performed in the learning data generation process of the support vector machine.
サポートベクターマシンの学習データ生成処理において作成された学習テーブルの例を示す図表である。 FIG. 9 is a chart showing an example of the learning table created in the learning data generation process of the support vector machine.
文字認識装置の文字認識処理を示すフローチャートである。 FIG. 10 is a flowchart showing the character recognition process of the character recognition device.
文字認識装置の文字認識処理に用いられた文字領域画像の例を示す図である。 FIG. 11 is a diagram showing an example of a character area image used in the character recognition process of the character recognition device.

以下、一実施形態を説明する。図１に示す文字認識装置１０は、車両のエンジンに燃料を供給する燃料ポンプの生産ラインに設置され、燃料ポンプの部品の表面に刻印された型番等の文字（本例では、アルファベット大文字）を認識する。 An embodiment is described below. The character recognition device 10 shown in FIG. 1 is installed on a production line for fuel pumps that supply fuel to a vehicle engine, and recognizes characters such as model numbers (uppercase alphabetic letters in this example) stamped on the surface of fuel pump components.

文字認識装置１０は、先ずパターンマッチングにより文字認識を行って候補文字を選択し、候補文字に類似する類似文字がある場合、類似文字との相違領域等から抽出したセルの輝度値をサポートベクターマシンの入力特徴量として、候補文字である可能性を表すプロバビリティ（確信度）を求める。 The character recognition device 10 first performs character recognition by pattern matching to select a candidate character. If there is a similar character resembling the candidate character, the device uses the luminance values of cells extracted from areas such as the difference area between the candidate and the similar character as input feature amounts of a support vector machine, and obtains a probability (confidence) representing the likelihood that the character is the candidate character.

文字認識装置１０は、撮像部１１と処理部１２とを備えている。撮像部１１は、被検査物であるワーク１を撮影して検査画像を取得する画像取得手段であり、取得した検査画像を処理部１２へ送信する。撮像部１１は、ワーク１を照明する照明光源を有してもよい。 The character recognition device 10 includes an imaging unit 11 and a processing unit 12. The imaging unit 11 is an image acquisition unit that acquires an inspection image by photographing the workpiece 1 as an inspection object, and transmits the acquired inspection image to the processing unit 12. The imaging unit 11 may have an illumination light source that illuminates the workpiece 1.

処理部１２は、撮像部１１が取得した検査画像に基づいてワーク１表面に刻印された文字を認識するとともに種々の制御を行う。処理部１２は、パーソナルコンピュータおよびその周辺機器で構成されている。 The processing unit 12 recognizes characters stamped on the surface of the workpiece 1 based on the inspection image acquired by the imaging unit 11 and performs various controls. The processing unit 12 includes a personal computer and its peripheral devices.

図２に示すように、処理部１２は、制御手段１２１、通信手段１２２、記憶手段１２３、パターンマッチング手段１２４、ＸＯＲ演算手段１２５、ＡＮＤ演算手段１２６、セル抽出手段１２７、平均輝度値算出手段１２８およびサポートベクターマシン１２９等を有している。 As shown in FIG. 2, the processing unit 12 includes a control unit 121, a communication unit 122, a storage unit 123, a pattern matching unit 124, an XOR operation unit 125, an AND operation unit 126, a cell extraction unit 127, and an average luminance value calculation unit 128. And a support vector machine 129 and the like.

制御手段１２１は、パーソナルコンピュータの中央演算装置（ＣＰＵ）と、その周辺回路などで構成され、ＣＰＵに読み込まれたプログラムにしたがって動作し、撮像部１１および処理部１２の各手段を制御する。 The control unit 121 includes a central processing unit (CPU) of a personal computer and its peripheral circuits, and operates according to a program read into the CPU, and controls each unit of the imaging unit 11 and the processing unit 12.

通信手段１２２は、処理部１２と、撮像部１１等の機器との間で制御信号、画像データおよびデータ信号を送受信する通信インタフェースであり、Ｉ／Ｏポートおよびそのドライバで構成される。 The communication unit 122 is a communication interface that transmits and receives control signals, image data, and data signals between the processing unit 12 and devices such as the imaging unit 11, and includes an I / O port and its driver.

処理部１２は、通信手段１２２を通じて撮像部１１から検査画像を受信する。制御手段１２１で生成された制御信号は、通信手段１２２を通じて撮像部１１へ送信される。処理部１２は、認識した文字の情報を、通信手段１２２を通じて外部の機器へ出力する。 The processing unit 12 receives the inspection image from the imaging unit 11 through the communication unit 122. The control signal generated by the control unit 121 is transmitted to the imaging unit 11 through the communication unit 122. The processing unit 12 outputs the recognized character information to an external device through the communication unit 122.

記憶手段１２３は、ランダムアクセスメモリ（ＲＡＭ）やリードオンリメモリ（ＲＯＭ）といった半導体メモリ、磁気ディスク、光ディスクなどの記憶媒体、および記憶媒体へのアクセス装置などで構成されており、処理部１２の制御を行うプログラムや種々のデータを記憶する。 The storage unit 123 includes a semiconductor memory such as a random access memory (RAM) and a read only memory (ROM), a storage medium such as a magnetic disk and an optical disk, an access device to the storage medium, and the like. A program for performing and various data are stored.

記憶手段１２３が記憶するデータとしては、例えば、ワーク１に刻印される可能性のある各文字（以下、認識対象文字という）に対応するテンプレート画像、サポートベクターマシン１２９の学習データ、および撮像部１１が撮影した検査画像などがある。 Data stored in the storage unit 123 includes, for example, a template image corresponding to each character that may be engraved on the work 1 (hereinafter referred to as a recognition target character), learning data of the support vector machine 129, and the imaging unit 11. There are inspection images taken by.

パターンマッチング手段１２４、ＸＯＲ演算手段１２５、ＡＮＤ演算手段１２６、セル抽出手段１２７、平均輝度値算出手段１２８およびサポートベクターマシン１２９は、例えばＣＰＵ上で実行されるプログラムにより実装される機能モジュールである。これらの手段１２４〜１２９は、ＣＰＵとは別個の画像処理用プロセッサを備える専用処理ボードとして実装されてもよい。 The pattern matching unit 124, the XOR operation unit 125, the AND operation unit 126, the cell extraction unit 127, the average luminance value calculation unit 128, and the support vector machine 129 are functional modules implemented by programs executed on the CPU, for example. These means 124 to 129 may be mounted as a dedicated processing board including an image processing processor separate from the CPU.

パターンマッチング手段１２４は、入力画像を記憶手段１２３から読み出したテンプレート画像と比較してパターンマッチングを行って、入力画像に含まれる文字を認識する。具体的には、パターンマッチング手段１２４は、入力画像と各テンプレート画像との一致度を表すスコアを求め、そのスコアが最大となるテンプレート画像を決定する。本例では、スコアを下記の数式１に示す正規化相関係数で求める。 The pattern matching unit 124 compares the input image with the template image read from the storage unit 123 and performs pattern matching to recognize characters included in the input image. Specifically, the pattern matching unit 124 obtains a score representing the degree of coincidence between the input image and each template image, and determines a template image that maximizes the score. In this example, the score is obtained by the normalized correlation coefficient shown in the following formula 1.

ここでＲはスコア（一致度）であり、Ｉは入力画像の輝度値であり、Ｔはテンプレート画像の輝度値であり、ｗは画像の幅であり、ｈは画像の高さである。 Here, R is the score (degree of coincidence), I is the luminance value of the input image, T is the luminance value of the template image, w is the width of the image, and h is the height of the image.

入力画像に含まれる文字とテンプレート画像に含まれる文字とが完全に一致する場合、スコアＲ＝１となり、入力画像に含まれる文字とテンプレート画像に含まれる文字とに全く相関が無い場合、スコアＲ＝０となる。 When the character included in the input image and the character included in the template image completely match, the score R = 1, and when there is no correlation between the character included in the input image and the character included in the template image, the score R = 0.
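Formula 1 itself is not reproduced in this text, so the following is a sketch of one common normalized correlation coefficient, written with the symbols defined above (I, T, w, h). It assumes the zero-mean variant, which yields R = 1 for identical images and R = 0 for uncorrelated ones; whether the patent uses exactly this variant is an assumption, and the function name `ncc_score` is hypothetical.

```python
import math


def ncc_score(image, template):
    """Zero-mean normalized correlation between two equal-sized grayscale images.

    Both arguments are lists of h rows, each with w luminance values.
    """
    h, w = len(image), len(image[0])
    if len(template) != h or any(len(row) != w for row in template):
        raise ValueError("image and template must have the same size")

    i_vals = [v for row in image for v in row]
    t_vals = [v for row in template for v in row]
    i_mean = sum(i_vals) / (w * h)
    t_mean = sum(t_vals) / (w * h)

    numerator = sum((i - i_mean) * (t - t_mean) for i, t in zip(i_vals, t_vals))
    denominator = math.sqrt(
        sum((i - i_mean) ** 2 for i in i_vals) * sum((t - t_mean) ** 2 for t in t_vals)
    )
    # A flat image has zero variance; treat it as "no correlation".
    return numerator / denominator if denominator else 0.0
```

Note that the zero-mean variant can also return negative values for inverted images, a case the description does not discuss.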

ＸＯＲ演算手段１２５は、２つの画像の排他的論理和（ＸＯＲ）を求める。ＡＮＤ演算手段１２６は、２つの画像の論理積（ＡＮＤ）を求める。セル抽出手段１２７は、画像中の所定領域から所定の大きさのセルを抽出する。平均輝度値算出手段１２８は、セル抽出手段１２７が抽出した各セルの平均輝度値を算出する。ＸＯＲ演算手段１２５、ＡＮＤ演算手段１２６、セル抽出手段１２７および平均輝度値算出手段１２８の詳細については後述する。 The XOR operation means 125 obtains an exclusive OR (XOR) of the two images. The AND operation means 126 calculates a logical product (AND) of the two images. The cell extracting unit 127 extracts a cell having a predetermined size from a predetermined region in the image. The average luminance value calculating unit 128 calculates the average luminance value of each cell extracted by the cell extracting unit 127. Details of the XOR operation means 125, the AND operation means 126, the cell extraction means 127, and the average luminance value calculation means 128 will be described later.

サポートベクターマシン１２９は、平均輝度値算出手段１２８が算出した各セルの平均輝度値を特徴量として、画像中に写っている文字を識別する。図３に、サポートベクターマシン１２９の概念図を示す。 The support vector machine 129 identifies characters appearing in the image using the average luminance value of each cell calculated by the average luminance value calculating unit 128 as a feature amount. FIG. 3 shows a conceptual diagram of the support vector machine 129.

サポートベクターマシン１２９は、所定の識別対象物が、複数のカテゴリの何れかに属する場合、その識別対象物から求めた特徴量に基づいて、その識別対象物を何れのカテゴリに属するかを判定する識別器である。 The support vector machine 129 is a classifier that, when a given identification target belongs to one of a plurality of categories, determines which category it belongs to based on feature amounts obtained from that target.

カテゴリ間の境界は、各カテゴリに属する学習データの特徴量のうち、隣接するカテゴリに属する学習データの特徴量との距離が最も近いものの組で表される。このカテゴリ間の境界を表す特徴量は、サポートベクトルと呼ばれる。 The boundary between categories is represented by the set of those feature amounts of the learning data in each category that lie closest to the feature amounts of the learning data in the adjacent category. The feature amounts that represent this boundary are called support vectors.

図３の例では、丸印で示された各点が、カテゴリＣ１に属する特徴量であり、このうち特徴量２０１〜２０３が、カテゴリＣ１のサポートベクトルである。また、菱形で示された各点が、カテゴリＣ２に属する特徴量であり、このうち特徴量２０４〜２０６が、カテゴリＣ２のサポートベクトルである。 In the example of FIG. 3, each point indicated by a circle is a feature quantity belonging to the category C1, and among these, the feature quantities 201 to 203 are support vectors of the category C1. Further, each point indicated by a rhombus is a feature quantity belonging to the category C2, and among these, the feature quantities 204 to 206 are support vectors of the category C2.

サポートベクターマシン１２９では、識別精度を向上するために、カテゴリＣ１のサポートベクトルと、カテゴリＣ２のサポートベクトル間の距離（マージン）が最大化されるように、サポートベクトルが決定される。 In the support vector machine 129, in order to improve the identification accuracy, the support vector is determined so that the distance (margin) between the support vector of the category C1 and the support vector of the category C2 is maximized.

サポートベクターマシン１２９では、カテゴリ間の境界が非線形な場合でも、カーネル関数を利用して、学習データの特徴量を高次元に写像した上でサポートベクトルを決定することにより、各カテゴリに属する特徴量を線形分離可能とすることで、良好な識別性能を得ることができる。 Even when the boundary between categories is non-linear, the support vector machine 129 can obtain good classification performance: a kernel function maps the feature amounts of the learning data into a higher-dimensional space before the support vectors are determined, which makes the feature amounts of each category linearly separable.
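The idea of mapping to a higher dimension can be illustrated with a toy example. The sketch below uses an explicit feature map x → (x, x²) instead of a kernel function, purely to show why the mapping helps; the data, the threshold, and all names are made up for illustration and are not taken from the patent.

```python
def feature_map(x):
    """Map a 1-D feature to 2-D: (x, x squared)."""
    return (x, x * x)


def separable_by_threshold(values_a, values_b):
    """True if a single threshold on one coordinate separates the two sets."""
    return max(values_a) < min(values_b) or max(values_b) < min(values_a)


# Category C1 sits near zero; category C2 lies on both sides of it.
c1 = [-0.5, 0.0, 0.4]
c2 = [-2.0, -1.5, 1.6, 2.2]

# In the original 1-D space no single threshold separates C1 from C2.
separable_before = separable_by_threshold(c1, c2)

# After the mapping, the second coordinate alone separates them.
c1_sq = [feature_map(x)[1] for x in c1]
c2_sq = [feature_map(x)[1] for x in c2]
separable_after = separable_by_threshold(c1_sq, c2_sq)
```

A kernel function achieves the same effect without computing the mapped coordinates explicitly, which is what makes it practical for high-dimensional mappings.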

本実施形態では、互いに類似する複数の認識対象文字の相違領域等から抽出したセルの平均輝度値を学習データの特徴量としてサポートベクターマシン１２９を予め学習させた。サポートベクターマシン１２９の学習に用いる学習データの生成方法については後述する。 In the present embodiment, the support vector machine 129 is trained in advance using the average luminance value of cells extracted from different regions of a plurality of recognition target characters that are similar to each other as the feature amount of the learning data. A method of generating learning data used for learning of the support vector machine 129 will be described later.

サポートベクターマシン１２９は、複数の画像の相違領域等から抽出したセルの平均輝度値を受け取ると、それを入力特徴量とすることにより、特定の認識対象文字である確信度を表すプロバビリティを求める。 On receiving the average luminance values of cells extracted from areas such as the difference area between a plurality of images, the support vector machine 129 uses them as input feature amounts and obtains a probability representing the confidence that the character is a specific recognition target character.

処理部１２は、パターンマッチング手段１２４またはサポートベクターマシン１２９による文字認識結果を、ディスプレイに表示してユーザに報知したり通信手段１２２を介して通信可能に接続された他の機器へ出力したりする。 The processing unit 12 displays the character recognition result by the pattern matching unit 124 or the support vector machine 129 on a display and notifies the user, or outputs the result to other devices connected to be communicable via the communication unit 122. .

次に、サポートベクターマシン１２９の学習方法を説明する。サポートベクターマシン１２９の学習は、処理部１２が図４のフローチャートに示す処理を実行することによって行われる。 Next, a learning method of the support vector machine 129 will be described. Learning of the support vector machine 129 is performed by the processing unit 12 executing the processing shown in the flowchart of FIG.

まずステップＳ１００では、学習に用いる入力画像を生成する。具体的には、認識対象文字に対応するモデル画像の大きさなどを自動調整する。本例では、モデル画像は予め記憶手段１２３に記憶されている。 First, in step S100, an input image used for learning is generated. Specifically, the size of the model image corresponding to the recognition target character is automatically adjusted. In this example, the model image is stored in the storage unit 123 in advance.

続くステップＳ１１０では、類似文字のグルーピングを行う。具体的には、パターンマッチング手段１２４が、各テンプレート画像相互間で、上述の数式１に示す正規化相関係数を用いてパターンマッチングを行う。そして、パターンマッチングのスコアＲが閾値（本例では０．７）を超えた文字を類似文字としてグルーピングする。 In subsequent step S110, similar characters are grouped. Specifically, the pattern matching unit 124 performs pattern matching between the template images using the normalized correlation coefficient expressed by the above-described Equation 1. Then, characters whose pattern matching score R exceeds a threshold value (0.7 in this example) are grouped as similar characters.

図５は、認識対象文字「Ｃ」についてパターンマッチングを行った結果の例を示している。この場合、認識対象文字「Ｄ」、「Ｇ」、「Ｏ」、「Ｑ」のスコアが０．７を超えるため、認識対象文字「Ｃ」と、類似文字「Ｃ」、「Ｄ」、「Ｇ」、「Ｏ」、「Ｑ」との組合せでグルーピングする。 FIG. 5 shows an example of the result of pattern matching for the recognition target character “C”. In this case, since the scores of the recognition target characters “D”, “G”, “O”, and “Q” exceed 0.7, the recognition target character “C” and the similar characters “C”, “D”, “ Group by a combination of “G”, “O”, and “Q”.

本例では、２種類の文字の組合せでグルーピングして類似グループを作成する。具体的には、「Ｃ、Ｄ」、「Ｃ、Ｇ」、「Ｃ、Ｏ」および「Ｃ、Ｑ」の４つの類似グループを作成する。 In this example, a similar group is created by grouping with a combination of two types of characters. Specifically, four similar groups “C, D”, “C, G”, “C, O”, and “C, Q” are created.

このようなグルーピングを「Ｃ」以外の認識対象文字についても行う。本例では、学習対象文字がアルファベット大文字であるので、例えば図６に示すような類似グループが作成される。作成された類似グループは、記憶手段１２３に書き込まれて登録される。 Such grouping is also performed on recognition target characters other than “C”. In this example, since the learning target character is an uppercase alphabet, for example, a similar group as shown in FIG. 6 is created. The created similar group is written and registered in the storage unit 123.
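The grouping in step S110 can be sketched as follows, assuming the pairwise pattern matching scores have already been computed. The score values are made up (FIG. 5 is not reproduced here), and the names `similar_groups` and `partners_of` are hypothetical.

```python
def similar_groups(pair_scores, threshold=0.7):
    """Return the two-character groups whose score R exceeds the threshold.

    pair_scores maps a pair of characters, e.g. ("C", "D"), to the score R
    obtained by matching their template images against each other.
    """
    return sorted(tuple(sorted(pair)) for pair, r in pair_scores.items() if r > threshold)


def partners_of(char, groups):
    """List the characters registered as similar to `char`."""
    return [b if a == char else a for a, b in groups if char in (a, b)]


# Illustrative scores for the recognition target character "C".
scores = {("C", "D"): 0.75, ("C", "G"): 0.82, ("C", "O"): 0.80,
          ("C", "Q"): 0.74, ("C", "X"): 0.20}
groups = similar_groups(scores)
```

With these values the result is the four groups named in the text: (C, D), (C, G), (C, O), and (C, Q).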

続くステップＳ１２０では、サポートベクターマシン１２９の学習に用いる学習データを生成する。したがって、ステップＳ１２０は学習データ生成手段を構成している。ステップＳ１２０の詳細を図７に示す。 In subsequent step S120, learning data used for learning of the support vector machine 129 is generated. Therefore, step S120 constitutes learning data generation means. Details of step S120 are shown in FIG.

まずステップＳ１２１０では、ステップＳ１１０で作成した類似グループの各認識対象文字に対応する各テンプレート画像を記憶手段１２３から読み出して入力する。 First, in step S1210, each template image corresponding to each recognition target character of the similar group created in step S110 is read from the storage unit 123 and input.

続くステップＳ１２２０では、ステップＳ１２１０で入力された各テンプレート画像について、文字部分の相違領域を抽出し、抽出した文字部分の相違領域を多数個のセルに細分化する。続くステップＳ１２３０では、ステップＳ１２２０で細分化した多数個のセルについて学習データを生成する。 In the following step S1220, the difference area between the character portions of the template images input in step S1210 is extracted and subdivided into a large number of cells. In the following step S1230, learning data is generated for the cells obtained in step S1220.

図８（ａ）は、「Ｃ、Ｄ」の類似グループについてステップＳ１２２０、Ｓ１２３０を実行した例を示し、図８（ｂ）は、「Ｃ、Ｏ」の類似グループについてステップＳ１２２０、Ｓ１２３０を実行した例を示している。 FIG. 8A shows an example in which steps S1220 and S1230 are executed for similar groups “C, D”, and FIG. 8B shows that steps S1220 and S1230 are executed for similar groups “C, O”. An example is shown.

ステップＳ１２２０では、ＸＯＲ演算手段１２５が類似グループの各認識対象文字に対応する各テンプレート画像の各文字部分に対して排他的論理和（ＸＯＲ）を求めることによって文字部分の相違領域Ａ１（図中の白色の領域）を抽出し、セル抽出手段１２７が相違領域Ａ１から第１セルＢ１を抽出することによって相違領域Ａ１を多数個の第１セルＢ１に細分化する。 In step S1220, the XOR operation means 125 obtains an exclusive logical sum (XOR) for each character portion of each template image corresponding to each recognition target character in the similar group, whereby the character portion difference area A1 (in the figure). White area) is extracted, and the cell extraction means 127 extracts the first cell B1 from the different area A1, thereby subdividing the different area A1 into a number of first cells B1.

図８（ａ）の例では、「Ｃ」、「Ｄ」のテンプレート画像の各文字部分に対して排他的論理和（ＸＯＲ）を求めることによって文字部分の相違領域Ａ１を抽出し、図８（ｂ）の例では、「Ｃ」、「Ｏ」のテンプレート画像の各文字部分に対して排他的論理和（ＸＯＲ）を求めることによって文字部分の相違領域Ａ１を抽出した。 In the example of FIG. 8A, a character area difference area A1 is extracted by obtaining an exclusive OR (XOR) for each character part of the template images “C” and “D”. In the example of b), the character part difference area A1 is extracted by obtaining an exclusive OR (XOR) for each character part of the template images “C” and “O”.

本例では、相違領域Ａ１から四角形の第１セルＢ１を抽出している。相違領域Ａ１から第１セルＢ１を抽出する方法としては、例えば、第１セルＢ１に対応した四角形のパターンで相違領域をサーチングすることによって、相違領域を多数個の第１セルＢ１に細分化することができる。 In this example, a rectangular first cell B1 is extracted from the different area A1. As a method of extracting the first cell B1 from the different area A1, for example, the different area is subdivided into a number of first cells B1 by searching the different area with a rectangular pattern corresponding to the first cell B1. can do.
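Step S1220 can be sketched on small binary masks, where 1 marks a character pixel. The sketch assumes that "searching with a rectangular pattern" means scanning a non-overlapping square window and keeping the windows that lie entirely inside the difference area; the patent does not spell out the scan, so this is only one possible reading. The masks and function names are made up.

```python
def xor_mask(a, b):
    """Pixel-wise exclusive OR of two equal-sized binary masks."""
    return [[pa ^ pb for pa, pb in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


def extract_cells(mask, size):
    """Return the (row, col) top-left corners of size x size cells fully inside the mask."""
    cells = []
    for r in range(0, len(mask) - size + 1, size):
        for c in range(0, len(mask[0]) - size + 1, size):
            if all(mask[r + dr][c + dc] for dr in range(size) for dc in range(size)):
                cells.append((r, c))
    return cells


# Toy 4x4 character masks standing in for two template images.
mask_c = [[1, 1, 0, 0],
          [1, 1, 0, 0],
          [1, 1, 0, 0],
          [1, 1, 0, 0]]
mask_d = [[1, 1, 1, 1],
          [1, 1, 1, 1],
          [1, 1, 0, 0],
          [1, 1, 0, 0]]

difference_area = xor_mask(mask_c, mask_d)       # area A1
first_cells = extract_cells(difference_area, 2)  # cells B1
```

A difference area smaller than the window yields no cells at all, which matches the remark about FIG. 8(d) later in the description.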

ステップＳ１２３０では、平均輝度値算出手段１２８が各第１セルＢ１の平均輝度値を算出し、その平均輝度値を学習データとする。本例では、各第１セルＢ１の画像（グレー画像）の平均輝度値を、黒を０、白を１として正規化して学習データとする。 In step S1230, the average luminance value calculation unit 128 calculates the average luminance value of each first cell B1, and uses the average luminance value as learning data. In this example, the average luminance value of the image (gray image) of each first cell B1 is normalized with black being 0 and white being 1 to obtain learning data.
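The normalized average luminance of a cell can be computed as below. The sketch assumes 8-bit grayscale input (0 = black, 255 = white); the bit depth and the function name `mean_luminance` are assumptions, not details given in the description.

```python
def mean_luminance(gray, top, left, size, max_value=255):
    """Average luminance of a size x size cell, normalized so black = 0 and white = 1."""
    total = sum(gray[r][c]
                for r in range(top, top + size)
                for c in range(left, left + size))
    return total / (size * size * max_value)


# A 2x2 cell that is half black and half white has a normalized mean of 0.5.
cell_value = mean_luminance([[0, 255], [255, 0]], top=0, left=0, size=2)
```

Each such value becomes one feature amount in the learning table, one per cell.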

続くステップＳ１２４０では、ステップＳ１２１０で入力された各テンプレート画像について、背景部分（文字以外の部分）の相違領域を抽出し、抽出した背景部分の相違領域を多数個のセルに細分化する。続くステップＳ１２５０では、ステップＳ１２４０で細分化した多数個のセルについて学習データを作成する。 In subsequent step S1240, a different area of the background part (part other than the character) is extracted from each template image input in step S1210, and the extracted different area of the background part is subdivided into a number of cells. In subsequent step S1250, learning data is created for a large number of cells subdivided in step S1240.

図８（ｃ）は、「Ｃ、Ｄ」の類似グループについてステップＳ１２４０、Ｓ１２５０を実行した例を示し、図８（ｄ）は、「Ｃ、Ｏ」の類似グループについてステップＳ１２４０、Ｓ１２５０を実行した例を示している。 FIG. 8C shows an example in which steps S1240 and S1250 are executed for the similar group “C, D”, and FIG. 8D shows that steps S1240 and S1250 are executed for the similar group “C, O”. An example is shown.

ステップＳ１２４０では、ＸＯＲ演算手段１２５が類似グループの各認識対象文字に対応する各テンプレート画像の各背景部分に対して排他的論理和（ＸＯＲ）を求めることによって背景部分の相違領域Ａ２（図中の白色の領域）を抽出し、セル抽出手段１２７が相違領域Ａ２から第２セルＢ２を抽出することによって相違領域Ａ２を多数個の第２セルＢ２に細分化する。 In step S1240, the XOR operation means 125 obtains an exclusive OR (XOR) for each background portion of each template image corresponding to each recognition target character in the similar group, thereby differentiating the background portion difference area A2 (in the figure). White area) is extracted, and the cell extracting means 127 extracts the second cell B2 from the different area A2, thereby subdividing the different area A2 into a plurality of second cells B2.

図８（ｃ）の例では、「Ｃ」、「Ｄ」のテンプレート画像の各背景部分に対して排他的論理和（ＸＯＲ）を求めることによって背景部分の相違領域Ａ２を抽出し、図８（ｄ）の例では、「Ｃ」、「Ｏ」のテンプレート画像の各背景部分に対して排他的論理和（ＸＯＲ）を求めることによって背景部分の相違領域Ａ２を抽出した。 In the example of FIG. 8C, the background region difference area A2 is extracted by obtaining an exclusive OR (XOR) for each background portion of the template images “C” and “D”. In the example of d), the background portion difference area A2 is extracted by obtaining an exclusive OR (XOR) for each background portion of the template images “C” and “O”.

本例では、ステップＳ１２２０と同様に、相違領域Ａ２から四角形の第２セルＢ２を抽出している。なお、図８（ｄ）の例では、相違領域Ａ２の大きさが小さいため、第２セルＢ２が１つも抽出されていない。 In this example, a rectangular second cell B2 is extracted from the different area A2 as in step S1220. In the example of FIG. 8D, since the size of the different area A2 is small, no second cell B2 is extracted.

ステップＳ１２５０では、ステップＳ１２３０と同様に、平均輝度値算出手段１２８が各第２セルＢ２の平均輝度値を算出し、その平均輝度値を学習データとする。 In step S1250, as in step S1230, the average luminance value calculation unit 128 calculates the average luminance value of each second cell B2, and uses the average luminance value as learning data.

続くステップＳ１２６０では、ステップＳ１２１０で入力された各テンプレート画像について、文字部分の共通領域を抽出し、抽出した文字部分の共通領域を多数個のセルに細分化する。続くステップＳ１２７０では、ステップＳ１２６０で細分化した多数個のセルについて学習データを作成する。 In subsequent step S1260, a common area of the character part is extracted from each template image input in step S1210, and the extracted common area of the character part is subdivided into a number of cells. In the subsequent step S1270, learning data is created for a large number of cells subdivided in step S1260.

図８（ｅ）は、「Ｃ、Ｄ」の類似グループについてステップＳ１２６０、Ｓ１２７０を実行した例を示し、図８（ｆ）は、「Ｃ、Ｏ」の類似グループについてステップＳ１２６０、Ｓ１２７０を実行した例を示している。 FIG. 8E shows an example in which steps S1260 and S1270 are executed for the similar group “C, D”, and FIG. 8F shows that steps S1260 and S1270 are executed for the similar group “C, O”. An example is shown.

ステップＳ１２６０では、ＡＮＤ演算手段１２６が類似グループの各認識対象文字に対応する各テンプレート画像の各文字部分に対して論理積（ＡＮＤ）を求めることによって文字部分の共通領域Ａ３（図中の白色の領域）を抽出し、セル抽出手段１２７が共通領域Ａ３から第３セルＢ３を抽出することによって共通領域Ａ３を多数個の第３セルＢ３に細分化する。 In step S1260, the AND operation means 126 obtains the logical product (AND) of the character portions of the template images corresponding to the recognition target characters of the similar group, thereby extracting the common area A3 of the character portions (the white area in the figure), and the cell extraction means 127 extracts third cells B3 from the common area A3, thereby subdividing the common area A3 into a large number of third cells B3.

図８（ｅ）の例では、「Ｃ」、「Ｄ」のテンプレート画像の各文字部分に対して論理積（ＡＮＤ）を求めることによって文字部分の共通領域Ａ３を抽出し、図８（ｆ）の例では、「Ｃ」、「Ｏ」のテンプレート画像の各文字部分に対して論理積（ＡＮＤ）を求めることによって文字部分の共通領域Ａ３を抽出した。 In the example of FIG. 8E, the common area A3 of the character part is extracted by obtaining a logical product (AND) for each character part of the template images “C” and “D”, and FIG. In the example, the common area A3 of the character part is extracted by obtaining a logical product (AND) for each character part of the template images “C” and “O”.
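The common area is the pixel-wise counterpart of the difference area, with AND in place of XOR. A minimal sketch on toy binary masks (1 = character pixel); the masks and the name `and_mask` are illustrative assumptions.

```python
def and_mask(a, b):
    """Pixel-wise logical AND of two equal-sized binary masks."""
    return [[pa & pb for pa, pb in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


# Toy 3x3 character masks standing in for two template images.
mask_c = [[1, 1, 1],
          [1, 0, 0],
          [1, 1, 1]]
mask_o = [[1, 1, 1],
          [1, 0, 1],
          [1, 1, 1]]

common_area = and_mask(mask_c, mask_o)  # area A3: strokes both characters share
```

In this toy case the common area equals the "C" mask, because every stroke of "C" also appears in "O".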

本例では、ステップＳ１２２０、Ｓ１２４０と同様に、共通領域Ａ３から四角形の第３セルＢ３を抽出している。 In this example, a rectangular third cell B3 is extracted from the common area A3, as in steps S1220 and S1240.

ステップＳ１２７０では、ステップＳ１２３０、Ｓ１２５０と同様に、平均輝度値算出手段１２８が各第３セルＢ３の平均輝度値を算出し、その平均輝度値を学習データとする。 In step S1270, as in steps S1230 and S1250, the average luminance value calculation unit 128 calculates the average luminance value of each third cell B3, and uses the average luminance value as learning data.

続くステップＳ１２８０では、ステップＳ１２３０、Ｓ１２５０、Ｓ１２７０で作成した学習データ、すなわち各第１、第２、第３セルＢ１、Ｂ２、Ｂ３の平均輝度値データを併合してＳＶＭ学習テーブルを作成する。図９は、「Ｃ、Ｏ」の類似グループについて作成した学習テーブルの例を示している。 In subsequent step S1280, the learning data created in steps S1230, S1250, and S1270, that is, the average luminance value data of the first, second, and third cells B1, B2, and B3 are merged to create an SVM learning table. FIG. 9 shows an example of a learning table created for a similar group of “C, O”.

ステップＳ１２８０では、第１、第２、第３セルＢ１、Ｂ２、Ｂ３の個数を調整して重み付けを行う。具体的には、文字部分および背景部分の相違領域Ａ１、Ａ２の第１、第２セルＢ１、Ｂ２の合計個数が、文字部分の共通領域の第３セルＢ３の個数よりも多くなるように、第１、第２、第３セルＢ１、Ｂ２、Ｂ３を適宜間引きする。 In step S1280, weighting is performed by adjusting the number of first, second, and third cells B1, B2, and B3. Specifically, the total number of the first and second cells B1 and B2 in the different areas A1 and A2 of the character part and the background part is larger than the number of the third cells B3 in the common area of the character part. The first, second, and third cells B1, B2, and B3 are thinned out as appropriate.

より具体的には、学習テーブル上のセルの全個数（図９の例では１００個）に対して、文字部分および背景部分の相違領域Ａ１、Ａ２の第１、第２セルＢ１、Ｂ２の合計個数を７０％以上、文字部分の共通領域の第３セルＢ３の個数を３０％未満とするのが好ましい。 More specifically, with respect to the total number of cells on the learning table (100 in the example of FIG. 9), the sum of the first and second cells B1 and B2 of the different areas A1 and A2 of the character part and the background part. The number is preferably 70% or more, and the number of the third cells B3 in the common area of the character part is preferably less than 30%.

このとき、間引き後の各第１、第２、第３セルＢ１、Ｂ２、Ｂ３が各領域Ａ１、Ａ２、Ａ３において極力均等に位置するように第１、第２、第３セルＢ１、Ｂ２、Ｂ３を間引きするのが好ましい。このような第１、第２、第３セルＢ１、Ｂ２、Ｂ３の間引きについての理解を容易にするために、図８（ｅ）、（ｆ）では、間引きされて少なくなった第３セルＢ３が文字部分の共通領域Ａ３に略均等に位置している様子を模式的に示している。 At this time, it is preferable to thin out the first, second, and third cells B1, B2, and B3 so that the remaining cells are distributed as evenly as possible within the respective areas A1, A2, and A3. To make this thinning easier to understand, FIGS. 8(e) and 8(f) schematically show the third cells B3, reduced in number by thinning, positioned substantially evenly in the common area A3 of the character portion.
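The weighting by thinning can be sketched as follows. For simplicity the sketch keeps every difference-area cell (B1 and B2) and thins only the common-area cells (B3) until they make up less than 30% of the table, picking the survivors at an even stride; the description allows thinning all three kinds of cell, so this is a narrower assumption. The function names and the use of list order as a stand-in for spatial order are also assumptions.

```python
def thin_evenly(cells, keep):
    """Keep `keep` cells at an even stride through the list."""
    if keep <= 0:
        return []
    if keep >= len(cells):
        return list(cells)
    return [cells[i * len(cells) // keep] for i in range(keep)]


def balance(diff_cells, common_cells, max_common_percent=30):
    """Thin the common cells so they are strictly below the given share of all cells."""
    d = len(diff_cells)
    # Largest k with k / (d + k) < p / 100, computed with integers only.
    max_common = max(0, (max_common_percent * d - 1) // (100 - max_common_percent))
    return list(diff_cells), thin_evenly(common_cells, max_common)


diff = [("B1/B2", i) for i in range(70)]
common = [("B3", i) for i in range(60)]
kept_diff, kept_common = balance(diff, common)
```

With 70 difference cells, 29 of the 60 common cells survive, so the common share is 29/99, just under the 30% limit.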

本例では、サポートベクターマシン１２９の学習対象としてのモデル画像として、認識対象文字に対応するテンプレート画像と同じ画像の他、文字の周囲に汚れがあったり文字の一部が欠けていたりする不鮮明な画像も複数個含め、これらの不鮮明なモデル画像から抽出した学習データもデフォルト設定に含めている。図９では、不鮮明なモデル画像から抽出した学習データを太枠で囲んで示している。このような不鮮明なモデル画像から抽出した学習データも利用することにより、サポートベクターマシン１２９のロバスト性を向上することができる。 In this example, as a model image as a learning target of the support vector machine 129, in addition to the same image as the template image corresponding to the recognition target character, the character is smeared or a part of the character is missing. A plurality of images are included, and learning data extracted from these unclear model images is also included in the default setting. In FIG. 9, the learning data extracted from the unclear model image is shown surrounded by a thick frame. The robustness of the support vector machine 129 can be improved by using learning data extracted from such a blurred model image.

なお、図９では図示を省略しているが、本例では、学習テーブルに各セルＢ１、Ｂ２、Ｂ３の位置情報（重心に対する相対位置）も含めている。 Although not shown in FIG. 9, in this example, position information (relative position with respect to the center of gravity) of each cell B1, B2, B3 is also included in the learning table.

ステップＳ１２９０では、ステップＳ１２８０で併合した学習データ（学習テーブル）を記憶手段１２３に出力して書き込む。 In step S1290, the learning data (learning table) merged in step S1280 is output and written to the storage means 123.

ステップＳ１３０では、ＳＶＭ学習（サポートベクターマシン学習）を行う。具体的には、ステップＳ１２０で作成した学習データ（学習テーブル）をサポートベクターマシン１２９に入れ込む。以上により、サポートベクターマシン１２９の学習処理を終了する。 In step S130, SVM learning (support vector machine learning) is performed. Specifically, the learning data (learning table) created in step S120 is fed into the support vector machine 129. This completes the learning process of the support vector machine 129.

次に、文字認識装置１０を用いた文字認識方法を説明する。文字認識装置１０を用いた文字認識は、サポートベクターマシン１２９の学習処理を終了した後に処理部１２が図１０のフローチャートに示す処理を実行することによって行われる。 Next, a character recognition method using the character recognition device 10 will be described. Character recognition using the character recognition device 10 is performed by the processing unit 12 executing the processing shown in the flowchart of FIG. 10 after the learning processing of the support vector machine 129 is completed.

まずステップＳ２００では、撮像部１１によって撮影された検査画像を入力する。続くステップＳ２１０では、ステップＳ２００で入力された検査画像から文字が写っている領域の画像（以下、文字領域画像という。）を切り出す。 First, in step S200, an inspection image photographed by the imaging unit 11 is input. In subsequent step S210, an image of an area in which characters are shown (hereinafter referred to as a character area image) is cut out from the inspection image input in step S200.

続くステップＳ２２０では、ステップＳ２１０で切り出された文字領域画像と、各認識対象文字に対応するテンプレート画像との間でパターンマッチングを行う。具体的には、パターンマッチング手段１２４が、ステップＳ２１０で切り出された文字領域画像と、各認識対象文字に対応するテンプレート画像との間で、上述の数式１に示す正規化相関係数を用いてパターンマッチングを行う。 In subsequent step S220, pattern matching is performed between the character region image cut out in step S210 and the template image corresponding to each recognition target character. Specifically, the pattern matching unit 124 uses the normalized correlation coefficient expressed by the above-described Equation 1 between the character region image cut out in step S210 and the template image corresponding to each recognition target character. Perform pattern matching.

In the subsequent step S230, the similar groups registered in the storage means 123 are searched for the character that obtained the highest score in the pattern matching of step S220 (hereinafter referred to as the highest score character).

In step S240, it is determined, based on the search result of step S230, whether a similar group is registered. If it is determined that a similar group is registered, the process proceeds to step S250, where SVM discrimination (support vector machine discrimination) is performed.

Specifically, processing similar to steps S1220 to S1270 described above is performed on the character region image cut out in step S210 and the template image of the similar character belonging to the similar group found in step S230. The difference area of the character portions, the difference area of the background portions, and the common area of the character portions of the two images are extracted, a large number of cells are extracted from each of these areas, and the average luminance value of each cell is obtained. The average luminance value of each cell is then used as an input feature amount of the support vector machine to obtain the probability for the highest score character of the pattern matching in step S220.
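
The feature extraction described above can be sketched as follows. This is a hedged example, not the procedure of steps S1220 to S1270 itself: it assumes that the three areas (A1, A2, A3) are already available as binary masks, that cells are non-overlapping squares taken on a regular grid, and that only cells lying entirely inside an area are used. The function names and the default cell size are hypothetical.

```python
def cell_mean_luminances(image, region_mask, cell_size=2):
    """Mean luminance of each cell_size x cell_size cell lying fully inside region_mask.

    image: list of rows of luminance values.
    region_mask: list of rows of truthy/falsy flags, same size as image.
    Assumption: cells tile the image on a regular grid starting at the top-left.
    """
    height, width = len(image), len(image[0])
    features = []
    for top in range(0, height - cell_size + 1, cell_size):
        for left in range(0, width - cell_size + 1, cell_size):
            coords = [(y, x)
                      for y in range(top, top + cell_size)
                      for x in range(left, left + cell_size)]
            if all(region_mask[y][x] for y, x in coords):
                features.append(sum(image[y][x] for y, x in coords) / len(coords))
    return features


def feature_vector(image, masks, cell_size=2):
    """Concatenate the cell features of the given area masks (e.g. A1, A2, A3) in order."""
    vector = []
    for mask in masks:
        vector.extend(cell_mean_luminances(image, mask, cell_size))
    return vector
```

The resulting list would serve as the input feature amounts of the support vector machine; how many cells are taken from each area is left to the caller in this sketch.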

On the other hand, if it is determined in step S240 that no similar group is registered, the process proceeds to step S260, where the highest score character of the pattern matching in step S220 is selected as the first candidate character.

In step S270, which follows steps S250 and S260, it is determined whether the pattern matching score of the first candidate character selected in step S260, or the probability obtained by the SVM discrimination of step S250, is greater than or equal to a threshold value.

If the value is determined to be greater than or equal to the threshold, the process proceeds to step S280, and the highest score character (first candidate character) of the pattern matching in step S220 is output as the recognized character to an output target device such as a display.

On the other hand, if the value is determined to be less than the threshold, the process proceeds to step S290, and the fact that the character could not be identified (recognition NG) is output to an output target device such as a display.

The threshold used in step S270 is set as appropriate according to the recognition accuracy required of the character recognition device 10. In this example, the threshold is set to 0.7.
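
The branching of steps S230 to S290 can be summarized in the following minimal sketch. The function and parameter names are hypothetical, and the SVM is represented by a caller-supplied function. One point is an assumption: when several similar groups are registered for the highest score character, the sketch requires the probability of every group to reach the threshold.

```python
THRESHOLD = 0.7  # threshold value used in this example for step S270


def recognize(best_char, match_score, similar_groups, svm_probability,
              threshold=THRESHOLD):
    """Decision flow of steps S230 to S290.

    best_char, match_score: result of the pattern matching in step S220.
    similar_groups: dict mapping a character to its registered similar groups.
    svm_probability: callable (best_char, group) -> probability for best_char.
    Returns the recognized character, or None for "recognition NG".
    """
    groups = similar_groups.get(best_char, [])        # S230: search similar groups
    if groups:                                        # S240: a group is registered
        # S250: SVM discrimination. Assumption: the lowest probability over
        # all registered groups is compared with the threshold.
        value = min(svm_probability(best_char, g) for g in groups)
    else:
        value = match_score                           # S260: first candidate character
    return best_char if value >= threshold else None  # S270, then S280 or S290
```

Note that in this flow the pattern matching score is compared with the threshold only when no similar group is registered; otherwise the SVM probability decides the outcome.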

Next, an example of the results of performing character recognition by the processing shown in the flowchart of FIG. 10 will be described. FIGS. 11(a) and 11(b) show examples of character region images cut out in step S210.

FIG. 11(a) is an image in which the character “C” is clearly visible. The result of performing character recognition on this image was as follows.

In the pattern matching of step S220, the highest score character was “C”, with a score of 0.999.

Accordingly, in step S230 the similar groups were searched for the character “C”, and in step S240 it was determined that four similar groups were registered: “C, D”, “C, G”, “C, O”, and “C, Q”.

Accordingly, in step S250, SVM discrimination was performed for the four similar groups “C, D”, “C, G”, “C, O”, and “C, Q”. As a result, the probability of the character “C” was 0.949 for the similar group “C, D”, 0.949 for “C, G”, 0.951 for “C, O”, and 0.948 for “C, Q”.

Accordingly, in step S270 it was determined that both the pattern matching score and the SVM discrimination probabilities exceeded the threshold of 0.7, and the character “C” was output as the recognized character in step S280.

In contrast, FIG. 11(b) is an image in which the character “C” is unclear. The result of performing character recognition on this image was as follows.

In the pattern matching of step S220, the highest score character was “C”, with a score of 0.33.

Accordingly, in step S230 the similar groups were searched for the character “C”, and in step S240 it was determined that four similar groups were registered: “C, D”, “C, G”, “C, O”, and “C, Q”.

Accordingly, in step S250, SVM discrimination was performed for the four similar groups “C, D”, “C, G”, “C, O”, and “C, Q”. As a result, the probability of the character “C” was 0.851 for the similar group “C, D”, 0.889 for “C, G”, 0.870 for “C, O”, and 0.900 for “C, Q”.

Accordingly, in step S270 it was determined that the SVM discrimination probabilities exceeded the threshold of 0.7, and the character “C” was output as the recognized character in step S280. In this way, a good recognition result was obtained even for an image in which the character “C” is unclear.

According to the present embodiment, as described for steps S1220, S1230, and so on, the learning data of the support vector machine 129 is generated using, as feature amounts, the luminance values of the first cells B1 obtained by subdividing the difference area A1 between the character portions of a plurality of model images.

Therefore, the differences between characters can be made more apparent than when the size of the difference area A1 is used as the feature amount. As a result, erroneous recognition of characters can be suppressed and recognition accuracy can be improved.

According to the present embodiment, as described for steps S1240, S1250, and so on, the learning data is generated also using, as feature amounts, the luminance values of the second cells B2 obtained by subdividing the difference area A2 between the background portions of the model images. Therefore, the differences between characters can be made even more apparent, and erroneous recognition of characters can be further suppressed.

According to the present embodiment, as described for steps S1260, S1270, and so on, the learning data of the support vector machine 129 is generated also using, as feature amounts, the luminance values of the third cells B3 obtained by subdividing the common area A3 between the character portions. Therefore, character recognition accuracy can be improved compared with the case where the learning data of the support vector machine 129 is generated using only the luminance values of the cells B1 and B2, obtained by subdividing the difference areas A1 and A2, as feature amounts.

According to the present embodiment, as described for step S1280, the total number of first cells B1 and second cells B2 used for the learning data is made larger than the number of third cells B3 used for the learning data. Therefore, the detection margin of the support vector machine 129 can be enlarged, and character recognition accuracy can be further improved.

According to the present embodiment, as described for step S250, the support vector machine 129 obtains the probability (certainty factor) for the recognition target character that obtained the highest score (degree of matching) in the pattern matching. That is, the recognition result of the pattern matching is re-evaluated by the support vector machine 129. Therefore, character recognition accuracy can be further improved.

(Other embodiments)
In the above embodiment, the learning data of the support vector machine 129 is generated using the average luminance values of the first, second, and third cells B1, B2, and B3 as feature amounts. Alternatively, the learning data of the support vector machine 129 may be generated using the maximum luminance values or the like of the first, second, and third cells B1, B2, and B3 as feature amounts.

In the above embodiment, similar groups are created from combinations of two types of characters, and SVM discrimination is performed on two types of characters. However, similar groups may be created from combinations of three or more types of characters, and SVM discrimination may be performed on three or more types of characters.

In the above embodiment, the total number of cells in the learning data of the support vector machine 129, the ratio of the numbers of the first, second, and third cells B1, B2, and B3, and the like may be changed as appropriate.

1 Workpiece (inspection object)
11 Imaging unit (image acquisition means)
124 Pattern matching means
129 Support vector machine
A1 Difference area of character portions
B1 First cell
A2 Difference area of background portions
B2 Second cell
A3 Common area of character portions
B3 Third cell
S120 Learning data generation means

Claims

A character recognition device comprising:
image acquisition means (11) for acquiring an inspection image obtained by photographing an inspection object (1) on which a character is written;
a support vector machine (129) for identifying the character appearing in a character region of the inspection image based on a feature amount obtained from the character region; and
learning data generation means (S120) for generating learning data of the support vector machine (129) using, as a feature amount, a luminance value of a first cell (B1) obtained by subdividing a difference area (A1) between character portions of a plurality of model images.

The character recognition device according to claim 1, wherein the learning data generation means (S120) generates the learning data using, as a feature amount, a luminance value of a second cell (B2) obtained by subdividing a difference area (A2) between background portions of the model images.

The character recognition device according to claim 2, wherein the learning data generation means (S120) generates the learning data of the support vector machine (129) using, as a feature amount, a luminance value of a third cell (B3) obtained by subdividing a common area (A3) between the character portions.

The character recognition device according to claim 3, wherein the learning data generation means (S120) makes the total number of the first cells (B1) and the second cells (B2) used for the learning data larger than the number of the third cells (B3) used for the learning data.

The character recognition device according to claim 1, wherein the plurality of model images are a set of images whose degree of matching with one another, obtained by pattern matching, exceeds a threshold value.

The character recognition device according to claim 1, further comprising pattern matching means (124) for performing pattern matching by comparing the character region with a template image corresponding to a recognition target character and obtaining a degree of matching with the recognition target character,
wherein the support vector machine (129) obtains a certainty factor for the recognition target character having the highest degree of matching.

A character recognition method comprising:
a step (S120) of generating learning data of a support vector machine (129) using, as a feature amount, a luminance value of a first cell (B1) obtained by subdividing a difference area (A1) between character portions of a plurality of model images;
a step (S200) of acquiring an inspection image obtained by photographing an inspection object (1) on which a character is written; and
a step (S250) of using the support vector machine (129) to identify the character appearing in a character region of the inspection image based on a feature amount obtained from the character region.