JP4062866B2

JP4062866B2 - Character recognition device and character recognition method

Info

Publication number: JP4062866B2
Application number: JP2000201853A
Authority: JP
Inventors: 紹明劉; 一寿市川
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2000-07-04
Filing date: 2000-07-04
Publication date: 2008-03-19
Anticipated expiration: 2020-07-04
Also published as: JP2002024765A

Description

【０００１】
【発明の属する技術分野】
本発明は認識装置及び認識方法に関し，特に文字認識を行う認識装置及び文字認識を行う認識方法に関するものである。
【０００２】
【従来の技術】
文字認識分野には，文字毎に，文字カテゴリに属しているすべての学習サンプルを用いて該文字の標準文字パターンを求め，求められた標準文字パターンを認識辞書に記憶しておく。認識するとき，入力された未知文字パターンを認識辞書に格納されているすべての標準文字パターンと比較し，もっとも近いものが認識の結果として出力される方法がもっとも一般的な認識方法である。ここで，文字特徴量の選択方法，標準文字パターンの作成方法，距離尺度或いは類似度尺度は認識精度を左右する重要な要素である。
【０００３】
標準文字パターンの作成方法について，各文字毎に，文字カテゴリに属しているすべての学習サンプルの中心値を該文字の標準文字パターンとして認識辞書に記憶させ，認識辞書を作成する方法がある。しかし，文字カテゴリに属している学習サンプルの分布がばらつき，かつ数が多い場合は，認識率が低いという問題点がある。
【０００４】
認識率を上げるために，各文字毎に複数の標準文字パターンを用いて認識を行う方法がある。例えば，特開昭６３−１２９４８８号公報には，マルチフォント文字を認識するために，各文字毎に複数の標準文字パターンを認識辞書に記憶しておき，その認識辞書を用いて認識を行う方法が提案された。また，学習サンプルを学習しながら，対応している標準文字パターンを修正し，或いは新しい標準文字パターンを追加して，認識辞書を作成する方法がある。例えば，特開平７−２８９５５号公報に記載されている方法が上記したものである。しかし，これらの方法には，認識辞書に標準文字パターンの数が多いので，認識時間が長いという問題があり，文字数が多い場合には，文字認識に要する処理時間は無視できないものとなる。
【０００５】
認識時間を短縮するために，例えば，特開平１０−１６２１０３号公報には，手書き文字学習サンプルを用いて手書き文字認識辞書，活字文字学習サンプルを用いて活字文字認識辞書をそれぞれ作成しておき，認識するとき，入力された未知文字が手書き文字か活字文字かを判断し，手書き文字の場合は手書き文字認識辞書，活字文字の場合は活字文字認識辞書を用いて認識を行う方法が提案されている。しかし，文字フォントの種類が多いので，文字フォントの種類をすべて区別するのは容易でないし，特に手書き文字の場合は，学習サンプルの分布が一定の法則に従わないので，１つの標準文字パターンで文字カテゴリに属しているすべての学習サンプルを表現するのは，認識率が低いという問題がある。
【０００６】
距離尺度或いは類似度尺度については，これまで数多く提案されている。代表的なものは，シテイブロック距離，ユークリッド距離，重み付きユークリッド距離，マハラノビス距離，投影距離などが挙げられる。これらの方法は文献『画像の処理と認識』安居院猛・長尾智晴（１９９２，昭晃堂）と，『基本多変量解析』浅野長一郎・江島伸興（日本規格協会），“手書き文字認識における投影距離法”池田正幸・田中英彦・岡本達（情処学論，ｖｏｌ．２４，ｎｏ．１，ｐｐ．１０６−１１２，１９８３）に記載されている。文字Ｘ＝（ｘ_１，ｘ_２，…，ｘ_ｎ）と文字Ｙ＝（ｙ_１，ｙ_２，…，ｙ_ｎ）の間のシテイブロック距離Ｄ_ｃ（Ｘ，Ｙ）は次の公式で計算する。ここで，｜Ｚ｜はＺの絶対値を表す。
【０００７】
【数２】

【０００８】
文字Ｘ＝（ｘ_１，ｘ_２，…，ｘ_ｎ）と文字Ｙ＝（ｙ_１，ｙ_２，…，ｙ_ｎ）の間のユークリッド距離Ｄ_ｅ（Ｘ，Ｙ）は次の公式で計算する。
【０００９】
【数３】

【００１０】
文字ｉの学習サンプルをＳ_１，Ｓ_２，…。Ｓ_ｋとし，サンプルＳ_１，Ｓ_２，…。Ｓ_ｋの中心値，すなわち，文字ｉの標準文字パターンをＵ_ｉで表す。文字Ｘ＝（ｘ_１，ｘ_２，…，ｘ_ｎ）と標準文字パターンＵ_ｉ＝（ｕ_ｉ１，ｕ_ｉ２，…，ｕ_ｉｎ）間の重み付きユークリッド距離Ｄ_ｗ（Ｘ，Ｕ_ｉ）は次の公式で計算する。
【００１１】
【数４】

ここで，
【００１２】
【数５】

である。文字Ｘ＝（ｘ_１，ｘ_２，…，ｘ_ｎ）と標準文字パターンＵ_ｉ＝（ｕ_ｉ１，ｕ_ｉ２，…，ｕ_ｉｎ）間のマハラノビス距離Ｄ_ｍ（Ｘ，Ｕ_ｉ）は次の公式で計算する。
【００１３】
【数６】

ここで，Σｉは文字ｉの学習サンプルの共分散行列を表し，Ｚ^−１は行列Ｚの逆行列であり，Ｚ^Ｔは行列Ｚの転置行列である。パターンＸ＝（ｘ_１，ｘ_２，…，ｘ_ｎ）と標準パターンＵ_ｉ＝（ｕ_ｉ１，ｕ_ｉ２，…，ｕ_ｉｎ）間の投影距離Ｄ_ｔ（Ｘ，Ｕ_ｉ）は次の公式で計算する。
【００１４】
【数７】

ここで，Φ_ｊはパターンの学習サンプルから計算された固有値を降順に並べたときにｊ番目に位置する固有値に対応する固有ベクトルであり，（α，β）はベクトルαとβの内積を表す。
【００１５】
シテイブロック距離，ユークリッド距離及び重み付きユークリッド距離は比較的簡単に求められるが，高い認識率を保証するのは困難である。マハラノビス距離は，生起確率がχ^２分布に従ったデータを対象としている距離であり，生起確率の高い分布の中心部分ほど距離が近く計算される。しかし，実際の文字の学習サンプルの分布はχ^２分布に従っているわけではないので，認識率を保証できない。また，文字の共分散行列を記憶するため，認識辞書が巨大であり，莫大な計算時間がかかるので，実用性が低い。
【００１６】
上述した従来技術には，２つの特徴がある。（１）１つ或いは複数の標準文字パターンで文字カテゴリを代表する；（２）文字パターンと文字パターン間の距離，或いは類似度を用いて文字パターンを比較する。次に従来技術の特徴（１）と（２）は誤認が発生する重要な原因であることを示す。
【００１７】
文字カテゴリに属している学習サンプルは一般に一定の分布に従わない，集中して固まっている場合もあるし，ばらばらに分散している場合もある。１つの標準文字パターンで文字カテゴリを代表した場合は，図２７に示すように，文字Ｐの認識範囲は，該文字の標準文字パターンを中心として（特徴（１）より），標準文字パターンともっとも遠い該文字カテゴリに属している学習サンプルと標準文字パターン間の距離を半径とする（特徴（２）より）多次元円Ｅ１になる。すなわち，入力された未知文字パターンがＥ１範囲に入ると，文字Ｐと認識される可能性が非常に高い。しかし，認識範囲Ｅ１は実際の文字学習サンプルの分布範囲Ｅ２より大きいため，多くの文字の認識範囲と重なってしまう。認識するとき，入力された未知文字パターンが重なっている範囲に入ると，間違って認識されることがある。例えば，図２８に示すように，文字Ｐ１の実際の分布範囲Ｅ４と文字Ｐ２の実際の分布範囲Ｅ６と重なっていないが，文字Ｐ１の認識範囲Ｅ３と文字Ｐ２の認識範囲Ｅ５と重なっている。入力された未知文字Ｘが文字Ｐ１の実際の分布範囲Ｅ４に入るので，文字Ｐ１と認識されるはずであるが，ＸがＰ１とＰ２の重なっている認識範囲に入っているので，文字Ｐ２と間違って認識される。すなわち，Ｘと文字Ｐ２の標準文字パターン間の距離がＸと文字Ｐ１の標準文字パターン間の距離より小さいので，文字Ｐ２と誤認される。
【００１８】
文字毎に複数の標準文字を用いて認識を行う場合は，認識範囲が重なっている文字の数が少なくなり，認識精度がある程度改善されるが，本質的な解決法ではない。
【００１９】
文字の分布に従って文字の認識範囲を縮小する，或いは文字の分布を想定して，認識範囲を想定した分布の形に近似するような距離関数，或いは類似度関数を用いて認識を行う場合は（例えば，重み付きユークリッド距離，マハラノビス距離など），認識範囲が重なっている文字の数が少なくなり，認識精度がある程度改善されるが，分布が一定の規則に従わない文字に対して，高い認識率を保証できない問題点があり，本質的な解決法ではない。
【００２０】
【発明が解決しようとする課題】
本発明は，上述した事情に鑑みてなされたもので，文字カテゴリを代表する標準文字パターンを用いて文字認識を行うときの認識率低下問題を解決し，高い認識率かつ簡単な文字認識方法を提供することを目的とするものである。
【００２１】
【課題を解決するための手段】
上記の課題を解決するため，本発明は，特許請求の範囲に記載のとおりの構成を採用している。すなわち，本発明の具体的な構成では，文字パターンのペリフェラル特徴量と，ストローク特徴量と，メッシュ特徴量をそれぞれ抽出し，抽出された３種類の特徴量を並べて該文字の複合特徴量を求め，各文字毎に，文字カテゴリに属しているすべての学習サンプルから，学習サンプル特徴量の各次元毎に，次元の値を列挙し，列挙した値を変換し，変換された各次元の値を該文字のカテゴリデータとして認識辞書に記憶させ，認識辞書を作成しておく。認識するとき，文字パターンと文字カテゴリ間の類似度の計算方法を用いて，入力された未知文字パターンと認識辞書に格納されているすべての文字カテゴリデータ間の類似度を計算し，もっとも類似な文字カテゴリを認識の結果として出力することにより文字を高精度・高速かつ簡単に認識することができる。
【００２２】
また，本発明によれば，特徴量の分布に対応するビット列データからなる文字カテゴリデータと，認識対象の同様の文字パターンの文字パターンデータとを比較して文字認識を行なうので精度よく認識を行なえる。さらに，複数種類の特徴量のビット列パターンを連結させればより正確な認識が可能となる。
【００２３】
なお，本発明は装置および方法として実現でき，またその方法の少なくとも一部をコンピュータプログラムとして実装することができる。このコンピュータプログラムを記録した記録媒体（プログラムパッケージ）や，当該コンピュータプログラムをコンピュータシステムにインストールするためのコンピュータプログラムを記録した記録媒体が，本発明の技術的な範囲に含まれることはもちろんである。
【００２４】
【発明の実施の形態】
図１は，本発明の認識装置の実施の一形態を示すブロック図である。図中，１は１文字分の文字画像を入力する手段，２は文字のペリフェラル特徴量を抽出する手段，３は文字のストローク特徴量を抽出する手段，４は文字のメッシュ特徴量を抽出する手段，５は文字の複合特徴量を求める手段，６は文字パターンと文字カテゴリ間の類似度を計算する手段，７は認識手段，８は認識辞書作成手段，８ａは認識辞書を格納する手段，９は文字カテゴリ作成手段，１０は記憶手段である。
【００２５】
メモリＭ１，Ｍ２及びＭ３は，それぞれ特徴量抽出手段２，３，４で抽出されたペリフェラル特徴量，ストローク特徴量及びメッシュ特徴量を格納する。メモリＭ４は，複合特徴量を求める手段５で求められた文字の複合特徴量を格納する。メモリＭ５は，認識辞書から認識手段７で検出された入力された未知文字パターンともっとも類似な文字の名前とカテゴリデータを格納する。
【００２６】
特徴量抽出手段２は，文字画像入力手段１で入力された１文字分の文字画像をそれぞれ横に２Ａ‐１区分，縦に２Ａ‐１区分に分割し，文字画像の幅或いは高さの１／Ｐを各区分の走査範囲として，各区分を走査してペリフェラル特徴量を抽出する。特徴量格納手段２ａは前記抽出されたペリフェラル特徴量をメモリＭ１に格納する。
【００２７】
特徴量抽出手段３は，文字画像入力手段１で入力された１文字分の文字画像をそれぞれ横に２Ａ‐１区分，縦に２Ａ‐１区分に分割し，各区分の走査範囲を文字画像の幅或いは高さとして，各区分を走査してストローク特徴量を抽出する。特徴量格納手段３ａは前記抽出されたストローク特徴量をメモリＭ２に格納する。
【００２８】
特徴量抽出手段４は，文字画像入力手段１で入力された１文字分の文字画像をそれぞれサイズがｂ画像＊ｂ画像の子領域Ｂ個，２Ｃ個，Ｄ個に分割し，各子領域を走査してメッシュ特徴量を抽出する。特徴量格納手段４ａは前記抽出されたメッシュ特徴量をメモリＭ３に格納する。
【００２９】
複合特徴量を求める手段５は，前記抽出された３種類の特徴量を並べ，１つの特徴量として求める。複合特徴量格納手段５ａは前記求められた複合特徴量をメモリＭ４に格納する。図２は文字の複合特徴量５０を示している。複合特徴量５０がペリフェラル特徴量５１，ストローク特徴量５２，メッシュ特徴量５３から構成されている。
【００３０】
文字カテゴリデータ作成手段９は，文字カテゴリに属しているすべての学習サンプルを用いて文字カテゴリデータを作成する。作成された各文字カテゴリデータを用いて，認識辞書作成手段８で認識辞書を作成する。作成された認識辞書を認識辞書格納手段８ａで格納する。図３は認識辞書内の認識辞書データを示す図である。認識辞書データ６０は，すべての文字（ｍ個）のデータ６１〜６ｍから構成されている。各文字のデータは文字の名前と文字カテゴリデータのベクトルから構成されている。
【００３１】
認識手段７は，類似度計算手段６を用いて，認識辞書に格納されている文字カテゴリデータの中から，入力され未知文字パターンともっとも類似な文字カテゴリを求め，その結果をメモリＭ５に記憶させる。記憶手段１０は，認識手段７で認識された文字の名前とカテゴリデータを格納する。
【００３２】
次に本発明の文字認識装置の装置適用例として，情報端末装置に適用させた場合の装置構成について説明する。図４は本発明の文字認識装置を情報端末装置に適用させた場合の装置構成を示す図である。
【００３３】
情報端末装置７０は，キーボート７１，外部記憶装置７２，ディスプレイ７３，プロセッサ部７４から構成される。キーボート７１は，ユーザが操作を指示するための入力装置であり，その他の入力装置が付加されていてもよい。外部記憶装置７２は，入力された未知文字パターンのデータや，認識辞書のデータや，認識結果や，ソフトウェアを格納する。また，特徴量格納手段２ａ，３ａ，４ａ，複合特徴量格納手段５ａ，認識辞書格納手段８ａをこの外部記憶装置７２の一部として構成することができる。さらに，記憶手段１０によって認識された文字の名前とカテゴリデータを格納してもよい。外部記憶装置７２の具体例として，例えばハードディスクなどで構成することができる。ディスプレイ７３は，ユーザに対するメッセージや認識文字のデータ，認識の結果などを表示するための出力装置である。もちろん他の出力装置が付加されていてもよい。プロセッサ部７４は，外部記憶装置７２に格納されているソフトウェアなどに従って，実際の処理を行う。プロセッサ部７４は，具体的にマイクロプロセッサや，パーソナルコンピュータなどのコンピュータシステムで構成することができる。そして，文字特徴量抽出手段２，３，４，複合特徴量を求める手段５，文字カテゴリデータ作成手段９，類似度計算手段６，認識手段７は，このプロセッサ部７４の上で動作するソフトウェアによって構成することができる。
【００３４】
次に本発明の文字認識装置の動作をさらに詳細に説明する。まず，特徴量抽出手段２について説明する。
【００３５】
図５は特徴量抽出手段２の実施の一形態を示すブロック図である。メモリＭ２１〜メモリＭ２４は文字画像入力手段１で入力された１文字分の文字画像を記憶する。
横領域分割手段２１は，メモリＭ２１に記憶している１文字分の文字画像を横にＡ区分に分割する。例えば，図８（ａ）は前記文字画像を横に４（Ａ＝４）区分に分割した様子を示している。横領域分割手段２２は，前記横領域分割手段２１で分割されたＡ区分に対して，ｋ（ｋ＝１，２，…，Ａ‐１）区分目の下半分とｋ＋１区分目の上半分を１区分とし，メモリＭ２２に記憶している１文字分の文字画像を横にＡ‐１区分に分割する。例えば，図８（ｂ）は前記文字画像を横に３（Ａ‐１＝４‐１＝３）区分に分割した様子を示している。縦領域分割手段２３は，メモリＭ２３に記憶している１文字分の文字画像を縦にＡ区分に分割する。例えば，図９（ａ）は前記文字画像を縦に４（Ａ＝４）区分に分割した様子を示している。縦領域分割手段２４は，前記縦領域分割手段２３で分割されたＡ区分に対して，ｋ（ｋ＝１，２，…，Ａ‐１）区分目の右半分とｋ＋１区分目の左半分を１区分とし，メモリＭ２４に記憶している１文字分の文字画像を縦に３（Ａ‐１＝４‐１＝３）区分に分割した様子を示している。ここで，横区分数と縦区分数は異なってもかまわない。
【００３６】
走査範囲制御手段２６は，横区分に対して前記文字画像の外接矩形の左辺と右辺の計２辺から文字方向に文字の幅の１／Ｐまで走査することを制御し，縦区分に対して前記文字画像の外接矩形の上辺と下辺の計２辺から文字方向に文字の高さの１／Ｐまで走査することを制御する。ここで，Ｐは正整数である。
【００３７】
特徴抽出手段２５は，まず，領域分割手段２１，２２により分割された横の２Ａ‐１区分の各区分毎に，前記走査範囲の制限手段２６によって制限された走査範囲において，文字画像の左辺からａ回走査し（ａ＝前記文字画像の高さ／Ａ），最初に文字を構成する画素（黒画素）にあたるまでの背景画像の画素数を計数し，ａ回走査して計数された画素数の平均値を求める。続いて，領域分割手段２１，２２により分割された横の２Ａ‐１区分の各区分毎に，前記走査範囲の制限手段２６によって制限された走査範囲において，文字画像の右辺からａ回走査し（ａ＝前記文字画像の高さ／Ａ），最初に文字を構成する画素（黒画素）にあたるまでの背景画像の画素数を計数し，ａ回走査して計数された画素数の平均値を求める。また，領域分割手段２３，２４により分割された縦の２Ａ‐１区分の各区分毎に，前記走査範囲の制限手段２６によって制限された走査範囲において，文字画像の上辺からａ回走査し（ａ＝前記文字画像の幅／Ａ），最初に文字を構成する画素（黒画素）にあたるまでの背景画像の画素数を計数し，ａ回走査して計数された画素数の平均値を求める。最後に，領域分割手段２３，２４により分割された縦の２Ａ‐１区分の各区分毎に，前記走査範囲の制限手段２６によって制限された走査範囲において，文字画像の下辺からａ回走査し（ａ＝前記文字画像の幅／Ａ），最初に文字を構成する画素（黒画素）にあたるまでの背景画像の画素数を計数し，ａ回走査して計数された画素数の平均値を求める。図１０（ａ），（ｂ）は，Ａ＝４，Ｐ＝３のとき，領域分割手段２１，２２により分割された横７（２Ａ‐１）区分の特徴量を抽出する様子を示す図である。図１０（ｃ），（ｄ）は，Ａ＝４，Ｐ＝３のとき，領域分割手段２３，２４により分割された縦７（２Ａ‐１）区分の特徴量を抽出する様子を示す図である。
【００３８】
記憶手段２ａは，特徴量抽出手段２５によって抽出された特徴量を図１に示すメモリＭ１に格納する。
【００３９】
次に特徴量抽出手段３について説明する。図６は特徴量抽出手段３の実施の一形態を示すブロック図である。メモリＭ３１〜メモリＭ３４は文字画像入力手段１で入力された１文字分の文字画像を記憶する。
【００４０】
横領域分割手段３１は，メモリＭ３１に記憶している１文字分の文字画像を横にＡ区分に分割する。例えば，図８（ａ）は前記文字画像を横に４（Ａ＝４）区分に分割した様子を示している。横領域分割手段３２は，前記横領域分割手段３１で分割されたＡ区分に対して，ｋ（ｋ＝１，２，…，Ａ‐１）区分目の下半分とｋ＋１区分目の上半分を１区分とし，メモリＭ３２に記憶している１文字分の文字画像を横にＡ‐１区分に分割する。例えば，図８（ｂ）は前記文字画像を横に３（Ａ‐１＝４‐１＝３）区分に分割した様子を示している。縦領域分割手段３３は，メモリＭ３３に記憶している１文字分の文字画像を縦にＡ区分に分割する。例えば，図９（ａ）は前記文字画像を縦に４（Ａ＝４）区分に分割した様子を示している。縦領域分割手段３４は，前記縦領域分割手段３３で分割されたＡ区分に対して，ｋ（ｋ＝１，２，…，Ａ‐１）区分目の右半分とｋ＋１区分目の左半分を１区分とし，メモリＭ３４に記憶している１文字分の文字画像を縦に３（Ａ‐１＝４‐１＝３）区分に分割した様子を示している。ここで，横区分数と縦区分数は異なってもかまわない。
【００４１】
特徴抽出手段３５は，まず，領域分割手段３１，３２により分割された横の２Ａ‐１区分の各区分毎に，前記文字画像の幅を走査範囲として，文字画像の左辺からａ回走査し（ａ＝前記文字画像の高さ／Ａ），背景画素（白画素）から文字を構成する画素（黒画素）に，及び文字を構成する画素（黒画素）から背景画素（白画素）に変化する回数を計数し，ａ回走査して計数された回数の平均値を求める。続いて，領域分割手段３３，３４により分割された縦の２Ａ‐１区分の各区分毎に，文字画像の高さを走査範囲として，文字画像の上辺からａ回走査し（ａ＝前記文字画像の幅／Ａ），背景画素（白画素）から文字を構成する画素（黒画素）に，及び文字を構成する画素（黒画素）から背景画素（白画素）に変化する回数を計数し，ａ回走査して計数された回数の平均値を求める。図１１（ａ），（ｂ）は，Ａ＝４のとき，領域分割手段３１，３２により分割された横７（２Ａ‐１）区分の特徴量を抽出する様子を示す図である。図１１（ｃ），（ｄ）は，Ａ＝４のとき，領域分割手段３３，３４により分割された縦７（２Ａ‐１）区分の特徴量を抽出する様子を示す図である。
【００４２】
記憶手段３ａは，特徴量抽出手段３５によって抽出された特徴量を図１に示すメモリＭ２に格納する。
【００４３】
次に特徴量抽出手段４について説明する。図７は特徴量抽出手段４の実施の一形態を示すブロック図である。メモリＭ４１〜メモリＭ４４は文字画像入力手段１で入力された１文字分の文字画像を記憶する。
【００４４】
子領域分割手段４１は，メモリＭ４１に記憶している１文字分の文字画像をサイズがｂ画素＊ｂ画素の子領域Ｂ個に分割する。例えば，図１２（ａ）は子領域分割手段４１で前記文字画像を１６（Ｂ＝１６）個の子領域に分割した様子を示している。子領域分割手段４２は，前記子領域分割手段４１で分割されたＢ個の子領域に対して，前記文字画像の右側にある子領域以外の子領域毎に，子領域の右半分と右隣の子領域の左半分を１子領域とし，Ｃ個の子領域に分割する。図１２（ｂ）は子領域分割手段４２で前記文字画像を１２（Ｂ＝１６のとき）個の子領域に分割した様子を示している。子領域分割手段４３は，前記子領域分割手段４１で分割されたＢ個の子領域に対して，前記文字画像の下側にある子領域以外の子領域毎に，子領域の下半分と下隣の子領域の上半分を１子領域とし，Ｃ個の子領域に分割する。図１２（ｃ）は子領域分割手段４３で前記文字画像を１２（Ｂ＝１６のとき）個の子領域に分割した様子を示している。子領域分割手段４４は，前記子領域分割手段４２で分割されたＣ個の子領域に対して，前記文字画像の下側にある子領域以外の子領域毎に，子領域の下半分と下隣の子領域の上半分を１子領域とし，Ｄ個の子領域に分割する。図１２（ｄ）は子領域分割手段４４で前記文字画像を９（Ｂ＝１６，Ｃ＝１２のとき）個の子領域に分割した様子を示している。ここで，ｂとＢは共に正整数であり，ｂ＊Ｂ＝文字画像の幅（或いは高さ）である。
【００４５】
特徴抽出手段４５は，領域分割手段４１，４２，４３，４４によりそれぞれ分割されたＢ，Ｃ，Ｃ，Ｄ個の子領域の各子領域毎に，子領域画像の左辺から走査し，文字を構成する画素（黒画素）数を計数する
【００４６】
記憶手段４ａは，特徴量抽出手段４５によって抽出された特徴量を図１に示すメモリＭ３に格納する。
【００４７】
次に文字の複合特徴量を求める手段５について説明する。複合特徴量を求める手段５は，特徴抽出手段２，特徴抽出手段３及び特徴抽出手段４によって抽出された特徴量を並べ，図１に示すメモリＭ４に記憶させる。
【００４８】
次に認識辞書格納手段８ａで文字カテゴリデータを格納するときの文字カテゴリデータの作成手段９について説明する。図１３は文字カテゴリデータの作成手段９の実施の一形態を示すブロック図である。
【００４９】
メモリＭ９０は１文字のすべての学習サンプル特徴量を格納している。メモリＭ９１，Ｍ９２，Ｍ９３，…，Ｍ９ｎ（ｎは特徴量ベクトルの次元数）は，それぞれ特徴量の各次元の列挙した値を記憶する。
【００５０】
文字サンプル特徴量の入力手段９０は，１文字のすべての学習サンプル特徴量を入力し，メモリＭ９０に記憶させる。
【００５１】
列挙手段９１は，メモリＭ９０に格納している１文字のすべての学習サンプルの特徴量から，次元毎に，次元のとりうる値を列挙し，列挙した各次元の値をそれぞれメモリＭ９１，Ｍ９２，Ｍ９３，…，Ｍ９ｎ記憶させる。
【００５２】
特徴量の変化範囲決定手段９４は，文字画像分割手段４１（４２，４３，４４）で分割された子領域内の画素数ｂ^２（メッシュ特徴量の最大値）を文字特徴量の変化範囲とする。
【００５３】
カテゴリデータの表現手段９３は，図１４に示すように，ｎ次元のベクトルで表現し，各次元をｂ^２＋１個のビットで表す。
【００５４】
列挙した値を変換する手段９２は，メモリＭ９１，Ｍ９２，Ｍ９３，…，Ｍ９ｎに格納している各次元の列挙した値を変換する。メモリＭ９ｉ（ｉ＝１，２，…，ｎ）に記憶しているｉ次元目の列挙した値｛ｅ_ｉ１，ｅ_ｉ２，．…，ｅ_ｉｓ｝に対して，カテゴリデータのｉ次元目の第ｅ_ｉｊ＋１ビットの値を“１”と設定し（ｊ＝１，２，…，ｓ），その以外のビットの値を“０”と設定する。
【００５５】
格納手段８ａは，求められたカテゴリデータを認識辞書に格納させる。
【００５６】
図１５（ａ）は，文字カテゴリに属している５つの学習サンプルを示している。ここで，文字特徴量の次元数ｎ＝６であり，文字特徴量の変化範囲が１６である。従って，該文字カテゴリデータを６次元のベクトルで表し，各次元を１７ｂｉｔｓで表す。図１５（ｂ）は，列挙手段９１で列挙された各次元の値を示している。例えば，列挙された１次元目の値は３，４，６，８であり，２次元目の値は８，１０，１１，１２である。図１５（ｃ）は，変換手段９２で求められた文字カテゴリデータを示している。
【００５７】
文字カテゴリデータの作成方法から分かるように，文字カテゴリデータは，ｎ次元空間に，文字カテゴリに属しているすべての学習サンプルが各次元毎に現れる位置の範囲を示している。例えば，図１６は文字カテゴリに属しているすべての学習サンプルが１次元目，２次元目に現れる位置範囲を示している。ここで，ａ１，ａ２は１次元目の位置範囲であり，ｂ１，ｂ２は２次元目の位置範囲である。各次元に現れる位置範囲は連続の場合もあるし，離散の場合もある。例えば，図１５（ｃ）に示している文字カテゴリデータに対して，１次元に現れる位置範囲は３〜４，６，８であり，２次元に現れる位置範囲は８，１０〜１２である。３，５，６次元の位置範囲は連続的なものであり，１，２，４次元の位置範囲は離散的なものである。文字カテゴリデータで示す該文字カテゴリに属している学習サンプルが各次元毎に現れる位置範囲は，該文字の認識範囲である。図１６に示す４つの長方形は該文字の認識範囲である。図に示すように，この認識範囲は比較的に文字の学習サンプルの分布に近いので，認識範囲が重なっている文字の数を大幅に削減することができる。例えば，図１７（ａ）に示している７つの文字Ｐ１，Ｐ２，…，Ｐ７について，従来の技術により，Ｐ１〜Ｐ７の認識範囲は図１７（ａ）に示している点線円Ｅ１１〜Ｅ１７である。Ｅ１１はＥ１２及びＥ１６と，Ｅ１２はＥ１１，Ｅ１３，Ｅ１５及びＥ１６と，Ｅ１３はＥ１２及びＥ１４と，Ｅ１６はＥ１１，Ｅ１２，Ｅ１５，Ｅ１７と重なっている。しかし，本発明により，文字の認識範囲は図１７（ｂ）に示すＥ２１〜Ｅ２７である。図から分かるように，Ｅ２１，Ｅ２２，…，Ｅ２７は相互に重なっていない。
【００５８】
次に文字パターンと文字カテゴリ間の類似度を計算する手段６について説明する。類似度の計算手段６は，メモリＭ４に格納されている未知文字Ｘ＝（ｘ_１，ｘ_２，…，ｘ_ｎ）と認識辞書に格納している文字カテゴリデータＣａｔ（ｉ）＝（ｃａｔ_１（ｉ），ｃａｔ_２（ｉ），…，ｃａｔ_ｎ（ｉ））間の類似度Ｓ（Ｘ，Ｃａｔ（ｉ））は次のように計算される。
【００５９】
【数８】

ここで，ｆ（ａ，ｂ）＝１，ｉｆｂのａ＋１ビット目の値＝１；ｆ（ａ，ｂ）＝０，ｉｆｂのａ＋１ビット目の値＝０である。
【００６０】
関数ｆ（）の定義から分かるように，入力された未知文字Ｘのｊ次元目の値ｘ_ｊはカテゴリデータのｊ次元目の位置範囲に入ると，類似度がすこし高くなる。逆に，入力された未知文字Ｘのｊ次元の値ｘ_ｊはカテゴリデータのｊ次元目の位置範囲以外に入ると，類似度がすこし低くなる。すべての次元に対して，ｆ（）＝１なら，類似度＝１であるので，カテゴリに属しているすべての学習サンプルと該文字のカテゴリデータ間の類似度は同じであり，“１”である。認識するとき，未知文字Ｘが文字カテゴリデータで示す文字Ｐの認識範囲に入ると，Ｓ（Ｘ，Ｐ）＝１になり，文字Ｐが認識の結果として出力される。これは従来技術で実現できなかった部分である。
【００６１】
本発明の文字カテゴリデータ作成方法及び文字パターンと文字カテゴリ間の類似度の計算方法は，人間の認識機能に近似するものである。人間はものの特徴を思い出すときに，ものの各特徴及び特徴量の変化範囲が思い出される。例えば，“リンゴ”の特徴を思い出すとき，“色は赤い，黄色い或いは青いなどがあり，黒はないこと；味は甘い，甘酢っぱいなどがあり，辛いはないこと；重さが１５０グラム位〜４５０グラム位；”などが自然に思い出される。つまり，人間は学習するとき，学習対象の各特徴量を取って，各特徴及び特徴量の変化範囲を記憶していることが考えられる。例えば，いろんな“リンゴ”を学習した後，“色”，“形”，“味”，“重さ”，“高さ”，“幅”等の特徴，“色”特徴量の変化範囲が“赤色，青色，黄色”，“重さ”特徴量の変化範囲が“１５０グラム位〜４００グラム位”，“高さ”特徴量の変化範囲が“６ｃｍ位〜１２ｃｍ位”などが記憶されるはずである。認識するとき，取れた特徴量の値は学習した“リンゴ”の特徴量の変化範囲内の場合は，“リンゴ”として認識されるはずである。勿論，人間は連想という機能を持っているので，未学習したリンゴも認識できる。これは，未学習したリンゴは，学習したリンゴに似ているからである。
【００６２】
次に認識手段７について説明する。認識手段７は，文字と文字カテゴリ間の類似度を計算する手段６を用いて，メモリＭ４に格納している未知文字パターンと，認識辞書に格納されているすべての文字カテゴリデータ間の類似度を計算し，未知文字ともっとも類似な文字カテゴリを認識の結果としてメモリＭ５に出力する。
【００６３】
次に入力された１文字分の文字画像から，特徴量抽出手段２で文字のペリフェラル特徴量を抽出するときの動作をフローチャートを用いて説明する。図１８〜図２１は特徴量抽出手段２の動作手順を示すフローチャートである。図１８は文字画像を横に分割された２Ａ‐１区分の各区分毎に，区分の左辺から該区分を走査して，該分区の特徴量を抽出する動作手順のフローチャートである。
〔Ｓ１〕：未処理の区分に移動し，該区分の行数の初期値をｋ＝１と設定し，該区分の特徴量を表す変数Ｆｅａを初期化する。
〔Ｓ２〕：各区分に対して，該区分の一番上の行の一番左の画素を取り出す。
〔Ｓ３〕：取り出した画素が背景画素であるかどうかを判定し，背景画像の場合は，Ｓ４へ行く。背景画素でない場合は，Ｓ７へ行く。
〔Ｓ４〕：Ｆｅａ＝Ｆｅａ＋１。
〔Ｓ５〕：取り出した画素が該行の左側から該行の“幅／Ｐ”番目の画素であるかどうかを判定し，該行の“幅／Ｐ”番目の画素である場合は，Ｓ６へ行く。そうではない場合は，Ｓ７へ行く。
〔Ｓ６〕：取り出した画素の右の画素を取り出す。Ｓ３へ行く。
〔Ｓ７〕：下の行に移動し，ｋ＝ｋ＋１である。Ｓ８へ行く。
〔Ｓ８〕：該区分の全行が全て処理されたかどうかを判定し，全部処理された場合は，Ｓ９へ行く。また残った場合は，Ｓ２へ行く。
〔Ｓ９〕：該区分特徴量を求める。Ｓ１０へ行く。
〔Ｓ１０〕：横の２Ａ‐１区分は全て処理されたかどうかを判定し，全部処理された場合は，終了する。もた残った区分があれば，Ｓ１へ行く。
【００６４】
図１９は文字画像を横に分割された２Ａ‐１区分の各区分毎に，区分の右辺から該区分を走査して，該分区の特徴量を抽出する動作手順のフローチャートである。
〔Ｓ１１〕：未処理の区分に移動し，該区分の行数の初期値をｋ＝１と設定し，該区分の特徴量を表す変数Ｆｅａを初期化する。
〔Ｓ１２〕：各区分に対して，該区分の一番上の行の一番右の画素を取り出す。
〔Ｓ１３〕：取り出した画素が背景画素であるかどうかを判定し，背景画像の場合は，Ｓ１４へ行く。背景画素でない場合は，Ｓ１７へ行く。
〔Ｓ１４〕：Ｆｅａ＝Ｆｅａ＋１。
〔Ｓ１５〕：取り出した画素が該行の右側から該行の“幅／Ｐ”番目の画素であるかどうかを判定し，該行の“幅／Ｐ”番目の画素である場合は，Ｓ１６へ行く。そうではない場合は，Ｓ１７へ行く。
〔Ｓ１６〕：取り出した画素の左の画素を取り出す。Ｓ１３へ行く。
〔Ｓ１７〕：下の行に移動し，ｋ＝ｋ＋１である。Ｓ１８へ行く。
〔Ｓ１８〕：該区分の全行が全て処理されたかどうかを判定し，全部処理された場合は，Ｓ１９へ行く。また残った場合は，Ｓ１２へ行く。
〔Ｓ１９〕：該区分特徴量を求める。Ｓ２０へ行く。
〔Ｓ２０〕：横の２Ａ‐１区分は全て処理されたかどうかを判定し，全部処理された場合は，終了する。もた残った区分があれば，Ｓ１１へ行く。
【００６５】
図２０は文字画像を縦に分割された２Ａ‐１区分の各区分毎に，区分の上端から該区分を走査して，該分区の特徴量を抽出する動作手順のフローチャートである。
〔Ｓ２１〕：未処理の区分に移動し，該区分の列数の初期値をｋ＝１と設定し，該区分の特徴量を表す変数Ｆｅａを初期化する。
〔Ｓ２２〕：各区分に対して，該区分の一番左の列の一番上の画素を取り出す。
〔Ｓ２３〕：取り出した画素が背景画素であるかどうかを判定し，背景画像の場合は，Ｓ２４へ行く。背景画素でない場合は，Ｓ２７へ行く。
〔Ｓ２４〕：Ｆｅａ＝Ｆｅａ＋１。
〔Ｓ２５〕：取り出した画素が該列の上端から該列の“高さ／Ｐ”番目の画素であるかどうかを判定し，該列の“高さ／Ｐ”番目の画素である場合は，Ｓ２６へ行く。そうではない場合は，Ｓ２７へ行く。
〔Ｓ２６〕：取り出した画素の下の画素を取り出す。Ｓ２３へ行く。
〔Ｓ２７〕：右の列に移動し，ｋ＝ｋ＋１である。Ｓ２８へ行く。
〔Ｓ２８〕：該区分の全列が全て処理されたかどうかを判定し，全部処理された場合は，Ｓ２９へ行く。また残った場合は，Ｓ２２へ行く。
〔Ｓ２９〕：該区分特徴量を求める。Ｓ３０へ行く。
〔Ｓ３０〕：縦の２Ａ‐１区分は全て処理されたかどうかを判定し，全部処理された場合は，終了する。もた残った区分があれば，Ｓ２１へ行く。
【００６６】
図２１は文字画像を縦に分割された２Ａ‐１区分の各区分毎に，区分の下端から該区分を走査して，該分区の特徴量を抽出する動作手順のフローチャートである。
〔Ｓ３１〕：未処理の区分に移動し，該区分の列数の初期値をｋ＝１と設定し，該区分の特徴量を表す変数Ｆｅａを初期化する。
〔Ｓ３２〕：各区分に対して，該区分の一番左の列の一番下の画素を取り出す。
〔Ｓ３３〕：取り出した画素が背景画素であるかどうかを判定し，背景画像の場合は，Ｓ３４へ行く。背景画素でない場合は，Ｓ３７へ行く。
〔Ｓ３４〕：Ｆｅａ＝Ｆｅａ＋１。
〔Ｓ３５〕：取り出した画素が該列の下端から該列の“高さ／Ｐ”番目の画素であるかどうかを判定し，該列の“高さ／Ｐ”番目の画素である場合は，Ｓ３６へ行く。そうではない場合は，Ｓ３７へ行く。
〔Ｓ３６〕：取り出した画素の上の画素を取り出す。Ｓ３３へ行く。
〔Ｓ３７〕：右の列に移動し，ｋ＝ｋ＋１である。Ｓ３８へ行く。
〔Ｓ３８〕：該区分の全列が全て処理されたかどうかを判定し，全部処理された場合は，Ｓ３９へ行く。また残った場合は，Ｓ３２へ行く。
〔Ｓ３９〕：該区分特徴量を求める。Ｓ４０へ行く。
〔Ｓ４０〕：縦の２Ａ‐１区分は全て処理されたかどうかを判定し，全部処理された場合は，終了する。もた残った区分があれば，Ｓ３１へ行く。
【００６７】
次に入力された１文字分の文字画像から，特徴量抽出手段３で文字のストローク特徴量を抽出するときの動作をフローチャートを用いて説明する。図２２および図２３は特徴量抽出手段３の動作手順を示すフローチャートである。図２２は文字画像を横に分割された２Ａ‐１区分の各区分毎に，区分の左辺から該区分を走査して，該分区の特徴量を抽出する動作手順のフローチャートである。
〔Ｓ４１〕：未処理の区分に移動し，該区分の行数の初期値をｋ＝１と設定し，該区分の特徴量を表す変数Ｆｅａを初期化する。
〔Ｓ４２〕：各区分に対して，該区分の一番上の行の一番左の画素及び該画素の右隣の画素を取り出す。
〔Ｓ４３〕：取り出した画素が該画素の左隣の画素と同じかどうかを判定し，同じの場合は，Ｓ４６へ行く。同じではない場合は，Ｓ４４へ行く。
〔Ｓ４４〕：Ｆｅａ＝Ｆｅａ＋１。
〔Ｓ４５〕：該行の画素がすべて処理された場合は，Ｓ４７へ行く。そうではない場合は，Ｓ４６へ行く。
〔Ｓ４６〕：取り出した画素の右の画素を取り出す。Ｓ４３へ行く。
〔Ｓ４７〕：下の行に移動し，ｋ＝ｋ＋１である。Ｓ４８へ行く。
〔Ｓ４８〕：該区分の全行が全て処理されたかどうかを判定し，全部処理された場合は，Ｓ４９へ行く。また残った場合は，Ｓ４２へ行く。
〔Ｓ４９〕：該区分特徴量を求める。Ｓ５０へ行く。
〔Ｓ５０〕：横の２Ａ‐１区分は全て処理されたかどうかを判定し，全部処理された場合は，終了する。もた残った区分があれば，Ｓ４１へ行く。
【００６８】
図２３は文字画像を縦に分割された２Ａ‐１区分の各区分毎に，区分の上端から該区分を走査して，該分区の特徴量を抽出する動作手順のフローチャートである。
〔Ｓ５１〕：未処理の区分に移動し，該区分の列数の初期値をｋ＝１と設定し，該区分の特徴量を表す変数Ｆｅａを初期化する。
〔Ｓ５２〕：各区分に対して，該区分の一番左の列の一番上の画素及び該画素の下の画素を取り出す。
〔Ｓ５３〕：取り出した画素が該画素の上の画素と同じかどうかを判定し，同じの場合は，Ｓ５６へ行く。同じではない場合は，Ｓ５４へ行く。
〔Ｓ５４〕：Ｆｅａ＝Ｆｅａ＋１。
〔Ｓ５５〕：該列の画素がすべて処理された場合は，Ｓ５７へ行く。そうではない場合は，Ｓ５６へ行く。
〔Ｓ５６〕：取り出した画素の下の画素を取り出す。Ｓ５３へ行く。
〔Ｓ５７〕：右の列に移動し，ｋ＝ｋ＋１である。Ｓ５８へ行く。
〔Ｓ５８〕：該区分の全列が全て処理されたかどうかを判定し，全部処理された場合は，Ｓ５９へ行く。また残った場合は，Ｓ５２へ行く。
〔Ｓ５９〕：該区分特徴量を求める。Ｓ６０へ行く。
〔Ｓ６０〕：縦の２Ａ‐１区分は全て処理されたかどうかを判定し，全部処理された場合は，終了する。もた残った区分があれば，Ｓ５１へ行く。
【００６９】
次に入力された１文字分の文字画像から，特徴量抽出手段４で文字のメッシュ特徴量を抽出するときの動作をフローチャートを用いて説明する。図２４は特徴量抽出手段４の動作手順を示すフローチャートである。
〔Ｓ６１〕：各子領域に対して，該子領域の一番上の行の一番左の画素を取り出す。
〔Ｓ６２〕：取り出した画素が背景画素であるかどうかを判定し，背景画素の場合は，Ｓ６５へ行く。背景画素ではない場合は，Ｓ６３へ行く。
〔Ｓ６３〕：該子領域の特徴量を１に増やす。
〔Ｓ６４〕：該行の画素がすべて処理されたかどうかを判定する。すべて処理された場合は，Ｓ６６へ行く。そうではない場合は，Ｓ６５へ行く。
〔Ｓ６５〕：取り出した画素の右の画素を取り出す。Ｓ６２へ行く。
〔Ｓ６６〕：下の行に移動する。Ｓ６７へ行く。
〔Ｓ６７〕：該子領域の全行が全て処理されたかどうかを判定し，全部処理された場合は，Ｓ６８へ行く。また残った場合は，Ｓ６１へ行く。
〔Ｓ６８〕：Ｂ＋２Ｃ＋Ｄ個の子領域は全て処理されたかどうかを判定し，全部処理された場合は，終了する。もた残った子領域があれば，Ｓ６９へ行く。
〔Ｓ６９〕：未処理の子領域に移動する。Ｓ６１へ行く。
【００７０】
次に文字カテゴリに属しているすべての学習サンプルから，文字カテゴリデータを作成する手段９の動作をフローチャートを用いて説明する。図２５は文字カテゴリデータ作成手段９の動作手順を示すフローチャートである。
〔Ｓ７０〕：文字の個数をｍと設定し，文字特徴量ベクトル及びカテゴリデータベクトルの次元数をｎと設定する。文字の学習順番ｉ＝１と設定する。
〔Ｓ７１〕：文字ｉの学習サンプルの個数をａ（ｉ）と設定し，次元数ｊ＝１と設定する；
〔Ｓ７２〕：学習サンプル特徴量のｊ次元目の列挙した値を記憶する集合Ｓを空にする。カテゴリデータのｊ次元目の値Ｃａｔ（ｉ，ｊ）＝０と設定し，サンプルの学習順番ｋ＝１と設定する。
〔Ｓ７３〕：文字ｉの第ｋ番目の学習サンプルのｊ次元目の値Ｓａｍ（ｉ，ｋ，ｊ）が集合Ｓに含まれるかどうかを判断する。含まれている場合は，Ｓ７５へ行く。含まれていない場合はＳ７４へ行く。
〔Ｓ７４〕：Ｓａｍ（ｉ，ｋ，ｊ）を集合Ｓにに加える。
〔Ｓ７５〕：次に学習するサンプルを設定する。
〔Ｓ７６〕：文字ｉのすべての学習サンプルを学習した場合は，Ｓ７７へ行く。
学習するサンプルはまた残った場合は，Ｓ７３へ行く。
〔Ｓ７７〕：集合Ｓから１の要素ｅを取り出す。Ｓ７８へ行く。
〔Ｓ７８〕：Ｃａｔ（ｉ，ｊ）の第ｅ＋１ビットに“１”を代入する。
〔Ｓ７９〕：集合Ｓから要素ｅを削除する。Ｓ８０へ行く。
〔Ｓ８０〕：集合Ｓが空であるかどうかを判定する。空の場合は，Ｓ８１へ行く。空ではない場合は，Ｓ７７へ行く。
〔Ｓ８１〕：次に学習する次元を設定する。
〔Ｓ８２〕：すべての次元が処理されたら，Ｓ７２へ行く。そうではない場合は，Ｓ８３へ行く。
〔Ｓ８３〕：次に学習する文字を設定する。
〔Ｓ８４〕：すべての文字が学習された場合は，終了する。学習文字がまた残った場合は，Ｓ７１へ行く。
【００７１】
次に認識手段７の動作をフローチャートを用いて説明する。図２６は認識手段７の動作手順を示すフローチャートである。
〔Ｓ９０〕：認識辞書に格納している文字カテゴリデータの個数をｍと設定し，文字カテゴリデータの比較順番ｉ＝１，最大類似度の初期値Ｓ_ｍａｘ＝０，認識結果を記憶する変数Ｒｅｓ＝０にする。
〔Ｓ９１〕：類似度計算手段６を用いて，入力された未知文字Ｘと認識辞書に格納されている文字ｉのカテゴリデータＣａｔ（ｉ）間の類似度Ｓ（Ｘ，Ｃａｔ（ｉ））を計算する。
〔Ｓ９２〕：類似度Ｓ（Ｘ，Ｃａｔ（ｉ））が最大類似度Ｓ_ｍａｘより大きい場合は，Ｓ９３へ行く。大きくない場合は，Ｓ９４へ行く。
〔Ｓ９３〕：類似度Ｓ（Ｘ，Ｃａｔ（ｉ））を最大類似度Ｓ_ｍａｘにコピーし，文字ｉを認識の結果としてＲｅｓに記憶させる。
〔Ｓ９４〕：次に比較する文字カテゴリデータを設定する。
〔Ｓ９５〕：すべての文字カテゴリデータが比較された場合は，終了する。比較する文字カテゴリデータがまた残った場合は，Ｓ９１へ行く。
【００７２】
次に本発明の認識装置を用いて，具体的に文字を認識したときの認識率及び認識速度について説明する。
【００７３】
文字の学習サンプルは，紙に印刷された文字画像をスキャナでコンピュータに入力されたものである。文字の個数は３４５５個である。１３種類の文字フォントから文字毎に平均７００個の学習サンプルを用意した。Ａ＝１６，Ｂ＝６４，Ｃ＝５６，Ｄ＝４９と設定し，本発明の特徴量抽出手段を用いて，４１１次元の複合特徴量（１２４次元のペリフェラル特徴量＋６２次元のストローク特徴量＋２２５次元のメッシュ特徴量）を抽出した。
【００７４】
文字毎に，該文字のすべての学習サンプルから該文字のカテゴリデータを求め，認識辞書を作成する。従来の認識方法と比較するために，文字毎に，文字カテゴリに属しているすべての学習サンプルの中心値を求め，各次元毎に，重みｗ_ｉを求める。求められた文字カテゴリの中心値を該文字の代表とし，認識辞書を作成する。また，すべての文字に対して，文字カテゴリに属しているすべての学習サンプルを用いて，該文字カテゴリの共分散行列，固有値及び固有ベクトルを求める。
【００７５】
本発明の認識方法及び従来の認識方法を用いて，学習したサンプルを認識する実験を行った。従来の認識方法は，それぞれシテイブロック距離，ユークリッド距離，重み付きユークリッド距離，投影距離（Ｊ＝３）を用いて認識を行う方法である。次の表は実験の結果を表している。
【００７６】
【表１】

【００７７】
従来認識方法の中に，もっとも高い認識率は９７．８％であり，平均認識時間は８８ｍｓであった。本発明の認識方法の認識率は９９．８％であり，平均認識時間は２１ｍｓであった。
【００７８】
従って，文字認識分野における未知文字を認識する問題に対して，より高い認識精度かつ高速に文字を認識することが可能になる。
【００７９】
以上の説明から明らかなように，本実施例の認識装置は，文字の複合特徴量を抽出し，文字カテゴリに属しているすべての学習サンプルを用いて文字カテゴリデータを求め，求められた文字カテゴリデータを認識辞書に記憶させ認識辞書を作成しておく。文字を認識するとき，文字パターンと文字カテゴリ間の類似度の計算方法を用いて，入力された未知文字を前記作成された認識辞書に格納されているすべての文字カテゴリと比較し，もっとも類似な文字カテゴリを認識の結果として出力される。これにより，入力された未知文字を高精度・高速かつ簡単に認識することができる。
【００８０】
【発明の効果】
以上説明したように，本発明によれば，特徴量の分布に対応するビット列データからなる文字カテゴリデータと，認識対象の同様の文字パターンの文字パタンデータとを比較して文字認識を行なうので学習サンプルの特徴量の分布に応じた類似となり，分布により精度が落ちることがない。さらに，複数種類の特徴量のビット列パターンを連結させればより正確な認識が可能となる。
【図面の簡単な説明】
【図１】本発明の認識装置の実施の一形態を示すブロック図である。
【図２】文字の複合特徴量を示す図である。
【図３】認識辞書内のデータを示す図である。
【図４】本発明の認識装置の構成を示す図である。
【図５】特徴量抽出手段２の実施の一形態を示すブロック図である。
【図６】特徴量抽出手段３の実施の一形態を示すブロック図である。
【図７】特徴量抽出手段４の実施の一形態を示すブロック図である。
【図８】横区分分割手段で分割された区分の様子を表す図である。
【図９】縦区分分割手段で分割された区分の様子を表す図である。
【図１０】特徴量抽出手段２で文字“Ａ”の特徴量を抽出する様子を示す図である。
【図１１】特徴量抽出手段３で文字“Ａ”の特徴量を抽出する様子を示す図である。
【図１２】子領域分割手段で分割された子領域の様子を表す図である。
【図１３】文字カテゴリデータの作成手段９の実施の一形態を示すブロック図である。
【図１４】文字カテゴリデータの構造を示す図である。
【図１５】文字カテゴリデータを求める方法の説明図である。
【図１６】文字カテゴリデータの意味を説明する図である
【図１７】従来技術及び本発明の技術による文字の認識範囲を示す図である。
【図１８】特徴量抽出手段２の動作手順を示すフローチャートである。
【図１９】特徴量抽出手段２の動作手順を示すフローチャートである。
【図２０】特徴量抽出手段２の動作手順を示すフローチャートである。
【図２１】特徴量抽出手段２の動作手順を示すフローチャートである。
【図２２】特徴量抽出手段３の動作手順を示すフローチャートである。
【図２３】特徴量抽出手段３の動作手順を示すフローチャートである。
【図２４】特徴量抽出手段４の動作手順を示すフローチャートである。
【図２５】文字カテゴリデータの作成手段の動作手順を示すフローチャートである。
【図２６】認識手段の動作手順を示すフローチャートである。
【図２７】文字カテゴリに属している学習サンプルの分布範囲と認識範囲を示す図である。
【図２８】従来技術で認識を行うときの問題点を示す図である。
【符号の説明】
１文字画像入力手段，２〜４特徴量抽出手段，５複合特徴量を求める手段，６文字パターンとカテゴリ間の類似度の計算手段，７認識手段，９文字カテゴリデータ作成手段，Ｘ入力された未知文字，Ｃａｔ（ｉ）認識辞書に格納している文字ｉのカテゴリデータ。[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a recognition device and a recognition method, and more particularly to a recognition device that performs character recognition and a recognition method that performs character recognition.
[0002]
[Prior art]
In the character recognition field, for each character, a standard character pattern of the character is obtained using all learning samples belonging to the character category, and the obtained standard character pattern is stored in the recognition dictionary. The most common recognition method is to compare the input unknown character pattern with all the standard character patterns stored in the recognition dictionary and output the closest one as a result of recognition. Here, the character feature selection method, the standard character pattern creation method, the distance scale, or the similarity scale are important factors that affect the recognition accuracy.
[0003]
As a standard character pattern creation method, there is a method of creating a recognition dictionary by storing, for each character, the central value of all learning samples belonging to the character category in the recognition dictionary as the standard character pattern of the character. However, when the distribution of learning samples belonging to the character category varies and the number is large, there is a problem that the recognition rate is low.
[0004]
In order to increase the recognition rate, there is a method of performing recognition using a plurality of standard character patterns for each character. For example, Japanese Laid-Open Patent Publication No. 63-129488 discloses a method of storing a plurality of standard character patterns for each character in a recognition dictionary and recognizing using the recognition dictionary in order to recognize multi-font characters. Was proposed. There is also a method of creating a recognition dictionary by learning a learning sample, correcting a corresponding standard character pattern, or adding a new standard character pattern. For example, the method described in JP-A-7-28955 is as described above. However, these methods have a problem that recognition time is long because the number of standard character patterns in the recognition dictionary is large. If the number of characters is large, the processing time required for character recognition cannot be ignored.
[0005]
In order to shorten the recognition time, for example, in Japanese Patent Application Laid-Open No. 10-162103, a handwritten character recognition dictionary is created using a handwritten character learning sample, and a type character recognition dictionary is created using a printed character learning sample. When recognizing, a method has been proposed in which it is determined whether the input unknown character is a handwritten character or a printed character, and recognition is performed using a handwritten character recognition dictionary for handwritten characters and a printed character recognition dictionary for printed characters. Yes. However, since there are many types of character fonts, it is not easy to distinguish all types of character fonts. Especially in the case of handwritten characters, the distribution of learning samples does not follow a certain rule. Representing all learning samples belonging to the character category has a problem of low recognition rate.
[0006]
Many distance scales or similarity scales have been proposed. Typical examples include city block distance, Euclidean distance, weighted Euclidean distance, Mahalanobis distance, and projection distance. These methods are described in the literature “Image processing and recognition” Takeshi Aoi and Tomoharu Nagao (1992, Shogodo) and “Basic multivariate analysis” Choichiro Asano and Shinko Ejima (Japanese Standards Association), “Projection in Handwritten Character Recognition” The distance method is described in Masayuki Ikeda, Hidehiko Tanaka, Tatsu Okamoto (Theory of Information Processing, vol. 24, no. 1, pp. 106-112, 1983). Character X = (x ₁ , X ₂ , ..., x _n ) And the letter Y = (y ₁ , Y ₂ , ..., y _n City block distance D between _c (X, Y) is calculated by the following formula. Here, | Z | represents the absolute value of Z.
[0007]
[Expression 2]

[0008]
Character X = (x ₁ , X ₂ , ..., x _n ) And the letter Y = (y ₁ , Y ₂ , ..., y _n ) Euclidean distance D between _e (X, Y) is calculated by the following formula.
[0009]
[Equation 3]

[0010]
Learning sample of letter i ₁ , S ₂ , ... S _k Sample S ₁ , S ₂ , ... S _k Is the center value of i.e., the standard character pattern of letter i _i Represented by Character X = (x ₁ , X ₂ , ..., x _n ) And standard character pattern U _i = (U _i1 , U _i2 , ..., u _in ) Weighted Euclidean distance D between _w (X, U _i ) Is calculated by the following formula.
[0011]
[Expression 4]

here,
[0012]
[Equation 5]

It is. Character X = (x ₁ , X ₂ , ..., x _n ) And standard character pattern U _i = (U _i1 , U _i2 , ..., u _in ) Mahalanobis distance D between _m (X, U _i ) Is calculated by the following formula.
[0013]
[Formula 6]

Where Σi represents the covariance matrix of the learning sample of letter i, and Z ^-1 Is the inverse of the matrix Z and Z ^T Is a transposed matrix of the matrix Z. Pattern X = (x ₁ , X ₂ , ..., x _n ) And standard pattern U _i = (U _i1 , U _i2 , ..., u _in ) Projection distance D _t (X, U _i ) Is calculated by the following formula.
[0014]
[Expression 7]

Where Φ _j Is an eigenvector corresponding to the eigenvalue located at the j-th when the eigenvalues calculated from the pattern learning samples are arranged in descending order, and (α, β) represents the inner product of the vectors α and β.
[0015]
The city block distance, the Euclidean distance, and the weighted Euclidean distance can be obtained relatively easily, but it is difficult to guarantee a high recognition rate. Mahalanobis distance has an occurrence probability of χ ² This is the distance for the data according to the distribution, and the distance is calculated closer to the center of the distribution with a higher probability of occurrence. However, the distribution of the actual character learning sample is χ ² The recognition rate cannot be guaranteed because it does not follow the distribution. In addition, since the covariance matrix of characters is stored, the recognition dictionary is huge, and enormous calculation time is required.
[0016]
The above-described prior art has two features. (1) A character category is represented by one or a plurality of standard character patterns; (2) A character pattern is compared using a distance between character patterns and a character pattern, or similarity. Next, features (1) and (2) of the prior art show that it is an important cause of misidentification.
[0017]
The learning samples belonging to the character category generally do not follow a certain distribution, may be concentrated and may be scattered apart. When the character category is represented by one standard character pattern, as shown in FIG. 27, the recognition range of the character P is centered on the standard character pattern of the character (from the feature (1)), and the most typical character pattern. The distance between the learning sample belonging to the distant character category and the standard character pattern is a radius (from the feature (2)) to become a multidimensional circle E1. That is, if the input unknown character pattern falls within the E1 range, the possibility of being recognized as the character P is very high. However, since the recognition range E1 is larger than the distribution range E2 of the actual character learning sample, it overlaps the recognition range of many characters. When recognizing, if the entered unknown character pattern enters the overlapping area, it may be recognized incorrectly. For example, as shown in FIG. 28, the actual distribution range E4 of the character P1 and the actual distribution range E6 of the character P2 do not overlap, but the recognition range E3 of the character P1 and the recognition range E5 of the character P2 overlap. Since the input unknown character X falls within the actual distribution range E4 of the character P1, it should be recognized as the character P1, but since X falls within the recognition range where P1 and P2 overlap, the character P2 It is recognized incorrectly. That is, since the distance between the standard character pattern of X and the character P2 is smaller than the distance between the standard character patterns of X and the character P1, it is mistaken for the character P2.
[0018]
When recognition is performed using a plurality of standard characters for each character, the number of characters with overlapping recognition ranges is reduced and the recognition accuracy is improved to some extent, but this is not an essential solution.
[0019]
When performing recognition using a distance function or similarity function that reduces the character recognition range according to the character distribution or assumes a character distribution and approximates the shape of the distribution assuming the recognition range ( (For example, weighted Euclidean distance, Mahalanobis distance, etc.), the number of characters with overlapping recognition ranges is reduced and the recognition accuracy is improved to some extent, but the recognition rate is high for characters that do not follow a certain distribution rule There is a problem that cannot be guaranteed, and it is not an essential solution.
[0020]
[Problems to be solved by the invention]
The present invention has been made in view of the above-described circumstances, and solves the problem of lowering the recognition rate when performing character recognition using a standard character pattern representing a character category, and provides a simple recognition method with a high recognition rate. It is intended to provide.
[0021]
[Means for Solving the Problems]
In order to solve the above-mentioned problems, the present invention adopts a configuration as described in the claims. That is, in the specific configuration of the present invention, the peripheral feature value, stroke feature value, and mesh feature value of the character pattern are extracted, and the extracted three types of feature values are arranged to obtain the composite feature value of the character. For each character, enumerate the dimension values for each dimension of the learning sample feature value from all the learning samples belonging to the character category, convert the enumerated values, and convert the converted values for each dimension. The character category data is stored in the recognition dictionary, and a recognition dictionary is created. When recognizing, the method of calculating the similarity between character patterns and character categories is used to calculate the similarity between the input unknown character pattern and all character category data stored in the recognition dictionary. By outputting the character category as a result of recognition, it is possible to recognize the character with high accuracy, high speed and easily.
[0022]
In addition, according to the present invention, character recognition is performed by comparing character category data composed of bit string data corresponding to the distribution of feature amounts with character pattern data of similar character patterns to be recognized, so that recognition can be performed with high accuracy. The Furthermore, more accurate recognition is possible by connecting bit string patterns of a plurality of types of feature values.
[0023]
The present invention can be realized as an apparatus and a method, and at least a part of the method can be implemented as a computer program. It goes without saying that a recording medium (program package) in which the computer program is recorded and a recording medium in which the computer program for installing the computer program in a computer system is recorded are included in the technical scope of the present invention.
[0024]
DETAILED DESCRIPTION OF THE INVENTION
FIG. 1 is a block diagram showing an embodiment of a recognition apparatus of the present invention. In the figure, 1 is a means for inputting a character image for one character, 2 is a means for extracting a peripheral feature amount of a character, 3 is a means for extracting a stroke feature amount of a character, and 4 is a mesh feature amount of a character. Means, 5 means for determining the composite feature of the character, 6 means for calculating the similarity between the character pattern and the character category, 7 for recognition means, 8 for recognition dictionary creation means, 8a for means for storing the recognition dictionary, Reference numeral 9 denotes character category creation means, and reference numeral 10 denotes storage means.
[0025]
The memories M1, M2 and M3 store the peripheral feature value, the stroke feature value and the mesh feature value extracted by the feature

value extracting means

2, 3 and 4, respectively. The memory M4 stores the composite feature amount of the character obtained by the means 5 for obtaining the composite feature amount. The memory M5 stores the name and category data of the character most similar to the input unknown character pattern detected by the recognition means 7 from the recognition dictionary.
[0026]
The feature quantity extraction means 2 divides the character image for one character input by the character image input means 1 into 2A-1 sections horizontally and 2A-1 sections vertically, respectively. Peripheral feature values are extracted by scanning each section with / P as the scanning range of each section. The feature quantity storage means 2a stores the extracted peripheral feature quantity in the memory M1.
[0027]
The feature amount extraction means 3 divides the character image for one character inputted by the character image input means 1 into 2A-1 section horizontally and 2A-1 section vertically, and the scanning range of each section is divided into character images. Stroke feature values are extracted by scanning each section as width or height. The feature amount storage means 3a stores the extracted stroke feature amount in the memory M2.
[0028]
The feature quantity extraction unit 4 divides the character image for one character input by the character image input unit 1 into B regions, 2C units, and D units of a b image * b image, Scan and extract mesh features. The feature quantity storage means 4a stores the extracted mesh feature quantity in the memory M3.
[0029]
The means 5 for obtaining the composite feature value arranges the extracted three types of feature values and obtains it as one feature value. The composite feature quantity storage means 5a stores the obtained composite feature quantity in the memory M4. FIG. 2 shows a composite feature quantity 50 of characters. A composite feature value 50 includes a peripheral feature value 51, a stroke feature value 52, and a mesh feature value 53.
[0030]
The character category data creating means 9 creates character category data using all learning samples belonging to the character category. Using the created character category data, the recognition dictionary creating means 8 creates a recognition dictionary. The created recognition dictionary is stored in the recognition dictionary storage means 8a. FIG. 3 is a diagram showing recognition dictionary data in the recognition dictionary. The recognition dictionary data 60 is composed of data 61 to 6m of all characters (m). Each character data is composed of a character name and a character category data vector.
[0031]
The recognition unit 7 uses the similarity calculation unit 6 to obtain a character category most similar to the input unknown character pattern from the character category data stored in the recognition dictionary, and stores the result in the memory M5. . The storage means 10 stores the name of the character recognized by the recognition means 7 and the category data.
[0032]
Next, as a device application example of the character recognition device of the present invention, a device configuration when applied to an information terminal device will be described. FIG. 4 is a diagram showing an apparatus configuration when the character recognition apparatus of the present invention is applied to an information terminal apparatus.
[0033]
The information terminal device 70 includes a keyboard 71, an external storage device 72, a display 73, and a processor unit 74. The keyboard 71 is an input device for a user to instruct an operation, and other input devices may be added. The external storage device 72 stores input unknown character pattern data, recognition dictionary data, recognition results, and software. Also, the feature quantity storage means 2a, 3a, 4a, the composite feature quantity storage means 5a, and the recognition dictionary storage means 8a can be configured as a part of the external storage device 72. Furthermore, the character name and category data recognized by the storage means 10 may be stored. As a specific example of the external storage device 72, for example, a hard disk can be used. The display 73 is an output device for displaying a message to the user, data of recognized characters, a recognition result, and the like. Of course, other output devices may be added. The processor unit 74 performs actual processing according to software stored in the external storage device 72. Specifically, the processor unit 74 can be configured by a computer system such as a microprocessor or a personal computer. The character feature quantity extraction means 2, 3, 4, the composite feature quantity obtaining means 5, the character category data creation means 9, the similarity calculation means 6, and the recognition means 7 are executed by software operating on the processor unit 74 Can be configured.
[0034]
Next, the operation of the character recognition apparatus of the present invention will be described in more detail. First, the feature quantity extraction unit 2 will be described.
[0035]
FIG. 5 is a block diagram showing an embodiment of the feature quantity extraction means 2. The memories M21 to M24 store a character image for one character input by the character image input means 1.
The horizontal region dividing means 21 divides the character image for one character stored in the memory M21 horizontally into A sections. For example, FIG. 8A shows a state in which the character image is divided into 4 (A = 4) sections horizontally. The horizontal area dividing means 22 divides the lower half of the k (k = 1, 2,..., A-1) section and the upper half of the k + 1 section from the A section divided by the horizontal area dividing means 21 into one section. The character image for one character stored in the memory M22 is divided horizontally into A-1 sections. For example, FIG. 8B shows a state in which the character image is divided into 3 (A-1 = 4-1 = 3) sections horizontally. The vertical area dividing means 23 divides the character image for one character stored in the memory M23 vertically into A sections. For example, FIG. 9A shows a state in which the character image is vertically divided into 4 (A = 4) sections. The vertical area dividing means 24 performs the right half of the k (k = 1, 2,..., A-1) section and the left half of the k + 1 section with respect to the A section divided by the vertical area dividing means 23. A state is shown in which one character image stored in the memory M24 is divided into three (A-1 = 4-1 = 3) sections vertically. Here, the number of horizontal divisions and the number of vertical divisions may be different.
[0036]
The scanning range control means 26 controls the horizontal section to scan from the total two sides of the circumscribed rectangle of the character image to 1 / P of the character width in the character direction. It controls scanning from the total two sides of the circumscribed rectangle of the character image to 1 / P of the character height in the character direction. Here, P is a positive integer.
[0037]
The feature extraction means 25 first starts from the left side of the character image in the scanning range restricted by the scanning range restriction means 26 for each of the horizontal 2A-1 divisions divided by the area division means 21 and 22. Scan a times (a = height of the character image / A), count the number of pixels in the background image up to the first pixel (black pixel) constituting the character, and count the number of pixels scanned a times Find the average value of. Subsequently, for each of the horizontal 2A-1 sections divided by the area dividing means 21 and 22, scanning is performed a times from the right side of the character image in the scanning range limited by the scanning range limiting means 26 ( a = height of the character image / A), the number of pixels of the background image up to the first pixel (black pixel) constituting the character is counted, and an average value of the counted number of pixels is obtained by scanning a times. . For each of the vertical 2A-1 sections divided by the area dividing means 23 and 24, scanning is performed a times from the upper side of the character image in the scanning range limited by the scanning range limiting means 26 (a = Width of the character image / A), the number of pixels of the background image up to the first pixel (black pixel) constituting the character is counted, and an average value of the counted number of pixels is obtained by scanning a times. Finally, for each of the vertical 2A-1 sections divided by the area dividing means 23 and 24, scanning is performed a times from the lower side of the character image in the scanning range limited by the scanning range limiting means 26 ( a = width of the character image / A), the number of pixels of the background image up to the first pixel (black pixel) constituting the character is counted, and an average value of the counted number of pixels is obtained by scanning a times. FIGS. 10A and 10B are diagrams showing how feature values of the horizontal 7 (2A-1) section divided by the area dividing means 21 and 22 are extracted when A = 4 and P = 3. is there. FIGS. 10C and 10D are diagrams showing how feature quantities of 7 vertical (2A-1) sections divided by the area dividing means 23 and 24 are extracted when A = 4 and P = 3. is there.
[0038]
The storage unit 2a stores the feature amount extracted by the feature amount extraction unit 25 in the memory M1 shown in FIG.
[0039]
Next, the feature quantity extraction unit 3 will be described. FIG. 6 is a block diagram showing an embodiment of the feature quantity extraction means 3. The memories M31 to M34 store a character image for one character input by the character image input means 1.
[0040]
The horizontal region dividing means 31 divides the character image for one character stored in the memory M31 horizontally into A sections. For example, FIG. 8A shows a state in which the character image is divided into 4 (A = 4) sections horizontally. The horizontal area dividing means 32 divides the lower half of the k (k = 1, 2,..., A-1) section and the upper half of the k + 1 section into one section with respect to the A section divided by the horizontal area dividing means 31. The character image for one character stored in the memory M32 is divided horizontally into A-1 sections. For example, FIG. 8B shows a state in which the character image is divided into 3 (A-1 = 4-1 = 3) sections horizontally. The vertical area dividing means 33 divides the character image for one character stored in the memory M33 vertically into A sections. For example, FIG. 9A shows a state in which the character image is vertically divided into 4 (A = 4) sections. The vertical area dividing means 34 calculates the right half of the k (k = 1, 2,..., A-1) section and the left half of the k + 1 section with respect to the A section divided by the vertical area dividing means 33. A state is shown in which one character image stored in the memory M34 is divided into 3 (A-1 = 4-1 = 3) segments vertically. Here, the number of horizontal divisions and the number of vertical divisions may be different.
[0041]
The feature extraction means 35 first scans a character image width a times from the left side of the character image for each of the horizontal 2A-1 divisions divided by the area division means 31 and 32, using the width of the character image as a scanning range ( a = height of the character image / A), changes from a background pixel (white pixel) to a pixel (black pixel) constituting a character, and from a pixel (black pixel) constituting a character to a background pixel (white pixel) The number of times is counted, and an average value of the number of times counted is obtained by scanning a times. Subsequently, for each of the vertical 2A-1 sections divided by the area dividing means 33 and 34, the height of the character image is used as a scanning range and scanned a times from the upper side of the character image (a = the character image). Width / A), the number of changes from the background pixel (white pixel) to the pixel (black pixel) constituting the character, and from the pixel constituting the character (black pixel) to the background pixel (white pixel), The average value of the number of times counted by scanning is obtained. FIGS. 11A and 11B are diagrams illustrating a state in which the feature amount of the horizontal 7 (2A-1) section divided by the area dividing means 31 and 32 is extracted when A = 4. 11 (c) and 11 (d) are diagrams showing how the feature amount of the vertical 7 (2A-1) section divided by the area dividing means 33 and 34 is extracted when A = 4.
[0042]
The storage means 3a stores the feature quantity extracted by the feature quantity extraction means 35 in the memory M2 shown in FIG.
[0043]
Next, the feature quantity extraction unit 4 will be described. FIG. 7 is a block diagram showing an embodiment of the feature quantity extraction means 4. The memories M41 to M44 store a character image for one character input by the character image input means 1.
[0044]
The child area dividing means 41 divides the character image for one character stored in the memory M41 into B child areas of size b pixels * b pixels. For example, FIG. 12A shows a state in which the character image is divided into 16 (B = 16) child regions by the child region dividing means 41. The child area dividing means 42, for the B child areas divided by the child area dividing means 41, for each child area other than the child areas on the right side of the character image, The left half of the child area is divided into C child areas. FIG. 12B shows a state in which the character image is divided into 12 (when B = 16) child regions by the child region dividing means 42. The child area dividing means 43 lowers the lower half and the lower half of the child area for each of the B child areas divided by the child area dividing means 41 for each child area other than the child areas below the character image. The upper half of the adjacent child area is defined as one child area and is divided into C child areas. FIG. 12C shows a state in which the character image is divided into 12 (when B = 16) child regions by the child region dividing means 43. The child area dividing unit 44 is configured to lower the lower half and the lower half of the child area for each of the C child areas divided by the child area dividing unit 42 except for the child areas below the character image. The upper half of the adjacent child area is defined as one child area and is divided into D child areas. FIG. 12D shows a state in which the character image is divided into 9 (when B = 16, C = 12) child regions by the child region dividing means 44. Here, b and B are both positive integers, and b * B = width (or height) of the character image.
[0045]
The feature extraction means 45 scans from the left side of the child area image for each child area of the B, C, C, and D child areas divided by the area dividing means 41, 42, 43, and 44, respectively. Count the number of pixels (black pixels)
[0046]
The storage unit 4a stores the feature amount extracted by the feature amount extraction unit 45 in the memory M3 shown in FIG.
[0047]
Next, the means 5 for determining the composite feature quantity of characters will be described. The means 5 for obtaining the composite feature amount arranges the feature amounts extracted by the feature extraction means 2, the feature extraction means 3 and the feature extraction means 4, and stores them in the memory M4 shown in FIG.
[0048]
Next, the character category data creation means 9 when the character category data is stored in the recognition dictionary storage means 8a will be described. FIG. 13 is a block diagram showing an embodiment of the character category data creating means 9.
[0049]
The memory M90 stores all learning sample feature values of one character. The memories M91, M92, M93,..., M9n (n is the number of dimensions of the feature vector) store the enumerated values of each dimension of the feature.
[0050]
The character sample feature amount input means 90 inputs all the learning sample feature amounts of one character and stores them in the memory M90.
[0051]
The enumeration means 91 enumerates possible values of dimensions for each dimension from the feature values of all learning samples of one character stored in the memory M90, and the enumerated values of the dimensions are respectively stored in the memories M91, M92, M93, ..., M9n are stored.
[0052]
The feature amount change range determining unit 94 includes the number of pixels b in the child area divided by the character image dividing unit 41 (42, 43, 44). ² Let (the maximum value of the mesh feature amount) be the change range of the character feature amount.
[0053]
As shown in FIG. 14, the category data expression means 93 is represented by an n-dimensional vector, and each dimension is represented by b. ² Represented by +1 bit.
[0054]
The means 92 for converting the enumerated values converts the enumerated values of each dimension stored in the memories M91, M92, M93, ..., M9n. Enumerated values {e stored in the memory M9i (i = 1, 2,..., N) {e _i1 , E _i2 ,. ..., e _is } For the i-th dimension of the category data _ij The value of the +1 bit is set to “1” (j = 1, 2,..., S), and the values of the other bits are set to “0”.
[0055]
The storage unit 8a stores the obtained category data in the recognition dictionary.
[0056]
FIG. 15A shows five learning samples belonging to the character category. Here, the dimension number n of the character feature amount is 6 and the change range of the character feature amount is 16. Therefore, the character category data is represented by a 6-dimensional vector, and each dimension is represented by 17 bits. FIG. 15B shows the values of each dimension listed by the listing means 91. For example, the listed first dimension values are 3, 4, 6, and 8, and the second dimension values are 8, 10, 11, and 12. FIG. 15C shows character category data obtained by the conversion means 92.
[0057]
As can be seen from the method of creating character category data, the character category data indicates a range of positions where all learning samples belonging to the character category appear in each dimension in the n-dimensional space. For example, FIG. 16 shows a position range in which all learning samples belonging to the character category appear in the first and second dimensions. Here, a1 and a2 are the first-dimensional position range, and b1 and b2 are the second-dimensional position range. The position range that appears in each dimension may be continuous or discrete. For example, with respect to the character category data shown in FIG. 15C, the position range appearing in one dimension is 3 to 4, 6 and 8, and the position range appearing in two dimensions is 8, 10 to 12. The 3rd, 5th and 6th dimensional position ranges are continuous, and the 1st, 2nd and 4th dimensional position ranges are discrete. The position range where the learning sample belonging to the character category indicated by the character category data appears for each dimension is the recognition range of the character. Four rectangles shown in FIG. 16 are recognition ranges of the characters. As shown in the figure, the recognition range is relatively close to the distribution of character learning samples, so that the number of characters with overlapping recognition ranges can be greatly reduced. For example, for the seven characters P1, P2,..., P7 shown in FIG. 17A, the recognition range of P1 to P7 is the dotted circles E11 to E17 shown in FIG. is there. E11 overlaps E12 and E16, E12 overlaps E11, E13, E15, and E16, E13 overlaps E12 and E14, and E16 overlaps E11, E12, E15, and E17. However, according to the present invention, the character recognition range is E21 to E27 shown in FIG. As can be seen, E21, E22, ..., E27 do not overlap each other.
[0058]
Next, the means 6 for calculating the similarity between the character pattern and the character category will be described. The similarity calculation means 6 is used to calculate the unknown character X = (x ₁ , X ₂ , ..., x _n ) And character category data Cat (i) = (cat) stored in the recognition dictionary ₁ (I), cat ₂ (I), ..., cat _n The similarity S (X, Cat (i)) between (i)) is calculated as follows.
[0059]
[Equation 8]

Here, f (a, b) = 1, the value of the a + 1 bit of if b = 1; f (a, b) = 0, and the value of the a + 1 bit of if b = 0.
[0060]
As can be seen from the definition of the function f (), the value x of the jth dimension of the input unknown character X _j When entering the position range of the jth dimension of the category data, the similarity is slightly higher. Conversely, the j-dimensional value x of the input unknown character X _j If the position is outside the position range of the jth dimension of the category data, the similarity is slightly lower. If f () = 1 for all dimensions, similarity = 1, so the similarity between all learning samples belonging to a category and category data of the character is the same, and “1”. is there. When recognizing, if the unknown character X enters the recognition range of the character P indicated by the character category data, S (X, P) = 1, and the character P is output as a recognition result. This is a part that could not be realized by the prior art.
[0061]
The character category data creation method and the similarity calculation method between character patterns and character categories according to the present invention approximate human recognition functions. When a person remembers a feature of a thing, each feature of the thing and the change range of the feature amount are remembered. For example, when recalling the characteristics of “apple”, “the color is red, yellow or blue, there is no black; the taste is sweet, sweet and sour, etc., it is not spicy; ~ 450 grams or so; ”is naturally recalled. In other words, when learning, it is considered that each feature and the change range of the feature value are stored by taking each feature value to be learned. For example, after learning various “apples”, features such as “color”, “shape”, “taste”, “weight”, “height”, “width”, etc. “Red, Blue, Yellow”, “Weight” feature change range should be “150 to 400 gram”, “Height” feature change range should be “6 to 12 centimeter” It is. When recognizing, if the value of the obtained feature value is within the change range of the learned feature value of “apple”, it should be recognized as “apple”. Of course, since humans have a function called association, they can recognize unlearned apples. This is because an unlearned apple is similar to a learned apple.
[0062]
Next, the recognition means 7 will be described. The recognition means 7 uses the means 6 for calculating the similarity between characters and character categories, and the similarity between the unknown character pattern stored in the memory M4 and all character category data stored in the recognition dictionary. And the character category most similar to the unknown character is output to the memory M5 as a recognition result.
[0063]
Next, an operation when the feature amount extracting unit 2 extracts a peripheral feature value of a character from the input character image for one character will be described with reference to a flowchart. 18 to 21 are flowcharts showing the operation procedure of the feature quantity extraction means 2. FIG. 18 is a flowchart of an operation procedure for extracting the feature quantity of each segment by scanning the segment from the left side of each segment of the 2A-1 segment obtained by dividing the character image horizontally.
[S1]: Move to an unprocessed section, set the initial value of the number of rows in the section as k = 1, and initialize a variable Fea representing the feature amount of the section.
[S2]: For each segment, the leftmost pixel in the top row of the segment is extracted.
[S3]: It is determined whether or not the extracted pixel is a background pixel. If it is a background image, the process goes to S4. If it is not a background pixel, go to S7.
[S4]: Fea = Fea + 1.
[S5]: It is determined whether or not the extracted pixel is the “width / P” th pixel of the row from the left side of the row, and if it is the “width / P” th pixel of the row, the process proceeds to S6. go. If not, go to S7.
[S6]: The right pixel of the extracted pixel is extracted. Go to S3.
[S7]: Move to the lower row and k = k + 1. Go to S8.
[S8]: It is determined whether or not all the rows in the section have been processed. If all the rows have been processed, the process goes to S9. If it remains, go to S2.
[S9]: The segment feature amount is obtained. Go to S10.
[S10]: It is determined whether or not all the horizontal 2A-1 sections have been processed. If all sections have been processed, the process ends. If there is a remaining segment, go to S1.
[0064]
FIG. 19 is a flowchart of an operation procedure for extracting the feature amount of each segment by scanning the segment from the right side of each segment of the 2A-1 segment obtained by dividing the character image horizontally.
[S11]: Move to an unprocessed section, set an initial value of the number of rows in the section as k = 1, and initialize a variable Fea representing the feature amount of the section.
[S12]: For each segment, the rightmost pixel in the top row of the segment is extracted.
[S13]: It is determined whether or not the extracted pixel is a background pixel. If it is a background image, the process goes to S14. If it is not a background pixel, go to S17.
[S14]: Fea = Fea + 1.
[S15]: It is determined whether or not the extracted pixel is the “width / P” th pixel of the row from the right side of the row, and if it is the “width / P” th pixel of the row, the process proceeds to S16. go. If not, go to S17.
[S16]: The pixel to the left of the extracted pixel is extracted. Go to S13.
[S17]: Move to the lower row and k = k + 1. Go to S18.
[S18]: It is determined whether or not all the lines in the section have been processed. If all the lines have been processed, the process goes to S19. If it remains, go to S12.
[S19]: The segment feature amount is obtained. Go to S20.
[S20]: It is determined whether or not all of the horizontal 2A-1 sections have been processed. If there is any remaining segment, go to S11.
[0065]
FIG. 20 is a flowchart of an operation procedure for extracting the feature amount of each segment by scanning the segment from the upper end of each segment of the 2A-1 segment obtained by vertically dividing the character image.
[S21]: Move to an unprocessed section, set an initial value of the number of columns in the section as k = 1, and initialize a variable Fea representing the feature amount of the section.
[S22]: For each section, the top pixel in the leftmost column of the section is extracted.
[S23]: It is determined whether or not the extracted pixel is a background pixel. If it is a background image, the process goes to S24. If it is not a background pixel, go to S27.
[S24]: Fea = Fea + 1.
[S25]: It is determined whether or not the extracted pixel is the “height / P” th pixel of the column from the upper end of the column, and if it is the “height / P” th pixel of the column, Go to S26. If not, go to S27.
[S26]: A pixel below the extracted pixel is extracted. Go to S23.
[S27]: Move to the right column and k = k + 1. Go to S28.
[S28]: It is determined whether or not all the columns of the section have been processed. If all the columns have been processed, the process goes to S29. If it remains, go to S22.
[S29]: The segment feature amount is obtained. Go to S30.
[S30]: It is determined whether or not all vertical 2A-1 sections have been processed. If all sections have been processed, the process ends. If there is a remaining segment, go to S21.
[0066]
FIG. 21 is a flowchart of an operation procedure for extracting the feature amount of each segment by scanning the segment from the lower end of each segment of each 2A-1 segment obtained by vertically dividing the character image.
[S31]: Move to an unprocessed section, set an initial value of the number of columns in the section as k = 1, and initialize a variable Fea representing the feature amount of the section.
[S32]: For each section, the bottom pixel in the leftmost column of the section is extracted.
[S33]: It is determined whether or not the extracted pixel is a background pixel. If it is a background image, the process goes to S34. If it is not a background pixel, go to S37.
[S34]: Fea = Fea + 1.
[S35]: It is determined whether or not the extracted pixel is the “height / P” th pixel of the column from the lower end of the column, and if it is the “height / P” th pixel of the column, Go to S36. If not, go to S37.
[S36]: A pixel above the extracted pixel is extracted. Go to S33.
[S37]: Move to the right column and k = k + 1. Go to S38.
[S38]: It is determined whether all the columns of the section have been processed. If all the columns have been processed, the process goes to S39. If it remains, go to S32.
[S39]: The segment feature amount is obtained. Go to S40.
[S40]: It is determined whether or not all vertical 2A-1 sections have been processed. If all sections have been processed, the process ends. If there is any remaining segment, go to S31.
[0067]
Next, an operation when the feature amount extraction unit 3 extracts a stroke feature amount of a character from the input character image for one character will be described with reference to a flowchart. 22 and 23 are flowcharts showing the operation procedure of the feature quantity extraction means 3. FIG. 22 is a flowchart of an operation procedure for extracting the feature quantity of each segment by scanning the segment from the left side of each segment of the 2A-1 segment obtained by dividing the character image horizontally.
[S41]: Move to an unprocessed section, set the initial value of the number of rows in the section as k = 1, and initialize the variable Fea representing the feature amount of the section.
[S42]: For each segment, the leftmost pixel in the top row of the segment and the pixel to the right of the pixel are extracted.
[S43]: It is determined whether or not the extracted pixel is the same as the pixel adjacent to the left side of the pixel. If not, go to S44.
[S44]: Fea = Fea + 1.
[S45]: If all pixels in the row have been processed, go to S47. If not, go to S46.
[S46]: The right pixel of the extracted pixel is extracted. Go to S43.
[S47]: Move to the lower row and k = k + 1. Go to S48.
[S48]: It is determined whether or not all lines in the section have been processed. If all the lines have been processed, the process goes to S49. If it remains, go to S42.
[S49]: The segment feature amount is obtained. Go to S50.
[S50]: It is determined whether or not all the horizontal 2A-1 sections have been processed. If all sections have been processed, the process ends. If there is any remaining segment, go to S41.
[0068]
FIG. 23 is a flowchart of an operation procedure for extracting the feature amount of each segment by scanning the segment from the upper end of each segment of the 2A-1 segment obtained by vertically dividing the character image.
[S51]: Move to an unprocessed section, set an initial value of the number of columns in the section as k = 1, and initialize a variable Fea representing the feature amount of the section.
[S52]: For each segment, the top pixel and the pixel below the pixel in the leftmost column of the segment are extracted.
[S53]: It is determined whether or not the extracted pixel is the same as the pixel above the pixel. If it is the same, the process goes to S56. If not, go to S54.
[S54]: Fea = Fea + 1.
[S55]: If all the pixels in the column have been processed, go to S57. If not, go to S56.
[S56]: The pixel below the extracted pixel is extracted. Go to S53.
[S57]: Move to the right column and k = k + 1. Go to S58.
[S58]: It is determined whether all the columns of the section have been processed. If all the columns have been processed, the process goes to S59. If it remains, go to S52.
[S59]: The segment feature amount is obtained. Go to S60.
[S60]: It is determined whether or not all vertical 2A-1 sections have been processed. If all sections have been processed, the process ends. If there is a remaining segment, go to S51.
[0069]
Next, an operation when the feature amount extraction unit 4 extracts a mesh feature amount of a character from the input character image for one character will be described with reference to a flowchart. FIG. 24 is a flowchart showing the operation procedure of the feature quantity extraction means 4.
[S61]: For each child region, the leftmost pixel in the top row of the child region is extracted.
[S62]: It is determined whether or not the extracted pixel is a background pixel. If it is a background pixel, the process goes to S65. If it is not a background pixel, the process goes to S63.
[S63]: The feature amount of the child region is increased to 1.
[S64]: It is determined whether all the pixels in the row have been processed. If all are processed, go to S66. If not, go to S65.
[S65]: The right pixel of the extracted pixel is extracted. Go to S62.
[S66]: Move to the lower line. Go to S67.
[S67]: It is determined whether or not all the rows in the child area have been processed. If all the rows have been processed, the process goes to S68. If it remains, go to S61.
[S68]: It is determined whether or not all B + 2C + D child areas have been processed. If all the child areas have been processed, the process ends. If there is any remaining child area, go to S69.
[S69]: Move to an unprocessed child area. Go to S61.
[0070]
Next, the operation of the means 9 for creating character category data from all learning samples belonging to the character category will be described using a flowchart. FIG. 25 is a flowchart showing the operation procedure of the character category data creation means 9.
[S70]: The number of characters is set to m, and the number of dimensions of the character feature vector and the category data vector is set to n. Character learning order i = 1 is set.
[S71]: Set the number of learning samples of the letter i as a (i) and set the number of dimensions as j = 1;
[S72]: The set S for storing the enumerated values of the j-th dimension of the learning sample feature quantity is emptied. The value Cat (i, j) of the jth dimension of the category data is set to 0, and the learning order k of the sample is set to 1.
[S73]: It is determined whether the set S includes the value Sam (i, k, j) of the j-th dimension of the k-th learning sample of the character i. If it is included, go to S75. If not, go to S74.
[S74]: Add Sam (i, k, j) to the set S.
[S75]: A sample to be learned next is set.
[S76]: If all the learning samples of the letter i have been learned, go to S77.
If samples to be learned remain, go to S73.
[S77]: One element e is extracted from the set S. Go to S78.
[S78]: “1” is substituted into the e + 1-th bit of Cat (i, j).
[S79]: The element e is deleted from the set S. Go to S80.
[S80]: It is determined whether or not the set S is empty. If it is empty, go to S81. If it is not empty, go to S77.
[S81]: The next dimension to be learned is set.
[S82]: When all dimensions are processed, go to S72. If not, go to S83.
[S83]: The character to be learned next is set.
[S84]: If all characters have been learned, the process ends. If the learning characters remain, go to S71.
[0071]
Next, the operation of the recognition unit 7 will be described using a flowchart. FIG. 26 is a flowchart showing the operation procedure of the recognition means 7.
[S90]: The number of character category data stored in the recognition dictionary is set to m, the character category data comparison order i = 1, and the maximum similarity initial value S _max = 0 and the variable Res = 0 for storing the recognition result is set.
[S91]: Using the similarity calculation means 6, the similarity S (X, Cat (i)) between the input unknown character X and the category data Cat (i) of the character i stored in the recognition dictionary is obtained. calculate.
[S92]: The similarity S (X, Cat (i)) is the maximum similarity S _max If larger, go to S93. If not, go to S94.
[S93]: Similarity S (X, Cat (i)) is set to maximum similarity S _max And the letter i is stored in Res as a result of recognition.
[S94]: Character category data to be compared next is set.
[S95]: If all character category data have been compared, the process ends. If character category data to be compared still remains, go to S91.
[0072]
Next, the recognition rate and recognition speed when a character is specifically recognized using the recognition apparatus of the present invention will be described.
[0073]
The character learning sample is a character image printed on paper and input to a computer by a scanner. The number of characters is 3455. An average of 700 learning samples were prepared for each character from 13 types of character fonts. A = 16, B = 64, C = 56, D = 49 are set, and the feature quantity extraction means of the present invention is used to obtain a 411-dimensional composite feature quantity (124-dimensional peripheral feature quantity + 62-dimensional stroke feature quantity + 225). Dimensional mesh features) were extracted.
[0074]
For each character, category data of the character is obtained from all learning samples of the character, and a recognition dictionary is created. For comparison with the conventional recognition method, the center value of all the learning samples belonging to the character category is obtained for each character, and the weight w for each dimension. _i Ask for. A recognition dictionary is created with the center value of the obtained character category as a representative of the character. For all characters, the covariance matrix, eigenvalues, and eigenvectors of the character category are obtained using all learning samples belonging to the character category.
[0075]
An experiment for recognizing a learned sample was performed using the recognition method of the present invention and the conventional recognition method. The conventional recognition method is a method of performing recognition using the city block distance, the Euclidean distance, the weighted Euclidean distance, and the projection distance (J = 3), respectively. The following table shows the results of the experiment.
[0076]
[Table 1]

[0077]
Among the conventional recognition methods, the highest recognition rate was 97.8%, and the average recognition time was 88 ms. The recognition rate of the recognition method of the present invention was 99.8%, and the average recognition time was 21 ms.
[0078]
Therefore, it is possible to recognize characters with higher recognition accuracy and at a higher speed with respect to the problem of recognizing unknown characters in the character recognition field.
[0079]
As is clear from the above description, the recognition apparatus of the present embodiment extracts the composite feature quantity of characters, obtains character category data using all learning samples belonging to the character category, and obtains the obtained character category. Data is stored in a recognition dictionary and a recognition dictionary is created. When recognizing characters, the method of calculating similarity between character patterns and character categories is used to compare the input unknown characters with all character categories stored in the created recognition dictionary. The character category is output as a result of recognition. As a result, the input unknown character can be easily recognized with high accuracy, high speed, and high speed.
[0080]
【The invention's effect】
As described above, according to the present invention, character category data composed of bit string data corresponding to the distribution of feature values is compared with character pattern data of similar character patterns to be recognized, so that character recognition is performed. Similar to the distribution of sample features, accuracy is not reduced by the distribution. Furthermore, more accurate recognition is possible by connecting bit string patterns of a plurality of types of feature values.
[Brief description of the drawings]
FIG. 1 is a block diagram showing an embodiment of a recognition apparatus of the present invention.
FIG. 2 is a diagram illustrating a composite feature amount of characters.
FIG. 3 is a diagram showing data in a recognition dictionary.
FIG. 4 is a diagram showing a configuration of a recognition apparatus according to the present invention.
FIG. 5 is a block diagram showing an embodiment of a feature quantity extraction unit 2;
FIG. 6 is a block diagram showing an embodiment of a feature quantity extraction unit 3;
FIG. 7 is a block diagram showing an embodiment of the feature quantity extraction means 4;
FIG. 8 is a diagram illustrating a state of a division divided by a horizontal division dividing unit.
FIG. 9 is a diagram illustrating a state of a division divided by a vertical division dividing unit.
FIG. 10 is a diagram illustrating a state in which a feature amount of a character “A” is extracted by the feature amount extraction unit 2;
FIG. 11 is a diagram illustrating a state in which a feature amount of the character “A” is extracted by the feature amount extraction unit 3;
FIG. 12 is a diagram illustrating a state of a child area divided by a child area dividing unit.
FIG. 13 is a block diagram showing an embodiment of character category data creating means 9;
FIG. 14 is a diagram illustrating a structure of character category data.
FIG. 15 is an explanatory diagram of a method for obtaining character category data.
FIG. 16 is a diagram for explaining the meaning of character category data;
FIG. 17 is a diagram illustrating a character recognition range according to the conventional technique and the technique of the present invention.
FIG. 18 is a flowchart showing an operation procedure of the feature quantity extraction unit 2;
FIG. 19 is a flowchart showing an operation procedure of the feature quantity extraction unit 2;
FIG. 20 is a flowchart showing an operation procedure of the feature quantity extraction unit 2;
FIG. 21 is a flowchart showing an operation procedure of the feature amount extraction unit 2;
FIG. 22 is a flowchart showing an operation procedure of the feature quantity extraction unit 3;
FIG. 23 is a flowchart showing an operation procedure of the feature quantity extraction unit 3;
FIG. 24 is a flowchart showing an operation procedure of the feature quantity extraction unit 4;
FIG. 25 is a flowchart showing an operation procedure of a character category data creation unit;
FIG. 26 is a flowchart showing an operation procedure of a recognition unit.
FIG. 27 is a diagram showing a distribution range and a recognition range of learning samples belonging to a character category.
FIG. 28 is a diagram illustrating a problem when recognition is performed using a conventional technique.
[Explanation of symbols]
1 character image input means, 2 to 4 feature quantity extraction means, 5 composite feature quantity calculation means, 6 character pattern and category similarity calculation means, 7 recognition means, 9 character category data creation means, X input Unknown character, Cat (i) Category data of character i stored in recognition dictionary.

Claims

In a character recognition device for character recognition, means for extracting a composite feature quantity of a character pattern, means for creating character category data for each character, means for creating a recognition dictionary using the created character category data, All the character category data stored in the recognition dictionary with the input unknown character pattern using the means for calculating the similarity. And the most similar character category is output as a recognition result,
The means for extracting the composite feature amount of the character pattern comprises means for extracting a peripheral feature amount of the character, means for extracting the stroke feature amount of the character, and means for extracting the mesh feature amount of the character,
The means for extracting the peripheral feature value of the character includes means for inputting a character image for one character, means for storing the character image, means for dividing the region of the character image, and character feature amount Means for limiting the scanning range, and means for taking the characteristics of the background image of the character image,
It said means for dividing an area of a character image, means for dividing the A segment an area of the character image in the horizontal, with respect to divided Category A to the horizontal, k (k = 1,2, ... , A -1) A means for dividing the lower half of the section and the upper half of the (k + 1) section into one section, horizontally dividing the A-1 section, means for dividing the character image region vertically into the A section, and dividing the length vertically Means for dividing the right half of the k (k = 1, 2,..., A-1) section and the left half of the (k + 1) section into one section and dividing it vertically into the A-1 section. Yes features and to Rubun shape recognition device to.

In a character recognition device for character recognition, means for extracting a composite feature quantity of a character pattern, means for creating character category data for each character, means for creating a recognition dictionary using the created character category data, All the character category data stored in the recognition dictionary with the input unknown character pattern using the means for calculating the similarity. And the most similar character category is output as a recognition result,
The means for extracting the composite feature amount of the character pattern comprises means for extracting a peripheral feature amount of the character, means for extracting the stroke feature amount of the character, and means for extracting the mesh feature amount of the character,
The means for extracting the stroke characteristic amount of the character includes means for inputting a character image for one character, means for storing the character image, region dividing means for the character image, and stroke characteristic amount of the character image. Means for extracting,
The character image region dividing means includes means for dividing the character image region horizontally into A sections, and k (k = 1, 2,..., A-1) with respect to the horizontally divided A sections. ) The lower half of the section and the upper half of the (k + 1) section are divided into one section, horizontally divided into A-1 sections, means for dividing the character image area vertically into the A sections, and the vertical division for a division, k (k = 1,2, ... , a-1) was classified as Category th right half and k + 1 division th left half a segment, to have a means for dividing the a-1 divided vertically features and to Rubun shape recognizer that.