JP3079707B2

JP3079707B2 - Character recognition method and device

Info

Publication number: JP3079707B2
Application number: JP03320062A
Authority: JP
Inventors: 健一郎木子
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1991-10-18
Filing date: 1991-10-18
Publication date: 2000-08-21
Anticipated expiration: 2015-08-21
Also published as: KR950007888B1; KR930008667A; JPH05108890A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、次に来る文字の候補を
予測し、まず、それらに対して認識を行うことによっ
て、正確で高速な文字認識を可能とする文字認識方法お
よび装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition method and apparatus for predicting next character candidates and first performing recognition on them to enable accurate and high-speed character recognition. It is.

【０００２】[0002]

【従来の技術】従来の文字認識方法では、文字のディジ
タル画像から個々の文字の文字画像を切り出し、いろい
ろな特徴量を文字画像から抽出して、あらかじめ保持し
ている文字ごとの特徴量辞書と比較することにより、文
字コードを決定している。文字切り出しの方法として
は、周辺分布からピッチを推定する方法や連結成分の分
離・統合を繰り返す方法などがある。また、特徴量の種
類としては、線素の方向性や周囲の形状、あるいはパタ
ーン全面の濃度分布などに着目する方法がある（例えば
電子情報通信学会技術研究報告ＩＥ８８−６「文字認
識技術の最近の動向」）。しかしながら、日本語のよう
な文字の種類が多い言語を対象とする場合には、対象文
字の特徴量との比較を、特徴量辞書にある全ての文字に
対して行うのは、処理量の面で非効率的であり問題とな
っていた。これを解決するために、粗い特徴量を用いて
大分類を行った後に、詳細な認識処理を行い文字候補を
抽出する方法が一般的に行われている（例えば「パター
ン認識」電子情報通信学会編、ｐｐ．６７）。しかしこ
の方法でも、それぞれの処理に多くの時間がかかるため
問題となっていた。一方、特徴量による文字認識のみで
は、必ずしも高い精度が期待できないため、文字認識の
結果に対して、種々の後処理を行うことにより精度を向
上させる方法が提案されている。例えば、リジェクト文
字など文字認識によって得られた複数の候補に対して、
単語照合を行う方法（電子情報通信学会技術研究報告
ＰＲＬ８２−７７「パターン認識における単語照合処理
の一検討」）がある。さらに、形態素解析などの文法的
な処理と、特徴量による認識処理の結果得られる情報と
を組み合せる方法も提案されている（電子通信学会論文
誌Ｖｏｌ．Ｊ６７−ＤＮｏ．１１ｐｐ．１３４８
「認識情報及び単語・文節情報を利用した文字認識後処
理」）。また、連接可能な文字の組合せをテーブルとし
て持っておき、複数の候補の中から連接可能なものを正
解とする方法もある。漢字同士の場合には連接する可能
性のある組合せは、全体の１％以下であることが知られ
ており、３ないし４文字の連接処理を行えば、単語照合
を行うのと同等の効果があることが明らかになっている
（電子通信学会論文誌Ｖｏｌ．Ｊ６８−ＤＮｏ．１
ｐｐ．６４「文字連接情報を用いた読み取り不能文字の
判定処理 −文字認識への応用−」）。このことを利用
すると、前後の文字から候補を限定する処理を効率よく
行うことができる。しかしながら、これらの後処理はい
ずれも、特徴量を用いた文字認識処理の後にさらに別の
処理として行っているため、処理時間がより多くかかる
ことになり、問題となっていた。2. Description of the Related Art In a conventional character recognition method, a character image of an individual character is cut out from a digital image of a character, and various characteristic amounts are extracted from the character image. The character code is determined by comparison. As a method of extracting characters, there are a method of estimating a pitch from a marginal distribution and a method of repeating separation and integration of connected components. As a type of the feature amount, there is a method of paying attention to a direction of a line element, a surrounding shape, a density distribution over the entire pattern, and the like (for example, IEICE Technical Report IE88-6, “Recent Trends in Character Recognition Technology”). Trends "). However, when a language with many types of characters such as Japanese is targeted, the comparison with the feature amount of the target character is performed for all the characters in the feature amount dictionary in terms of processing amount. Was inefficient and problematic. In order to solve this problem, a method of extracting a character candidate by performing a detailed recognition process after performing a large classification using a coarse feature amount is generally performed (for example, “Pattern Recognition”, IEICE). Ed., Pp. 67). However, this method also has a problem because each process takes a lot of time. On the other hand, since high accuracy cannot always be expected only by character recognition based on the feature amount, a method of improving accuracy by performing various post-processing on the result of character recognition has been proposed. For example, for a plurality of candidates obtained by character recognition such as rejected characters,
Method of Word Matching (IEICE Technical Report)
PRL 82-77 "A Study of Word Matching Process in Pattern Recognition"). Furthermore, a method of combining grammatical processing such as morphological analysis and information obtained as a result of recognition processing based on feature amounts has also been proposed (IEICE Transactions Vol. J67-D No. 11 pp. 1348).
"Character post-processing using recognition information and word / phrase information"). There is also a method in which a combination of connectable characters is held as a table, and a connectable character is selected from among a plurality of candidates as a correct answer. It is known that the combination that may be concatenated in the case of kanji is less than 1% of the whole, and if the concatenation processing of three or four characters is performed, the same effect as performing word matching is obtained. (Electronic Communication Society Transactions Vol. J68-D No. 1)
pp. 64 "Determination processing of unreadable character using character connection information-application to character recognition-"). By utilizing this, it is possible to efficiently perform a process of limiting candidates from characters before and after. However, since all of these post-processings are performed as separate processings after the character recognition processing using the feature amount, the processing time becomes longer, which is a problem.

【０００３】[0003]

【発明が解決しようとする課題】本発明は上述の従来技
術の問題点を解決することを目的とするものである。即
ち、本発明は、特徴量によって文字認識を行う文字認識
方法および装置において、特徴量の比較を行う対象を高
速にかつ精度良く絞ることにより精度を落すことなく高
速に文字認識を行うことを目的とするものである。SUMMARY OF THE INVENTION An object of the present invention is to solve the above-mentioned problems of the prior art. That is, an object of the present invention is to provide a character recognition method and apparatus for performing character recognition based on feature amounts, in which a target for which feature amounts are compared is narrowed down at high speed and with high accuracy, thereby performing high-speed character recognition without deteriorating accuracy. It is assumed that.

【０００４】[0004]

【課題を解決するための手段】本発明の文字認識方法
は、互いに隣接する認識対象の第1の文字画像および第
２の文字画像の認識において、前記第1の文字画像に対
する候補文字に対して連連接する文字を第２の文字画像
に対する予備的な候補文字として出しておくステップと
（S３０２）、その予備的な候補文字の特徴量と第２の
文字画像の特徴量との距離に基づき予備的な候補文字を
評価して、認識対象の第２の文字画像に対する候補文字
を判定するステップと（S３０３、S３０４）、その判定
した第２の文字画像に対する候補文字との関係に基づき
第1の文字画像に対する候補文字を再判定するステップ
と（S３０６）、その再判定により確定した第1の文字画
像の候補文字とそれに連接する前記第２の画像に対する
候補文字との組に基づいて第1の文字画像に対する認識
出力文字を決定するステップと（S３０８、S７０１、S
７０２）、その認識出力文字に連接する候補文字のみを
第２の文字画像に対する新たな候補文字として再判定す
るステップと（S３０９）を備えたことを特徴とする。
また、本発明の文字認識装置は、読み取り対象となる文
字画像を入力する文字画像入力手段（図１の１）と、あ
る第２の認識対象文字画像に隣接する第１の認識対象文
字画像に対する候補文字を前候補文字として保持してお
く前候補文字保持手段（４）と、連接可能な文字の組合
せを保持する連接テーブル、連接テーブルに基づいて、
前記第1の認識対象文字画像に対する前記前候補文字に
連接する文字を次候補文字として予測する次候補文字予
測手段（５−１，５−２）と、前記次候補文字予測手段
によって予測された文字の次候補文字としての妥当性を
判定する候補文字判定手段（６）と、前記前候補文字と
それに連接する次候補文字の組において、前記前候補文
字の特徴量と前記第1の認識対象文字画像の特徴量との
距離、および前記次候補文字の特徴量と前記第２の認識
対象文字の特徴量との距離の合計を求め、その合計が最
も小さい前記組に対応する前候補文字を前記前にある認
識対象文字の認識出力として出力する認識処理手段
（３）と、全体の処理の制御を行う制御手段（７）とを
備えている。According to the character recognition method of the present invention, a first character image and a second character image to be recognized adjacent to each other are recognized.
In recognition of the second character image, the first character image
The second character image is a character connected to the candidate character
Steps to put out as preliminary candidate characters for
(S302), the feature amount of the preliminary candidate character and the second
Preliminary candidate characters are determined based on the distance from the character image features.
Evaluate and evaluate candidate characters for the second character image to be recognized
(S303, S304) and its determination
Based on the relationship between the second character image and the candidate character
Re-determining candidate characters for the first character image
(S306), the first character image determined by the re-determination
Image candidate character and the second image adjacent thereto
Recognition of first character image based on combination with candidate characters
Determining output characters (S308, S701, S
702), only candidate characters connected to the recognized output character are
Redetermined as a new candidate character for the second character image
And step (S309) .
Further, the character recognition device of the present invention, a character image input means for inputting a character image to be read (1 in FIG. 1), Oh
That a second recognition object character first recognized sentence <br/> character before to hold a candidate character as the previous candidate character for the image candidate character holding means adjacent to the image (4), articulated possible characters Union
A connection table that holds the set, based on the connection table,
Characters that connected to the front candidate character for the first recognition target character image and next candidate characters predicting means for predicting a next candidate characters (5-1, 5-2), was predicted by the next candidate character prediction means A candidate character determining means (6) for determining the validity of the character as a next candidate character;
In the set of the next candidate character connected to it, the previous candidate sentence
Between the characteristic amount of the character and the characteristic amount of the first recognition target character image.
A distance, a feature amount of the next candidate character, and the second recognition
Calculate the sum of the distance to the feature of the target character and calculate the sum
The previous candidate character corresponding to the set
Recognition processing means for outputting as recognition output of recognition target characters
(3) and control means (7) for controlling the entire processing.

【０００５】[0005]

【作用】本発明は、認識対象文字の前（または後）にあ
る文字から対象文字の候補を予測することによって、候
補文字を用いないで認識を行わなければならない文字を
大幅に減らすようにしている。その予測には、非ひらが
な文字同士の連接する組合せが比較的少ないという性質
を利用する。即ち、各文字に対して次に来る（または前
に来る）可能性のある文字の情報を予め用意しておくこ
とにより、次に来る文字の候補を非常に高速に予測する
とともに、候補の数を大分類処理に比べて減らすことが
可能となる。According to the present invention, by predicting a candidate for a target character from a character before (or after) a character to be recognized, the number of characters that must be recognized without using a candidate character is greatly reduced. I have. For the prediction, the property that the number of consecutive combinations of non-Hiragana characters is relatively small is used. That is, by preparing in advance information of a character that may come next (or come before) for each character, a candidate for the next character can be predicted very quickly, and the number of candidates can be reduced. Can be reduced as compared with the large classification processing.

【０００６】前候補文字保持手段は、ある隣接する第１
および第２の認識対象文字画像における第１の認識対象
文字画像に対する候補文字を前候補文字として保持す
る。連接テーブルには各文字に対して次にくる可能性の
ある文字（連接文字）の組み合せの情報を保持してお
く。次候補文字予測手段は、その連接テーブルに基づい
て第２の認識対象文字画像に対する予備的な候補文字
（次候補文字）を予測する。例えば「連接する」という
文字画像列の各文字を認識する場合を例にとる。図９に
例示するように、第１の認識対象文字画像「連」の認識
結果の候補として「連」「運」「速」が前候補文字保持
手段に保持されているものとする。次候補文字予測手段
は、図１０に例示するように、前候補文字「連」「運」
「速」に連接する文字を、前記連接テーブルを用いて、
第２の認識対象文字画像の予備的な次候補文字として求
める。例えば前候補文字「連」に連接する次候補文字
「続」「接」、前候補文字「運」に連接する次候補文字
「用」、前候補文字「速」に連接する次候補文字「度」
等が得られる。候補文字判定手段は、前候補文字保持手
段に保持された前候補文字の妥当性を判定する。これに
より前候補文字の数を絞り込むことができる。その前候
補文字の妥当性の判定は、例えば、第２の認識対象画像
文字とそれに対する前記予備的な次候補文字との特徴量
の距離の大きさに基づいて行う。認識処理手段は、妥当
と判定された前候補文字と次候補文字を組み合せにおけ
る、前記前候補文字の特徴量と前記第１の認識対象文字
画像の特徴量との距離、および前記次候補文字の特徴量
と前記第２の認識対象文字の特徴量との距離の合計を求
め、その合計が最も小さい組に対応する前候補文字を第
１の認識対象文字の認識出力として出力する。[0006] The preceding candidate character holding means is provided with a first adjacent character.
And a candidate character for the first recognition target character image in the second recognition target character image is held as a previous candidate character. The concatenation table holds information on combinations of characters (concatenated characters) that may come next to each character. The next candidate character prediction unit predicts a preliminary candidate character (next candidate character) for the second recognition target character image based on the connection table. For example, a case in which each character of a character image string “join” is recognized is taken as an example. As illustrated in FIG. 9, it is assumed that “run”, “luck”, and “speed” are stored in the previous candidate character storage unit as candidates for the recognition result of the first recognition target character image “run”. The next candidate character predicting means, as exemplified in FIG.
By using the connection table, characters connected to "speed"
It is obtained as a preliminary next candidate character of the second recognition target character image. For example, the next candidate character “continuation” “connected” to the previous candidate character “sequence”, the next candidate character “use” connected to the previous candidate character “luck”, and the next candidate character “degree” connected to the previous candidate character “speed” "
Etc. are obtained. The candidate character determination unit determines the validity of the previous candidate character held by the previous candidate character holding unit. This makes it possible to narrow down the number of preceding candidate characters. The determination of the validity of the preceding candidate character is performed, for example, based on the magnitude of the feature distance between the second recognition target image character and the preliminary next candidate character. The recognition processing means includes, in a combination of the previous candidate character and the next candidate character determined to be valid, a distance between the feature amount of the previous candidate character and the feature amount of the first recognition target character image, and the distance of the next candidate character. The sum of the distance between the feature amount and the feature amount of the second recognition target character is obtained, and the previous candidate character corresponding to the set having the smallest total is output as the first recognition target character recognition output.

【０００７】本発明は、後処理に用いる情報である連接
テーブルなどの複数文字の繋がりの情報を使って、次に
来る文字を予測するため、後処理を行うことなく同程度
の認識精度が確保でき、後処理時間をも節約することが
できる。このように、本発明は、認識対象となる文字の
候補を、当該文字以前の文字（または以後の文字）から
予測することにより、精度を落とすことなく高速に文字
認識を行うことができるものである。According to the present invention, the next character is predicted using the connection information of a plurality of characters such as a concatenation table which is information used for post-processing, so that the same recognition accuracy is secured without performing post-processing. And post-processing time can be saved. As described above, according to the present invention, character recognition can be performed at high speed without lowering the accuracy by predicting a character candidate to be recognized from a character before the character (or a character after the character). is there.

【Example】

【０００８】まず実施例の構成について図１により説明
する。この実施例は以下の部分からなる。（１）読み取り対象となる文字画像を入力する文字画
像入力部１。この部分は、一般的なイメージスキャナや
ファクシミリ等の画像入力装置によって実現する。（２）複数の文字からなる画像から個々の文字の文字
画像を切り出す、文字切り出し部２。（３）文字画像の特徴量のみから対応する文字コード
を候補として出力する機能と、特定の文字コード特徴量
と認識対象の文字画像の特徴量との距離（以下、単純に
距離または距離値と呼ぶ）を算出する機能とを持つ認識
処理部３。（４）認識対象文字より前にある文字の候補を保持し
ておく前候補文字保持部４。（５）次に来る文字を予測する手段として、文字間連
接情報を用いる次候補文字予測部５−１と、文字連接テ
ーブル５−２。（６）次候補文字予測部５−１によって得られた次候
補文字に対して認識処理部３によって認識を行った結果
の妥当性の判定と、その結果を元に前候補・次候補の再
判定を行う候補文字判定部６。（７）全体の処理とデータの制御を行う制御部７。First, the configuration of the embodiment will be described with reference to FIG. This embodiment comprises the following parts. (1) A character image input unit 1 for inputting a character image to be read. This part is realized by an image input device such as a general image scanner or a facsimile. (2) A character cutout unit 2 that cuts out a character image of each character from an image composed of a plurality of characters. (3) A function of outputting a corresponding character code as a candidate only from a feature amount of a character image, and a distance between a specific character code feature amount and a feature amount of a character image to be recognized (hereinafter simply referred to as distance or distance value and A recognition processing unit 3 having a function of calculating (4) A pre-candidate character holding unit 4 for holding a candidate for a character preceding the recognition target character. (5) As means for predicting the next character, a next candidate character prediction unit 5-1 using inter-character connection information and a character connection table 5-2. (6) Judgment of the validity of the result of recognition by the recognition processing unit 3 on the next candidate character obtained by the next candidate character prediction unit 5-1 and re-establishment of the previous candidate / next candidate based on the result A candidate character determination unit 6 for performing determination; (7) A control unit 7 that performs overall processing and data control.

【０００９】上記の構成において、認識処理部３は、図
２に示すように、文字切り出し部２で切り出された個々
の文字画像の特徴量を算出する特徴量算出部３１と、特
徴量間の距離を計算する距離計算部３２と、特徴量によ
る大分類と詳細な特徴量によるマッチングを行い文字コ
ードを決定する特徴量−文字コードマッチクング部３３
とを備えている。ここでいう距離とはＮ個の特徴量を
軸とするＮ時限空間における距離のことである。また、
特徴量−文字コードマッチング部３３は、個々の文字画
像の特徴量と文字コードを対応させたテーブル形式の特
徴量辞書（認識辞書）３３２と、粗い特徴量による大分
類と特徴量辞書３３２のアドレスとを対応させたテーブ
ル形式の大分類テーブル３３１と、文字コードと特徴量
辞書のアドレスとを対応させるためのハッシュテーブル
３３３とを有する。なお、この認識処理部３は、文字画
像入力部１によって入力された文字画像が、文字切り出
し部２で切り出されて、１文字ずつの文字画像になった
ものを入力として扱う。In the above configuration, as shown in FIG. 2, the recognition processing section 3 includes a feature quantity calculating section 31 for calculating the feature quantity of each character image cut out by the character cutout section 2, A distance calculation unit 32 for calculating a distance, and a feature amount-character code match king unit 33 for determining a character code by performing large classification based on the feature amount and matching based on the detailed feature amount.
And The distance here is a distance in an N time space having N feature amounts as axes. Also,
The feature-character code matching unit 33 includes a table-based feature dictionary (recognition dictionary) 332 in which the feature of each character image is associated with a character code, and a large classification based on coarse feature and an address of the feature dictionary 332. And a hash table 333 for associating a character code with an address of a feature dictionary. Note that the recognition processing unit 3 treats a character image input by the character image input unit 1 that is cut out by the character cutout unit 2 and becomes a character image for each character as input.

【００１０】認識処理部３の機能の１つは、入力された
文字画像に対して、特徴量による大分類と詳細な特徴量
によるマッチングを行い文字コードを決定する一般的な
文字認識機能である。もう１つの機能は、特定の指定さ
れた文字コードの特徴量辞書を用いて文字画像の認識を
行い、文字画像と指定された文字との距離を出力する機
能である。この２つの機能を実現させるために、特徴量
辞書３３２をどちらの側からでも引けるように、両引き
が可能な構成として前述のように大分類テーブル３３１
とハッシュテーブル３３３を用いている。特徴量から文
字コードを引く場合には、大分類テーブル１２１を用
い、文字コードからその特徴量を得る場合には、ハッシ
ュテーブル１２３を用いることにより、高速な検索が可
能になる。One of the functions of the recognition processing unit 3 is a general character recognition function for determining a character code by performing a large classification based on feature amounts and matching based on detailed feature amounts with respect to an input character image. . Another function is a function of recognizing a character image using a feature dictionary of a specific designated character code and outputting a distance between the character image and the designated character. In order to realize these two functions, as described above, the large classification table 331 has a configuration capable of double lookup so that the feature dictionary 332 can be looked up from either side.
And a hash table 333. When the character code is subtracted from the feature value, the large classification table 121 is used, and when the feature value is obtained from the character code, the hash table 123 is used, so that a high-speed search can be performed.

【００１１】次に、全体の処理の流れについて述べる。
処理全体の流れを図３に示す。（１）次候補文字予測前候補文字があるか否かを判定する（ステップＳ３０
１）。（１−１）前候補文字保持部に候補文字が１つも無い
場合ステップＳ３０１の判定の結果、前候補文字がなかった
場合は、特徴量のみを用いた文字認識を行って候補を出
し（ステップＳ３１０）、これらと認識対象文字画像と
の距離値を前候補文字保持部４に格納する（ステップＳ
３１１）。Next, the overall processing flow will be described.
FIG. 3 shows the overall processing flow. (1) Predicting next candidate character It is determined whether or not there is a previous candidate character (step S30).
1). (1-1) When there is no candidate character in the previous candidate character holding unit When there is no previous candidate character as a result of the determination in step S301, character recognition using only the feature amount is performed to generate a candidate (step S301). S310), and stores the distance value between these and the recognition target character image in the preceding candidate character storage unit 4 (step S310).
311).

【００１２】（１−２）前候補文字保持部に候補文字が
１つ以上ある場合ステップＳ３０１の判定の結果、前候補文字があった場
合は、それぞれの前候補文字について次候補文字予測を
行う（ステップＳ３０２）。この実施例の場合には、次
候補文字の予測は文字間の連接関係を用いる。図４はそ
の連接関係を用いる次候補文字予測の処理手順を示すフ
ローチャートである。ある文字が観測された時に、その
文字に連接する可能性のある文字を図７のような連接文
字テーブルとして持っておき、前候補文字保持部４に保
持されているおのおのの文字に対して、文字連接テーブ
ルを用いて、連接する可能性のある文字を全て次候補文
字とする。(1-2) When there is one or more candidate characters in the preceding candidate character holding unit If the result of the determination in step S301 is that there are preceding candidate characters, the next candidate character is predicted for each preceding candidate character. (Step S302). In the case of this embodiment, the prediction of the next candidate character uses the connection relation between the characters. FIG. 4 is a flowchart showing a processing procedure of the next candidate character prediction using the connection relation. When a certain character is observed, a character that may be connected to the character is stored as a connected character table as shown in FIG. 7, and for each character held in the previous candidate character holding unit 4, Using the character concatenation table, all characters that may be concatenated are set as next candidate characters.

【００１３】即ち、図４において、まず、一つの前候補
文字を取り出し（ステップＳ４０１）、その前候補文字
をキーとして連接テーブル（図９）を検索する（ステッ
プＳ４０２）。検索の結果得られた連接する全ての文字
を次候補として出力する（ステップＳ４０３）。前候補
を全て見終ったかどうかを判定し（ステップＳ４０
４）、見終っていないときは次の前候補文字を取り出し
（ステップＳ４０５）、ステップＳ４０２〜４０４の処
理を繰り返す。前候補文字のすべてについて処理が終っ
たときは図４の次候補文字予測処理を終了する。That is, in FIG. 4, first, one preceding candidate character is extracted (step S401), and the concatenation table (FIG. 9) is searched using the preceding candidate character as a key (step S402). All connected characters obtained as a result of the search are output as next candidates (step S403). It is determined whether all the previous candidates have been seen (step S40).
4) If not, the next preceding candidate character is extracted (step S405), and the processing of steps S402 to S404 is repeated. When the processing has been completed for all of the preceding candidate characters, the next candidate character prediction processing in FIG. 4 ends.

【００１４】（２）特徴量による認識処理認識処理部３の、特定の文字コードの特徴量と文字画像
の特徴量との距離を算出する特徴量間距離計算部３２の
機能を用いて、次候補文字予測により予測されたそれぞ
れの次候補文字と、認識対象文字との距離を算出する
（ステップＳ３０３）。こうして得られた次候補文字と
それぞれの次候補文字の認識対象との距離値が、候補文
字判定部６に送られる。(2) Recognition Processing Using Feature Amounts The recognition processing unit 3 uses the function of a feature amount distance calculation unit 32 for calculating the distance between the feature amount of a specific character code and the feature amount of a character image. The distance between each next candidate character predicted by the candidate character prediction and the recognition target character is calculated (step S303). The distance value between the next candidate character thus obtained and the recognition target of each next candidate character is sent to the candidate character determination unit 6.

【００１５】（３）次候補文字判定候補文字判定部６では、文字画像の特徴量と認識辞書の
特徴量との距離が、ある閾値以下のもののみを次候補文
字とし、それ以外は次候補文字から削除する（ステップ
Ｓ３０４）。図５はそのステップＳ３０４の処理の詳細
を示すフローチャートである。図５において、まず、一
つの次候補文字を取り出す（ステップＳ５０１）。そし
て、その距離が閾値以下であるかを判定し（ステップＳ
５０２）、閾値未満であれば次候補から削除する（ステ
ップＳ５０３）。次候補文字を全て見終ったかどうかを
判定し（ステップＳ５０４）、見終っていないときは次
の次候補文字を取り出し（ステップＳ５０５）、ステッ
プＳ５０２〜５０４の処理を繰り返す。次候補文字のす
べてについて処理が終ったときは図５の候補文字判定処
理を終了する。(3) Next candidate character determination In the candidate character determination unit 6, only those whose distance between the characteristic amount of the character image and the characteristic amount of the recognition dictionary is equal to or less than a certain threshold value are set as the next candidate character, and the others are set as the next candidate character. It is deleted from the character (step S304). FIG. 5 is a flowchart showing details of the processing in step S304. In FIG. 5, first, one next candidate character is extracted (step S501). Then, it is determined whether the distance is equal to or less than the threshold (step S).
502), if it is less than the threshold, it is deleted from the next candidate (step S503). It is determined whether all the next candidate characters have been seen (step S504). If not, the next next candidate character is extracted (step S505), and the processing of steps S502 to S504 is repeated. When the processing has been completed for all of the next candidate characters, the candidate character determination processing of FIG. 5 ends.

【００１６】（４）候補文字の確定次に、次候補文字判定の結果、次候補文字が残ったか否
かを判定する（ステップＳ３０５）。（４−１）次候補文字判定の結果、次候補文字が１つ
も残らなかった場合予測による文字認識は不成功だったことになり、今まで
前候補文字保持部４にあった文字の中で、最も距離値が
小さいものを前候補文字の認識結果として出力する（ス
テップＳ３０７）。そして、認識対象文字に対しては、
一般的な特徴量のみによる文字認識を行い（ステップＳ
３１０）、得られた候補と距離値を前候補文字保持部４
に格納する（ステップＳ３１１）。(4) Determination of Candidate Character Next, it is determined whether or not the next candidate character remains as a result of the next candidate character determination (step S305). (4-1) In the case where no next candidate character remains as a result of the next candidate character determination, character recognition by prediction is unsuccessful. Then, the one with the smallest distance value is output as the recognition result of the preceding candidate character (step S307). And for the characters to be recognized,
Character recognition using only general features is performed (step S
310), the obtained candidate and the distance value are stored in the previous candidate character holding unit 4
(Step S311).

【００１７】（４−２）次候補文字判定の結果、次候
補文字が１つ以上残った場合まず、残った次候補文字から前候補文字再判定を行う
（ステップＳ３０６）。具体的には、ステップＳ３０６
は図６のフローチャートに示す手順からなっている。即
ち、次候補文字判定によって残った次候補文字群に接続
する文字が無い前候補文字を、前候補から削除する。即
ち、図６において、まず、一つの前候補文字を取り出し
（ステップＳ６０１）、その前候補文字に連接する文字
が次候補文字にあるか否かを調べる（ステップＳ６０
２）。次候補文字になければ、その前候補文字を前候補
文字のリストから削除する（ステップＳ６０３）。次候
補文字にあればその削除は行わない。次に、前候補を全
て見終ったかどうかを判定し（ステップＳ６０４）、見
終っていないときは次の前候補文字を取り出し（ステッ
プＳ６０５）、ステップＳ６０２〜６０４の処理を繰り
返す。前候補文字のすべてについて処理が終ったときは
図６の前候補再判定処理を終了する。(4-2) When one or more next candidate characters remain as a result of the next candidate character determination First, the previous candidate character is re-determined from the remaining next candidate characters (step S306). Specifically, step S306
Consists of the procedure shown in the flowchart of FIG. That is, the previous candidate character having no character connected to the next candidate character group remaining by the next candidate character determination is deleted from the previous candidate. That is, in FIG. 6, first, one preceding candidate character is extracted (step S601), and it is checked whether or not a character connected to the preceding candidate character is in the next candidate character (step S60).
2). If not, the previous candidate character is deleted from the list of previous candidate characters (step S603). If it is in the next candidate character, it is not deleted. Next, it is determined whether all the previous candidates have been viewed (step S604). If not, the next previous candidate character is extracted (step S605), and the processes of steps S602 to S604 are repeated. When the processing has been completed for all of the preceding candidate characters, the preceding candidate re-determination processing of FIG. 6 ends.

【００１８】次に、前候補の確定と出力を行う（ステッ
プＳ３０８）。図７はその前候補確定、出力の手順の詳
細を示すフローチャートである。即ち、残った前候補文
字とそれに連接する次候補文字との組について、それぞ
れの組ごとに、前候補文字の持っている距離値と次候補
文字の持っている距離値を合計した値を算出する（ステ
ップＳ７０１）。そして、この値が最も小さな組の前候
補文字を、前候補文字の認識結果として出力する（ステ
ップＳ７０２）。さらに、出力された前候補文字に連接
する次候補文字のみを新たな前候補文字として（ステッ
プＳ３０９）、前候補文字保持部に格納する（ステップ
Ｓ３１１）。この結果、出力された前候補文字と連接し
ない次候補文字は、候補から削除されることになる。Next, the previous candidate is determined and output (step S308). FIG. 7 is a flowchart showing details of the procedure for determining and outputting the previous candidate. That is, for each pair of the remaining previous candidate character and the next candidate character connected to the remaining candidate character, the sum of the distance value of the previous candidate character and the distance value of the next candidate character is calculated for each pair. (Step S701). Then, the set of the preceding candidate characters having the smallest value is output as the recognition result of the preceding candidate character (step S702). Further, only the next candidate character connected to the output preceding candidate character is stored as a new preceding candidate character (step S309) in the preceding candidate character holding unit (step S311). As a result, the next candidate character that is not connected to the output preceding candidate character is deleted from the candidates.

【００１９】ステップＳ３０９は図８のフローチャート
に示す手順からなっている。即ち、図８において、ま
ず、一つの次候補文字を取り出し（ステップＳ８０
１）、出力した次候補文字と連接するかを調べる（ステ
ップＳ８０２）。連接していなければ、その次候補文字
を次候補文字のリストから削除する（ステップＳ８０
３）。連接していればその削除は行わない。次に、次候
補を全て見終ったかどうかを判定し（ステップＳ８０
４）、見終っていないときは次の次候補文字を取り出し
（ステップＳ８０５）、ステップＳ８０２〜８０４の処
理を繰り返す。次候補文字のすべてについて処理が終っ
たときは図８の前候補再判定処理を終了する。Step S309 comprises the procedure shown in the flowchart of FIG. That is, in FIG. 8, first, one next candidate character is extracted (step S80).
1) It is checked whether the character is connected to the output next candidate character (step S802). If not, the next candidate character is deleted from the next candidate character list (step S80).
3). If they are connected, they are not deleted. Next, it is determined whether all the next candidates have been viewed (step S80).
4) If not, the next next candidate character is extracted (step S805), and the processing of steps S802 to 804 is repeated. When the processing has been completed for all of the next candidate characters, the previous candidate re-determination processing of FIG. 8 ends.

【００２０】（５）最終文字処理認識対象文字の次に認識すべき文字画像が存在しない場
合、その文字を最終文字と呼ぶ。図３のステップＳ３１
２においてその最終文字であるか否かを判定する。（５−１）認識対象文字が最終文字の場合残った候補の中で、最も距離値が小さいものを認識結果
として出力する。（５−２）認識対象文字が最終文字でない場合（１）の処理から繰り返す。(5) Final character processing When there is no character image to be recognized next to the character to be recognized, the character is called a final character. Step S31 in FIG.
In step 2, it is determined whether the character is the last character. (5-1) When the recognition target character is the last character Among the remaining candidates, the one with the smallest distance value is output as the recognition result. (5-2) When the Character to be Recognized is Not the Last Character Repeat from the processing of (1).

【００２１】次に具体的な例を用いて説明する。図１０
はこの具体例の動作を説明するための図である。「連接
する」という文字列の画像を認識することを考える。こ
こですでに、「連」という文字に関しては「連、運、
速」という候補とそれぞれの文字の、認識対象文字「連
」に対する距離値が分かっており、前候補文字保持部
にこれらの文字と距離値のテーブル（Ａ）が格納されて
いるとする。この状態から処理を始めるとする。Next, a specific example will be described. FIG.
Is a diagram for explaining the operation of this specific example. Consider recognizing an image of a character string “join”. Here, as for the word "ren", "ren, luck,
It is assumed that the distance value between the candidate “speed” and each character with respect to the recognition target character “sequence” is known, and the table (A) of these characters and the distance values is stored in the previous candidate character holding unit. It is assumed that processing starts from this state.

【００２２】まず最初に、前候補文字保持部４に候補文
字があるので、次候補文字予測を行う。それぞれの次候
補文字に対しては、図９のような連接テーブルが用意さ
れており、これらを参照して次候補文字を予測する。こ
の実施例の場合には、連接テーブルにはひらがなを含ん
でいない。これは、ひらがなと非ひらがな文字とは非常
に連接する組合せが多いため、ひらがなを含めてしまう
と、予測される候補の中に多くのひらがなが入ってしま
うからである。予測の結果として、これら連接テーブル
にあげられている文字が全て次候補文字となる。First, since there is a candidate character in the previous candidate character holding unit 4, the next candidate character is predicted. A connection table as shown in FIG. 9 is prepared for each next candidate character, and the next candidate character is predicted with reference to these tables. In the case of this embodiment, the connection table does not include hiragana. This is because there are many combinations of hiragana and non-hiragana characters that are extremely connected, and if hiragana is included, many hiragana are included in the predicted candidates. As a result of the prediction, all the characters listed in these connection tables are the next candidate characters.

【００２３】次に、こうして得られた全ての次候補文字
に対して、認識処理部３で特徴量による認識を行い、そ
れぞれの候補文字と認識対象文字「接」との距離を求め
る。この結果、次候補文字と、それが連接する前候補文
字、及びそれぞれの文字の認識対象文字「接」に対する
距離を持ったテーブル（Ｂ）ができあがる。このテーブ
ル（Ｂ）が候補文字判定部６に送られる。候補文字判定
部６では、それぞれの距離値を判定し、ある閾値以下で
あるもののみを残してそれ以外の文字を削除する。この
結果、テーブル（Ｃ）に示すように「接、携、搬」だけ
が残る。Next, the recognition processing unit 3 performs recognition of all the next candidate characters obtained in this manner by the feature amount, and obtains the distance between each candidate character and the recognition target character "tangent". As a result, a table (B) having a distance to the next candidate character, the preceding candidate character connected to the next candidate character, and the recognition target character “tangent” of each character is created. This table (B) is sent to the candidate character determination section 6. The candidate character determining unit 6 determines each distance value, and deletes only the characters that are equal to or less than a certain threshold value and deletes other characters. As a result, only “contact, carrying, carrying” remains as shown in the table (C).

【００２４】この結果を元に、前候補文字再判定処理が
行われる。即ち、この場合、次候補文字に残ったものの
中に、「速」という字に連接していたものが存在しない
ため、前候補文字からこの字を削除する。この結果
「連、運」の２文字のみが前候補文字として残される。
そして、この２文字に接続する文字の組、即ち「連接」
「連携」「運搬」のそれぞれについて各文字の距離値を
合算し、文字組の距離とする。その結果、連接文字の組
と、それぞれの組の合算した距離値を持ったテーブル
（Ｅ）ができる。この中から合算した距離値が最も小さ
いものを選び、その組の前候補にあたる部分の文字を、
「連」に対する認識結果として出力する。この場合は
「連接」の距離値が最も小さいので、この組にある文字
「連」が選択され出力される。そして、前候補文字は全
て前候補文字保持部４から削除される。次に、出力され
た文字「連」に連接していた文字「接、携」の２文字が
前候補文字保持部４に格納される。この時、出力されな
かった文字「運」に連接していた「搬」は候補から除外
されることになる。この結果、前候補文字保持部４には
テーブル（Ｆ）のように「接、携」の２文字が格納され
ることになる。On the basis of the result, the preceding candidate character re-determination processing is performed. That is, in this case, since there is no character connected to the character "speed" among the characters remaining in the next candidate character, this character is deleted from the previous candidate character. As a result, only the two characters “ream and luck” are left as previous candidate characters.
Then, a set of characters connected to these two characters, that is, “concatenation”
The distance value of each character is summed up for each of "cooperation" and "transportation", and is set as the distance of the character set. As a result, a table (E) having a set of connected characters and a total distance value of each set is created. From these, select the one with the smallest total distance value, and replace the character of the part that is the previous candidate of that pair with
It is output as a recognition result for "ren". In this case, since the distance value of “joint” is the smallest, the character “joint” in this group is selected and output. Then, all the previous candidate characters are deleted from the previous candidate character holding unit 4. Next, the two characters of the character "", which are connected to the output character "", are stored in the previous candidate character storage unit 4. At this time, the character "luck" connected to the character "luck" which has not been output is excluded from the candidates. As a result, the preceding candidate character holding unit 4 stores two characters of "contact, carrying" as shown in the table (F).

【００２５】次に、同様にして、これら２文字に対して
次候補文字予測を行うと、テーブル（Ｇ）のように連接
テーブルの文字が次候補文字として得られる。この場
合、連接テーブルにはひらがなを含んでいないため正解
である「す」は候補の中に入ってこない。これらの候補
に対して特徴量による認識を行い、候補文字判定をする
と、「す」との距離で閾値を満たす文字が無いため、全
ての文字が次候補文字から削除され、次候補文字予測は
失敗する。そこで、一般的な文字画像の特徴量を用いた
文字認識を行い、その結果テーブル（Ｉ）のような候補
が「す」に対して得られる。次候補文字判定が失敗した
ので、前候補文字保持部４にある文字の決定に次候補文
字が使えない。そこで、単純に距離値の小さい文字であ
る「接」を出力し、前候補文字保持部４を空にする。そ
して、特徴量を用いた文字認識により得られた「す、
寸、才」が前候補文字保持部４に格納される。さらに同
様にして「す、寸、才」を用いて次候補文字予測を行
う。この場合、ひらがなは連接テーブルから除外してあ
るので、「す」に対する次候補文字は予測できない。そ
こで、「寸、才」に対してのみ次候補文字予測を行うこ
とになるが、この結果得られた候補に対して上記と同様
の処理を行うと、やはり、全ての次候補文字が条件を満
たさないので、次候補文字予測は失敗する。そこで、特
徴量を用いた文字認識がおこなわれる。Next, similarly, when the next candidate character is predicted for these two characters, the characters in the concatenation table are obtained as the next candidate characters as shown in Table (G). In this case, since the connection table does not include hiragana, the correct answer "su" does not enter the candidates. Recognition of these candidates by the feature amount and determination of candidate characters indicate that there is no character that satisfies the threshold at a distance from “su”. Therefore, all characters are deleted from the next candidate character, and the next candidate character prediction is performed. Fail. Therefore, character recognition is performed using the characteristic amount of a general character image, and as a result, a candidate such as the table (I) is obtained for “su”. Since the determination of the next candidate character has failed, the next candidate character cannot be used to determine the character in the previous candidate character holding unit 4. Therefore, the character “contact”, which is a character having a small distance value, is simply output, and the preceding candidate character holding unit 4 is emptied. Then, "su," obtained by character recognition using the feature amount
“Seng, gen” is stored in the previous candidate character holding unit 4. Further, in the same manner, the next candidate character is predicted using “S, Dimension, Talent”. In this case, since the hiragana is excluded from the connection table, the next candidate character for "su" cannot be predicted. Therefore, the next candidate character prediction is performed only for “dimension”, but if the same processing as above is performed on the candidate obtained as a result, all the next candidate characters satisfy the condition. Otherwise, the next candidate character prediction fails. Therefore, character recognition using the feature amount is performed.

【００２６】このような処理を繰り返して、認識対象と
なる文字画像が無くなるまで、次候補文字予測を用いた
文字認識処理が行われる。By repeating such processing, character recognition processing using the next candidate character prediction is performed until there is no more character image to be recognized.

【００２７】以上に詳述した本実施例は、前候補文字保
持部４を参照して次候補文字予測部５−１によって次に
来る文字を予測することにより、次に来る文字の候補を
減らして高速に文字認識を行うことができる。また、次
候補文字予測によって、後処理を行うのと同様の効果が
期待できるため、後処理を行わなくても高い認識制度が
得られ、後処理に必要な時間を節約できる。In the present embodiment described above, the next character is predicted by the next candidate character predicting unit 5-1 with reference to the preceding candidate character holding unit 4, thereby reducing the number of next character candidates. And perform character recognition at high speed. Further, since the same effect as performing post-processing can be expected by predicting the next candidate character, a high recognition accuracy can be obtained without performing post-processing, and time required for post-processing can be saved.

【００２８】なお、実施例においては、認識処理部３は
特徴量を用いて認識を行うものを例示したが、以下のよ
うな特徴量を用いないで認識を行う構成とすることもで
きる。例えば、画像を入力とするニューラルネットワー
クを用いた認識方法を採用することができる。これは文
字画像の特徴量をとるのではなく、画像そのものをニュ
ーラルネットワークに学習させておき、認識対象の文字
画像を入力すると、該当する文字が出力されるものであ
る。また、文字画像の重ね合わせにより認識するテンプ
レートマッチングなども、特徴量を用いない認識方法の
例として挙げることができる。これは、文字画像そのも
ののテンプレートを辞書として用意しておき、画素ごと
に比較して、最もよく一致したテンプレートの表す文字
を、その文字の認識結果とするものである。In the embodiment, the recognition processing unit 3 performs the recognition using the characteristic amount. However, the recognition processing unit 3 may be configured to perform the recognition without using the following characteristic amount. For example, a recognition method using a neural network that receives an image can be adopted. This is not to take the feature amount of the character image, but to train the image itself in the neural network and input the character image to be recognized, and output the corresponding character. Further, template matching and the like, which are recognized by superimposing character images, can also be cited as examples of a recognition method that does not use a feature amount. In this method, a template of a character image itself is prepared as a dictionary, and the characters represented by the template that best matches each other are compared for each pixel as a recognition result of the character.

【００２９】次候補文字予測方法としては、上記の実施
例の次候補文字予測方法の外に次のような方法を用いる
ことができる。（１）３文字以上の連接テーブルによる処理を併用す
る。例えば３文字の連接テーブルを用いる場合、前候補
保持部は２文字前まで保持しておき、２文字のエントリ
を持つ連接テーブルによって次候補文字を予測すること
になる。こうして予測された文字に対しては、２文字の
連接テーブルから予測された文字よりも評価値を高くし
て、次候補に残りやすくする。処理量は増えるものの、
可能性の高い候補を残すことができ、正確な予測が可能
になる。As the next candidate character prediction method, the following method can be used in addition to the next candidate character prediction method of the above embodiment. (1) A process using a concatenation table of three or more characters is used together. For example, when a three-character concatenation table is used, the previous candidate holding unit holds two characters before, and predicts the next candidate character using the concatenation table having two-character entries. For the character predicted in this manner, the evaluation value is set higher than that of the character predicted from the two-character concatenation table, so that the character can easily remain as the next candidate. Although the processing amount increases,
Candidates with high possibility can be left, and accurate prediction can be made.

【００３０】（２）ひらがなを連接テーブルに含まな
い場合、平仮名と形状が類似の漢字またはカタカナが次
候補文字として予測されると、誤りを生ずる可能性があ
る。これに対しては、類似の非平仮名文字と平仮名との
組をあらかじめテーブルとして持っておき、このような
文字が予測された場合には、誤りやすい平仮名も一緒に
次候補文字に付け加えることが考えられる。こうするこ
とで、ひらがなとよく似た漢字を正解としてしまう誤り
を、ある程度防ぐことができる。例えば、「赤も」とい
う文字列画像に対し、「赤」という字が前候補文字に入
っているとする。そして「赤」の連接テーブルに「毛」
という字があったとすると、次候補文字の中にこの字が
出力される。次に「毛」という字と「も」という字の文
字画像との距離を求めると、この２つの文字が類似文字
であるため閾値を満たしてしまい、「毛」という字が次
候補文字となって、特徴量を用いた文字認識が行われな
くなってしまう。その結果「も」という字は認識されず
「毛」という字が誤って出力されることになる。そこ
で、これを防ぐために図１１のような類似文字テーブル
を用いる。即ち、ひらがなと間違いやすい、「毛」のよ
うな文字に対しては、類似文字として「も」という字を
類似文字テーブルに持っておき、「毛」という字が予測
された場合には自動的に「も」という字を次候補文字に
付け加えるようにするのである（図１２）。こうすると
「毛」のほかに「も」も次候補文字となるため、結果的
に「も」の認識距離が小さいことから、正解である
「も」が出力される。(2) When hiragana is not included in the concatenation table, an error may occur if a kanji or katakana whose shape is similar to hiragana is predicted as the next candidate character. In response to this, a set of similar non-hiragana characters and hiragana characters is stored in advance as a table, and if such a character is predicted, an error-prone hiragana character may be added to the next candidate character. Can be By doing so, it is possible to prevent to some extent an error in which a kanji similar to Hiragana is regarded as the correct answer. For example, it is assumed that the character "red" is included in the preceding candidate character in the character string image "red also". And "hair" on the connection table of "red"
Is output in the next candidate character. Next, when the distance between the character "hair" and the character image of the character "mo" is calculated, the two characters are similar characters and thus meet the threshold, and the character "hair" becomes the next candidate character. As a result, character recognition using the feature amount is not performed. As a result, the character "mo" is not recognized and the character "hair" is erroneously output. Therefore, a similar character table as shown in FIG. 11 is used to prevent this. In other words, for characters such as "hair" that are easily mistaken for hiragana, the character "mo" is stored in the similar character table as a similar character, and if the character "hair" is predicted, Is added to the next candidate character (FIG. 12). In this case, "mo" is also the next candidate character in addition to "hair". As a result, since the recognition distance of "mo" is short, "mo" which is the correct answer is output.

【００３１】（３）ひらがなを連接テーブルに含まな
い場合、連接テーブルによって予測された文字ととも
に、全てのひらがなを次候補に加える方法も考えられ
る。これにより、予測される候補数は全ひらがな文字の
数（数十）だけ増えることになるが、それでも、粗い特
徴量によって大分類を行った結果得られる数よりは少な
くてすむ。また、ひらがなを連接テーブルに含めるのに
比べればテーブルを小さくでき、単純に付け加えるだけ
なのでこの処理による時間はかからない。このような方
法を取ることによって、ひらがなを次候補の中に入れる
ことができるので、（２）と同様にひらがなを誤って非
ひらがな文字と認識してしまうことがなくなり、精度が
向上できる。(3) When the hiragana is not included in the connection table, a method of adding all the hiragana to the next candidate together with the character predicted by the connection table may be considered. As a result, the number of candidates to be predicted increases by the number of all hiragana characters (several tens), but it is still smaller than the number obtained as a result of performing a large classification based on coarse feature amounts. In addition, the size of the table can be reduced as compared with the case where the hiragana is included in the concatenation table. By adopting such a method, the hiragana can be included in the next candidate, so that the hiragana is not mistakenly recognized as a non-hiragana character as in (2), and the accuracy can be improved.

【００３２】（４）（３）のように、ひらがな全てを
一律に次候補に付加するのではなく、特に使用頻度の高
いひらがなや記号、漢字を候補に付加することも考えら
れる。ひらがなの内、使用頻度の高い文字を上位から半
数程度を付加すればひらがなのかなりの部分を予測する
ことが可能である。また、使用頻度の高い記号や漢字を
一律付加し、使用頻度の低いひらがなは一律に付加しな
いようにすることで、付加する候補文字の数を減らすこ
とができるとともに、予測能力を向上することができ
る。(4) As in (3), not all hiragana characters are uniformly added to the next candidate, but it is also conceivable to add hiragana characters, symbols and kanji which are particularly frequently used to the candidates. Of the hiragana, if the most frequently used characters are added to the top half, it is possible to predict a significant part of the hiragana. In addition, by adding frequently used symbols and kanji uniformly and not adding infrequently used hiragana, the number of candidate characters to be added can be reduced, and the prediction ability can be improved. it can.

【００３３】（５）連接文字テーブルは、単純に連接
するかどうかを表すもののかわりに、それぞれの連接確
率を持つものとすることも考えられる。そして、特徴量
による距離値に、連接確率による重みづけをすることに
よって、めったに起こらない連接に対する評価値を低く
して、連接テーブルによる誤りを減らすことができる。(5) The connection character table may have each connection probability instead of simply indicating whether or not connection is performed. Then, by weighting the distance value based on the feature value with the connection probability, it is possible to reduce the evaluation value for the connection that rarely occurs, and reduce errors in the connection table.

【００３４】（６）日本語以外の言語（中国語、ハン
グルなど）への適用も可能である。対象となる言語とし
ては、文字数が多く、個々の文字の連接する組合せが少
ないものが効果的である。(6) Application to languages other than Japanese (Chinese, Korean, etc.) is also possible. As a target language, a language having a large number of characters and a small number of consecutive combinations of individual characters is effective.

【００３５】候補文字決定方法としては、上記の実施例
の候補文字決定方法の外に次のような方法を用いること
ができる。（１）前候補文字の確定を、連接する前候補文字と次
候補文字との距離の合計値によってではなく、前候補文
字のみの距離値によって行う。これは前候補文字を候補
が出た時点ですぐに確定することである。したがって、
前候補文字は一つしか存在せず、前候補再判定や次候補
再判定は行われないことになる。この結果、正確さは減
少するものの、処理が簡単になり処理時間が短縮され
る。As a candidate character determining method, the following method can be used in addition to the candidate character determining method of the above embodiment. (1) The determination of the preceding candidate character is performed not by the total value of the distance between the consecutive candidate character and the next candidate character but by the distance value of only the preceding candidate character. This means that the previous candidate character is determined immediately when the candidate appears. Therefore,
There is only one previous candidate character, and no previous candidate re-determination or next candidate re-determination is performed. As a result, although the accuracy is reduced, the processing is simplified and the processing time is shortened.

【００３６】（２）前候補文字と次候補文字の組の選
択基準として、単純な距離値の代わりに正規化した距離
値を用いる。具体的な正規化の方法としては、候補文字
群の中で最も距離が小さいものを基準とし、他の候補文
字に対する距離値を１位候補の距離値で割ることなどが
考えられる。こうすることで、文字ごとの認識距離のば
らつきによる誤った評価をなくすことができる。(2) A normalized distance value is used instead of a simple distance value as a selection criterion for a set of a preceding candidate character and a next candidate character. As a specific normalization method, it is conceivable to divide the distance value for other candidate characters by the distance value of the first candidate based on the smallest distance among the candidate character groups. By doing so, it is possible to eliminate erroneous evaluation due to variation in the recognition distance for each character.

【００３７】（３）次候補文字判定に用いる認識距離
の閾値は、一律に決める方法以外に、文字ごとに別の値
を指定する方法も考えられる。類似文字が多く誤りやす
い文字の閾値は厳しくし、少ない文字の閾値は緩くする
ようなことが考えられる。距離値の決定は、例えば、対
象となる文字との距離が最も小さい文字までの距離を目
安にして決める方法がある。この方法によって、類似文
字を誤って正解とする可能性が小さくなり、正確な判定
が実現できる。(3) In addition to a method of uniformly determining the threshold value of the recognition distance used for determining the next candidate character, a method of specifying a different value for each character is also conceivable. It is conceivable that the threshold value of a character that is likely to be erroneous with many similar characters is set to be strict, and the threshold value of a character that is small is set to be loose. For example, there is a method of determining the distance value based on the distance to the character having the smallest distance from the target character. According to this method, the possibility that a similar character is erroneously determined to be correct is reduced, and accurate determination can be realized.

【００３８】（４）次候補文字予測が失敗するまでの
前候補文字を全て残しておき、次候補文字予測が失敗し
た時点でそれまでの前候補文字の中で、連接が可能な全
ての組合せの中から最も評価値の高いものを選択する。
このことによって、より広い範囲の連接文字列に対して
評価を行うことができ、精度の向上が見込まれる。(4) All the previous candidate characters until the next candidate character prediction fails are left, and at the time the next candidate character prediction fails, all combinations of the previous candidate characters that can be connected are used. Select the one with the highest evaluation value from.
This makes it possible to evaluate a wider range of concatenated character strings, and is expected to improve the accuracy.

【００３９】（５）実施例では文の前方から予測し、
認識を行っていたが、文の後方から行うことも可能であ
る。この場合には後方から予測する連接テーブルが必要
になる。後方から前方を予測する方が制約をかけやすい
場合もあるので、候補を絞るのに有効である。(5) In the embodiment, prediction is made from the front of the sentence,
Although recognition was performed, it is also possible to perform recognition from the back of the sentence. In this case, a connection table predicted from the rear is required. Prediction from the rear to the front may be more restrictive in some cases, and is effective in narrowing down the candidates.

【００４０】[0040]

【発明の効果】本発明によれば、認識対象文字の前にあ
る文字から認識対象となる文字の候補を予測することに
よって、特徴量を用いて認識を行う必要のある文字を大
幅に減らすようにしたので、精度を落すことなく高速に
文字認識を行うことができる。According to the present invention, by predicting a character candidate to be recognized from a character in front of a character to be recognized, the number of characters that need to be recognized using the feature amount is greatly reduced. Therefore, character recognition can be performed at high speed without lowering the accuracy.

【００４１】また、本発明によれば、予測する方法とし
ては、非ひらがな文字同士の連接する組合せが比較的少
ないことを利用しており、各文字に対して次に来る可能
性のある文字の情報を、文字間の連接テーブルとして用
意したので、次に来る文字の候補を非常に高速に予測す
るとともに、候補の数を従来の大分類処理に比べて減ら
すことが可能となる。Further, according to the present invention, as a method for predicting, the fact that the number of contiguous combinations of non-hiragana characters is relatively small is utilized, and for each character, Since the information is prepared as a concatenation table between characters, it is possible to predict the next character candidate at a very high speed and to reduce the number of candidates as compared with the conventional large classification processing.

【００４２】さらに、後処理に用いる情報である連接テ
ーブルなどを使って、次に来る文字を予測するため、後
処理を行うことなく同程度の認識精度が確保でき、後処
理時間をも節約することができる。Further, since the next character is predicted by using a connection table or the like which is information used for post-processing, the same recognition accuracy can be secured without performing post-processing and post-processing time can be saved. be able to.

[Brief description of the drawings]

【図１】本発明の一実施例の構成を示す図、FIG. 1 is a diagram showing a configuration of an embodiment of the present invention;

【図２】認識処理部の構成を示す図、FIG. 2 is a diagram illustrating a configuration of a recognition processing unit;

【図３】実施例における全体の処理の流れを示す図、FIG. 3 is a diagram showing a flow of overall processing in the embodiment;

【図４】次候補文字予測の処理の流れを示す図、FIG. 4 is a diagram showing a flow of processing of next candidate character prediction;

【図５】候補文字判定の処理の流れを示す図、FIG. 5 is a diagram showing a flow of a candidate character determination process;

【図６】前候補再判定の処理の流れを示す図、FIG. 6 is a diagram showing a flow of a process of re-determining a previous candidate;

【図７】前候補確定、出力の処理の流れを示す図、FIG. 7 is a diagram showing a flow of processing of final candidate determination and output,

【図８】前候補再判定の処理の流れを示す図、FIG. 8 is a diagram showing a flow of a process of re-determining a previous candidate;

【図９】文字連接テーブルの一例を示す図、FIG. 9 is a diagram showing an example of a character connection table;

【図１０】実施例の動作を具体例により説明するため
の図、FIG. 10 is a diagram for explaining the operation of the embodiment using a specific example;

【図１１】類似文字テーブルの一例を示す図、FIG. 11 is a diagram showing an example of a similar character table.

【図１２】類似文字を付加する場合の例を示す図。FIG. 12 is a diagram illustrating an example of adding a similar character.

[Explanation of symbols]

１…文字画像入力部、２…文字切り出し部、３…認識処
理部、３１…特徴量算出部、３２…特徴量間距離計算
部、３３…特徴量−文字コードマッチング部、４…前候
補文字保持部、５−１…次候補文字予測部、５−２…連
接テーブル、候補文字判定部。DESCRIPTION OF SYMBOLS 1 ... Character image input part, 2 ... Character extraction part, 3 ... Recognition processing part, 31 ... Feature amount calculation part, 32 ... Feature amount distance calculation part, 33 ... Feature amount-character code matching part, 4 ... Previous candidate character Holding unit, 5-1: next candidate character prediction unit, 5-2: connection table, candidate character determination unit.

Claims

(57) [Claims]

1. A first character image to be recognized which is adjacent to each other.
In the recognition of the image and the second character image, the candidate characters for the first character image are connected
Characters as preliminary candidate characters for the second character image
And keep out step, wherein the characteristic amount and the second character image of the preliminary candidate character
Evaluate and recognize preliminary candidate characters based on their distance from the volume
Step of determining candidate characters for the target second character image
Between the step and the candidate character for the determined second character image
Redetermines candidate characters for the first character image based on
Steps and candidate characters of the first character image determined by the re-determination
A set of candidate characters for the second image adjacent thereto
The recognition output character for the first character image based on the
And outputting only candidate characters connected to the recognized output character to a second character
Re-determining as a new candidate character for the image
Character recognition method characterized by comprising and.

2. A character image input means for inputting a character image to be read, and a candidate character for a first recognition target character image adjacent to a certain second recognition target character image as a preceding candidate character. before candidate character holding means holds, articulated table for holding a combination of articulating possible characters, based on the concatenated table, said first recognition target character articulating the prior candidate character for the image next candidate characters and next candidate characters predicting means for predicting as, connecting the next candidate characters and the candidate characters determination means for determining the validity of the next climate <br/> auxiliary character predicted characters by the prediction means, and it the front candidate character Next candidate character set
The feature amount of the preceding candidate character and the first recognition target character
Distance to image feature and feature of next candidate character
And the total distance between the feature amount of the second recognition target character and
The previous candidate character corresponding to the pair having the smallest sum
Sure to be output as the recognition output of the first recognition target character
A character recognition device comprising: recognition processing means; and control means for controlling overall processing.