JP6320089B2

JP6320089B2 - Recognition device, recognition method and program

Info

Publication number: JP6320089B2
Application number: JP2014044341A
Authority: JP
Inventors: 鈴木　智久; 智久鈴木
Original assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2014-03-06
Filing date: 2014-03-06
Publication date: 2018-05-09
Anticipated expiration: 2034-03-06
Also published as: JP2015170129A

Description

本発明の実施形態は、認識装置、認識方法およびプログラムに関する。 Embodiments described herein relate generally to a recognition apparatus, a recognition method, and a program.

文字認識における知識処理を行う方法として、認識対象の文字列をモデル化して知識辞書に格納し、モデルに合致する文字列を知識辞書の結果とする方法が知られている。例えば、照合したい単語を登録した単語辞書部と単語辞書部に登録された単語を受理する有限オートマトンを有する単語照合部を備え、有限オートマトンが受理した単語を知識処理の結果として出力するシステムが知られている（例えば、特許文献１参照）。また、文脈自由文法で記述した地名表記と文字認識候補を照合することで知識処理を行う技術が知られている（例えば、特許文献２参照）。 As a method for performing knowledge processing in character recognition, a method is known in which a character string to be recognized is modeled and stored in a knowledge dictionary, and a character string matching the model is used as a result of the knowledge dictionary. For example, a system is known that includes a word dictionary unit that registers words to be collated and a word collation unit that has a finite automaton that accepts words registered in the word dictionary unit, and outputs the words received by the finite automaton as a result of knowledge processing. (For example, refer to Patent Document 1). In addition, a technique for performing knowledge processing by collating a place name notation described in a context-free grammar with a character recognition candidate is known (see, for example, Patent Document 2).

特開平１１−１４３８９３号公報JP-A-11-143893 特許第４００６１７６号公報Japanese Patent No. 4006176

しかしながら、認識対象の文字列をモデル化して、知識処理の結果をモデルに合致する文字列に限定する従来の技術では、知識処理の結果としてありえない文字列および不自然な文字列等を効率良く排除することが困難であった。例えば、英文において母音字が５文字以上続く文字列は、単語として不自然であるが、従来の技術では、こういった文字列を効率良く排除することは困難であった。 However, the conventional technology that models the character string to be recognized and limits the result of knowledge processing to the character string that matches the model efficiently eliminates character strings that are not possible as a result of knowledge processing and unnatural character strings. It was difficult to do. For example, a character string having five or more vowel characters in English is unnatural as a word, but it has been difficult to efficiently eliminate such character strings with conventional techniques.

本発明が解決しようとする課題は、使用されることが無い文字列および使用することが禁止される文字列等を効率良く排除して、文字列を精度良く認識することにある。 The problem to be solved by the present invention is to efficiently eliminate character strings that are not used and character strings that are prohibited from being used, and recognize character strings with high accuracy.

実施形態に係る認識装置は、入力画像から、文字を含むと推測される画素の集合である文字候補を検出する候補検出部と、前記文字候補のそれぞれを認識して、認識結果の候補の文字である少なくとも１つの認識候補を生成する認識部と、前記少なくとも１つの認識候補のそれぞれを、認識対象の文字列をモデル化した知識辞書と照合して、前記入力画像に含まれると推測される文字列と知識辞書を照合して得られる照合結果を生成する照合部と、前記照合結果のうち、禁止対象の文字列を含む文字列と知識辞書を照合して得られた照合結果を削除する禁則処理部を備える。 The recognition apparatus according to the embodiment recognizes each of the character candidates from a candidate detection unit that detects a character candidate that is a set of pixels estimated to include a character from the input image, and recognizes each of the character candidates. A recognition unit that generates at least one recognition candidate, and each of the at least one recognition candidate is compared with a knowledge dictionary that models a character string to be recognized, and is estimated to be included in the input image. A collation unit that generates a collation result obtained by collating the character string with the knowledge dictionary, and deleting a collation result obtained by collating the character string including the prohibited character string with the knowledge dictionary from the collation result. A prohibition processing unit is provided.

図１は、実施形態に係る認識装置１０の構成を示す図である。FIG. 1 is a diagram illustrating a configuration of a recognition device 10 according to the embodiment. 図２は、実施形態に係る認識装置１０の処理を示すフロー図である。FIG. 2 is a flowchart showing processing of the recognition apparatus 10 according to the embodiment. 図３は、入力画像の一例を示す図である。FIG. 3 is a diagram illustrating an example of an input image. 図４は、様式データの構成を示す図である。FIG. 4 is a diagram showing the configuration of style data. 図５は、入力画像から一連の文字候補を生成する処理を示す図である。FIG. 5 is a diagram illustrating a process for generating a series of character candidates from an input image. 図６は、断片データの構成を示す図である。FIG. 6 is a diagram showing the structure of fragment data. 図７は、断片番号の一例を示す図である。FIG. 7 is a diagram illustrating an example of a fragment number. 図８は、文字候補データの構成を示す図である。FIG. 8 is a diagram illustrating a configuration of character candidate data. 図９は、文字候補番号の一例を示す図である。FIG. 9 is a diagram illustrating an example of character candidate numbers. 図１０は、文字候補の始点番号および終点番号の一例を示す図である。FIG. 10 is a diagram illustrating an example of the start point number and end point number of a character candidate. 図１１は、文字候補マトリクスの一例を示す図である。FIG. 11 is a diagram illustrating an example of a character candidate matrix. 図１２は、文字認識辞書の構成を示す図である。FIG. 12 is a diagram showing the configuration of the character recognition dictionary. 図１３は、認識候補の配列の構成を示す図である。FIG. 13 is a diagram showing the configuration of recognition candidate sequences. 図１４は、知識辞書の構成を示す図である。FIG. 14 is a diagram illustrating a configuration of the knowledge dictionary. 図１５は、禁則辞書の構成を示す図である。FIG. 15 is a diagram showing a configuration of the prohibition dictionary. 図１６は、第一の非決定性有限オートマトンの一例を示す図である。FIG. 16 is a diagram illustrating an example of a first nondeterministic finite automaton. 図１７は、第一の非決定性有限オートマトンを変換することで得られる第二の非決定性有限オートマトンの一例を示す図である。FIG. 17 is a diagram illustrating an example of a second non-deterministic finite automaton obtained by converting the first non-deterministic finite automaton. 図１８は、照合結果データの構成を示す図である。FIG. 18 is a diagram illustrating a configuration of the collation result data. 図１９は、照合処理を示すフロー図である。FIG. 19 is a flowchart showing the matching process. 図２０は、知識辞書探索処理を示すフロー図である。FIG. 20 is a flowchart showing the knowledge dictionary search process. 図２１は、知識辞書探索処理でのデータのアクセスの流れの一例を示す図である。FIG. 21 is a diagram illustrating an example of the flow of data access in the knowledge dictionary search process. 図２２は、禁則辞書探索処理を示すフロー図である。FIG. 22 is a flowchart showing the prohibition dictionary search process. 図２３は、禁則辞書探索処理でのデータのアクセスの流れの一例を示す図である。FIG. 23 is a diagram illustrating an example of the flow of data access in the forbidden dictionary search process. 図２４は、結果抽出の処理の流れを示すフロー図である。FIG. 24 is a flowchart showing the result extraction process. 図２５は、結果抽出において参照されるデータとスタック上に積まれる文字コードの一例を示す図である。FIG. 25 is a diagram illustrating an example of data referred to in the result extraction and a character code stacked on the stack. 図２６は、変形例に係る認識装置１０で認識結果として用いられる文字とそれらの文字の種別を表す記号の一例を示す図である。FIG. 26 is a diagram illustrating an example of characters used as recognition results in the recognition apparatus 10 according to the modification and symbols representing the types of those characters. 図２７は、変形例に係る認識装置１０で認識される文字列を文字の種別を表す記号の列として表した知識辞書の内容の一例を示す図である。FIG. 27 is a diagram illustrating an example of the contents of a knowledge dictionary in which a character string recognized by the recognition apparatus 10 according to the modification is represented as a symbol string representing a character type. 図２８は、変形例に係る照合結果データの一例を示す図である。FIG. 28 is a diagram illustrating an example of collation result data according to the modification. 図２９は、実施形態に係る認識装置１０のハードウェア構成を示す図である。FIG. 29 is a diagram illustrating a hardware configuration of the recognition apparatus 10 according to the embodiment.

図１は、実施形態に係る認識装置１０の構成を示す図である。認識装置１０は、例えばスキャナ等により読み取られた入力画像に含まれる文字列を認識し、認識した文字列を出力する。 FIG. 1 is a diagram illustrating a configuration of a recognition device 10 according to the embodiment. For example, the recognition device 10 recognizes a character string included in an input image read by a scanner or the like, and outputs the recognized character string.

認識装置１０は、入力部３０と、入力画像記憶部３２と、様式データ記憶部３４と、候補検出部３６と、候補記憶部３８と、文字認識辞書記憶部４０と、認識部４２と、知識辞書記憶部４４と、照合部４６と、照合結果記憶部４８と、禁則辞書記憶部５０と、禁則処理部５２と、結果抽出部５４と、出力部５６を備える。 The recognition device 10 includes an input unit 30, an input image storage unit 32, a style data storage unit 34, a candidate detection unit 36, a candidate storage unit 38, a character recognition dictionary storage unit 40, a recognition unit 42, knowledge A dictionary storage unit 44, a collation unit 46, a collation result storage unit 48, a prohibition dictionary storage unit 50, a prohibition processing unit 52, a result extraction unit 54, and an output unit 56 are provided.

入力部３０は、スキャナ等により取り込まれた入力画像を入力する。入力部３０は、ネットワーク等を介して他のコンピュータから入力画像を入力してもよい。入力画像記憶部３２は、入力部３０により入力された入力画像を記憶する。 The input unit 30 inputs an input image captured by a scanner or the like. The input unit 30 may input an input image from another computer via a network or the like. The input image storage unit 32 stores the input image input by the input unit 30.

様式データ記憶部３４は、入力画像における文字列が記載された領域を特定する様式データを記憶する。 The form data storage unit 34 stores form data for specifying an area in which a character string in the input image is described.

候補検出部３６は、様式データ記憶部３４に記憶された様式データに基づいて、入力画像から、文字候補を検出する。それぞれの文字候補は、１つの文字を含むと推測される画素の集合である。候補検出部３６は、検出した文字候補を候補記憶部３８に書き込む。 The candidate detection unit 36 detects a character candidate from the input image based on the format data stored in the format data storage unit 34. Each character candidate is a set of pixels estimated to include one character. The candidate detection unit 36 writes the detected character candidate in the candidate storage unit 38.

候補記憶部３８は、文字候補を記憶する。さらに、候補記憶部３８は、文字候補のそれぞれに対応させて、その文字候補の認識結果の候補の文字である認識候補を記憶する。 The candidate storage unit 38 stores character candidates. Furthermore, the candidate memory | storage part 38 memorize | stores the recognition candidate which is a candidate character of the recognition result of the character candidate corresponding to each of a character candidate.

文字認識辞書記憶部４０は、文字認識辞書を記憶する。文字認識辞書は、認識対象の画像と、予め登録された文字のそれぞれとの類似度を算出するための情報を格納する。 The character recognition dictionary storage unit 40 stores a character recognition dictionary. The character recognition dictionary stores information for calculating the degree of similarity between the image to be recognized and each character registered in advance.

認識部４２は、文字認識辞書記憶部４０に記憶された文字認識辞書に基づいて、候補記憶部３８に記憶された文字候補のそれぞれを認識する。そして、認識部４２は、１つの文字候補に対して、認識結果の候補の文字である少なくとも１つの認識候補を生成する。認識部４２は、生成した少なくとも１つの認識候補を、文字候補に対応付けて候補記憶部３８に書き込む。 The recognition unit 42 recognizes each of the character candidates stored in the candidate storage unit 38 based on the character recognition dictionary stored in the character recognition dictionary storage unit 40. And the recognition part 42 produces | generates the at least 1 recognition candidate which is a candidate character of a recognition result with respect to one character candidate. The recognition unit 42 writes the generated at least one recognition candidate in the candidate storage unit 38 in association with the character candidate.

知識辞書記憶部４４は、認識対象の文字列をモデル化した知識辞書を記憶する。本実施形態においては、知識辞書は、認識対象の文字列をモデル化した決定性有限オートマトンである。 The knowledge dictionary storage unit 44 stores a knowledge dictionary obtained by modeling a character string to be recognized. In the present embodiment, the knowledge dictionary is a deterministic finite automaton that models a character string to be recognized.

照合部４６は、少なくとも１つの認識候補のそれぞれを知識辞書と照合して、入力画像に含まれると推測される文字列と知識辞書を照合して得られる少なくとも１つの照合結果を生成する。この過程において、照合部４６は、対応する文字列の尤もらしさを表すスコアを含む照合結果を生成する。そして、照合部４６は、生成した少なくとも１つの照合結果を照合結果記憶部４８に書き込む。 The collation unit 46 collates each knowledge candidate with at least one recognition candidate, and generates at least one collation result obtained by collating the character string estimated to be included in the input image with the knowledge dictionary. In this process, the matching unit 46 generates a matching result including a score representing the likelihood of the corresponding character string. Then, the collation unit 46 writes the generated at least one collation result in the collation result storage unit 48.

照合結果記憶部４８は、照合部４６により生成された少なくとも１つの照合結果を記憶する。照合結果記憶部４８は、文字候補の認識候補を並べて得られる文字列を先頭から知識辞書と照合していく過程における開始時点、途中段階および完了時点での照合結果を記憶する。なお、照合部４６は、照合の途中段階において、記憶領域を節約することを目的として、スコアの低い照合結果を削除してもよい。 The collation result storage unit 48 stores at least one collation result generated by the collation unit 46. The collation result storage unit 48 stores collation results at the start point, the middle stage, and the completion point in the process of collating character strings obtained by arranging recognition candidates of character candidates with the knowledge dictionary from the top. Note that the collation unit 46 may delete a collation result having a low score for the purpose of saving the storage area in the middle of the collation.

禁則辞書記憶部５０は、禁止対象の文字列をモデル化した禁則辞書を記憶する。本実施形態においては、禁則辞書は、禁止対象の文字列をモデル化した決定性有限オートマトンである。 The prohibition dictionary storage unit 50 stores a prohibition dictionary that models a prohibited character string. In the present embodiment, the prohibition dictionary is a deterministic finite automaton that models a character string to be prohibited.

禁則処理部５２は、少なくとも１つの照合結果のうち、禁止対象の文字列を含む文字列と知識辞書を照合して得られた照合結果を削除する。本実施形態において、禁則処理部５２は、照合結果により特定されるそれぞれの文字列を決定性有限オートマトンである禁則辞書と照合し、文字列が禁則辞書の決定性有限オートマトンで受理された場合、対応する照合結果を削除する。 The prohibition processing unit 52 deletes the collation result obtained by collating the character string including the prohibited character string with the knowledge dictionary from at least one collation result. In the present embodiment, the forbidden processing unit 52 collates each character string specified by the collation result with the forbidden dictionary that is a deterministic finite automaton, and responds when the character string is received by the deterministic finite automaton of the forbidden dictionary. Delete verification results.

禁則処理部５２は、照合部４６による照合の途中段階において禁止対象の文字列を含む文字列と知識辞書を照合して得られた照合結果が発生した場合、その途中段階の照合結果を削除してもよい。また、禁則処理部５２は、照合部４６による照合が全て完了した後において、複数の照合結果のそれぞれに禁止対象の文字列を含むかを照合して、禁止対象の文字列を含む文字列と知識辞書を照合して得られた照合結果を削除してもよい。 When the collation result obtained by collating the character string including the character string to be prohibited with the knowledge dictionary is generated in the middle stage of collation by the collation section 46, the prohibition processing section 52 deletes the collation result in the middle stage. May be. Further, after all the collations by the collation unit 46 are completed, the prohibition processing unit 52 collates whether each of a plurality of collation results includes a prohibited character string, and includes a character string including the prohibited character string. You may delete the collation result obtained by collating a knowledge dictionary.

結果抽出部５４は、照合部４６による照合および禁則処理部５２による削除が全て完了した後において、照合結果記憶部４８に記憶された少なくとも１つの照合結果からスコアに基づき１個以上の照合結果を選択し、選択した１個以上の照合結果により特定される文字列を抽出する。結果抽出部５４は、一例として、スコアが最も良い照合結果により特定される文字列を抽出する。 After all the collation by the collation unit 46 and the deletion by the prohibition processing unit 52 are completed, the result extraction unit 54 obtains one or more collation results based on the score from at least one collation result stored in the collation result storage unit 48. A character string specified by the selected one or more matching results is extracted. As an example, the result extraction unit 54 extracts a character string specified by a matching result having the best score.

出力部５６は、結果抽出部５４により抽出された文字列を外部へと出力する。 The output unit 56 outputs the character string extracted by the result extraction unit 54 to the outside.

図２は、実施形態に係る認識装置１０の処理を示すフロー図である。まず、ステップＳ１において、認識装置１０は、入力画像を入力する。 FIG. 2 is a flowchart showing processing of the recognition apparatus 10 according to the embodiment. First, in step S1, the recognition apparatus 10 inputs an input image.

続いて、ステップＳ２において、認識装置１０は、入力画像から、１つの文字を含むと推測される画素の集合である文字候補を検出する。続いて、ステップＳ３において、認識装置１０は、文字認識辞書に基づいて、文字候補のそれぞれを認識して、認識結果の候補の文字である少なくとも１つの認識候補を生成する。 Subsequently, in step S 2, the recognition apparatus 10 detects a character candidate that is a set of pixels presumed to include one character from the input image. Subsequently, in step S3, the recognition apparatus 10 recognizes each of the character candidates based on the character recognition dictionary, and generates at least one recognition candidate that is a candidate character of the recognition result.

続いて、ステップＳ４において、認識装置１０は、少なくとも１つの認識候補のそれぞれを知識辞書と照合して、入力画像に含まれると推測される文字列と知識辞書を照合して得られる少なくとも１つの照合結果を生成する。これとともに、ステップＳ４において、認識装置１０は、照合結果により特定されるそれぞれの文字列を禁則辞書と照合し、禁則辞書の決定性有限オートマトンで受理される文字列と知識辞書を照合して得られた照合結果を削除する。 Subsequently, in step S4, the recognition apparatus 10 collates each knowledge candidate with at least one recognition candidate, and at least one obtained by collating the character string estimated to be included in the input image with the knowledge dictionary. Generate verification results. At the same time, in step S4, the recognition apparatus 10 collates each character string specified by the collation result with the prohibition dictionary, and collates the character string accepted by the deterministic finite automaton of the prohibition dictionary with the knowledge dictionary. Delete the matching result.

続いて、ステップＳ５において、認識装置１０は、照合処理が全て完了した後において、照合結果からスコアに基づき１つの照合結果を選択し、選択した照合結果により特定される文字列を抽出し、認識結果の文字列とする。文字候補の個数が０個の場合、すなわち入力画像上に文字が含まれない場合、ステップＳ５において選択すべき照合結果が生成されないが、この場合は認識結果の文字列を空文字列とする。最後に、ステップＳ６において、認識装置１０は、認識結果の文字列を出力する。 Subsequently, in step S5, the recognition device 10 selects one collation result based on the score from the collation result after extracting all the collation processes, extracts a character string specified by the selected collation result, and recognizes it. The result string. When the number of character candidates is 0, that is, when no character is included in the input image, a collation result to be selected is not generated in step S5. In this case, the character string of the recognition result is an empty character string. Finally, in step S6, the recognition device 10 outputs a character string as a recognition result.

図３は、入力画像の一例を示す図である。本実施形態において、入力画像は、図３に示すように、商品を発注するための注文書をスキャナ等により取り込んで得られた画像データである。入力画像の予め定められた記入枠の内側には、発注者の名前が記入されている。本実施形態において、認識装置１０は、予め定められた記入枠の内側に記入された日本語の名前の文字列を認識し、認識した文字列を表すテキストデータを出力する。 FIG. 3 is a diagram illustrating an example of an input image. In the present embodiment, as shown in FIG. 3, the input image is image data obtained by taking an order form for ordering a product with a scanner or the like. The name of the orderer is entered inside a predetermined entry frame of the input image. In the present embodiment, the recognition apparatus 10 recognizes a character string with a Japanese name entered inside a predetermined entry frame, and outputs text data representing the recognized character string.

図４は、様式データの構成を示す図である。様式データ記憶部３４は、予め作成された様式データを記憶する。 FIG. 4 is a diagram showing the configuration of style data. The form data storage unit 34 stores form data created in advance.

様式データは、図４に示すように、入力画像に含まれる記入枠の個数を示す値と、記入枠の個数分の記入枠レコードを格納する配列を含む。本例において、配列の最初のエントリのインデックスは、０である。すなわち、配列は、０オリジンである。なお、本実施形態で用いる他の配列も、特別の記載が無い限り０オリジンである。記入枠レコードのそれぞれは、入力画像に含まれるそれぞれの記入枠に一対一で対応する。 As shown in FIG. 4, the format data includes a value indicating the number of entry frames included in the input image and an array for storing entry frame records for the number of entry frames. In this example, the index of the first entry in the array is zero. That is, the sequence is 0 origin. Other sequences used in this embodiment are also 0 origin unless otherwise specified. Each entry frame record corresponds to each entry frame included in the input image on a one-to-one basis.

それぞれの記入枠レコードは、入力画像内における、対応する記入枠の位置を示す情報を含む。本例において、記入枠の位置を示す情報は、対応する記入枠の左右の端のＸ座標（横方向の座標）および上下の端のＹ座標（縦方向の座標）である。 Each entry frame record includes information indicating the position of the corresponding entry frame in the input image. In this example, the information indicating the position of the entry frame is the X coordinate (horizontal coordinate) of the left and right ends and the Y coordinate (vertical coordinate) of the upper and lower ends of the corresponding entry frame.

図５は、入力画像から一連の文字候補を生成する処理を示す図である。候補検出部３６は、記入枠レコードに示された情報に基づいて記入枠の領域を特定し（例えば図５中の点線で囲まれた領域）、特定した領域から部分領域画像を抽出する。続いて、候補検出部３６は、抽出した部分領域画像を二値化して二値画像を生成する。続いて、候補検出部３６は、二値画像上で黒画素の連結成分を抽出し、それぞれの連結成分に対してラベリングを行う。ラベリングしたそれぞれの連結成分は、文字を構成する要素であり、断片と呼ぶ。続いて、候補検出部３６は、連続して並んだ１個以上の断片を組み合わせて、文字候補を生成する。文字候補は、１個の文字を表していると推測される画素の集合である。 FIG. 5 is a diagram illustrating a process for generating a series of character candidates from an input image. The candidate detection unit 36 specifies an area of the entry frame based on the information indicated in the entry frame record (for example, an area surrounded by a dotted line in FIG. 5), and extracts a partial area image from the specified area. Subsequently, the candidate detection unit 36 binarizes the extracted partial region image to generate a binary image. Subsequently, the candidate detection unit 36 extracts black pixel connected components on the binary image and performs labeling on each connected component. Each labeled connected component is an element constituting a character and is called a fragment. Subsequently, the candidate detection unit 36 generates a character candidate by combining one or more pieces arranged in a row. A character candidate is a set of pixels presumed to represent one character.

図６は、断片データの構成を示す図である。候補記憶部３８は、断片を表す断片データを記憶する。断片データは、図６に示すように、断片の個数を示す値と、断片の個数分の断片レコードを格納する配列を含む。断片レコードのそれぞれは、それぞれの断片と一対一で対応する。 FIG. 6 is a diagram showing the structure of fragment data. The candidate storage unit 38 stores fragment data representing a fragment. As shown in FIG. 6, the fragment data includes a value indicating the number of fragments and an array for storing fragment records for the number of fragments. Each fragment record has a one-to-one correspondence with each fragment.

それぞれの断片レコードは、対応する断片の位置を示す情報と、断片の形状を示す二値画像を含む。本例において、断片の位置を示す情報は、対応する断片の左右の端のＸ座標および上下の端のＹ座標であり、当該断片の外接矩形を示す。断片の形状を示す二値画像は、当該断片の外接矩形内で当該連結成分上の画素を黒画素とし、残りを白画素とした画像である。 Each fragment record includes information indicating the position of the corresponding fragment and a binary image indicating the shape of the fragment. In this example, the information indicating the position of the fragment is the X coordinate of the left and right ends of the corresponding fragment and the Y coordinate of the upper and lower ends, and indicates the circumscribed rectangle of the fragment. A binary image indicating the shape of a fragment is an image in which pixels on the connected component are black pixels and the rest are white pixels in a circumscribed rectangle of the fragment.

候補検出部３６は、それぞれの断片について、中心のＸ座標と、中心のＹ座標を算出する。中心のＸ座標は、左右の端のＸ座標の平均値である。中心のＹ座標は、上下の端のＹ座標の平均値である。そして、候補検出部３６は、配列内の複数の断片レコードを、中心のＸ座標の昇順に整列する。これにより、候補検出部３６は、配列内の複数の断片レコードを、記入枠における文字記入方向（本例では左から右に向かう方向）に整列することができる。 The candidate detection unit 36 calculates the center X coordinate and the center Y coordinate for each fragment. The center X coordinate is an average value of the X coordinates of the left and right ends. The center Y coordinate is an average value of the Y coordinates of the upper and lower ends. Then, the candidate detection unit 36 sorts the plurality of fragment records in the array in ascending order of the center X coordinate. Thereby, the candidate detection unit 36 can align the plurality of fragment records in the array in the character entry direction in the entry frame (in this example, the direction from left to right).

図７は、断片番号の一例を示す図である。それぞれの断片レコードは、配列のインデックスにより識別される。断片レコードを文字記入方向に整列した後のインデックスを、断片番号と呼ぶ。従って、それぞれの断片には、図７に示すように断片番号が対応付けられる。 FIG. 7 is a diagram illustrating an example of a fragment number. Each fragment record is identified by an array index. The index after the fragment records are arranged in the character entry direction is called a fragment number. Therefore, each fragment is associated with a fragment number as shown in FIG.

図８は、文字候補データの構成を示す図である。候補検出部３６は、連続して並んだ１個以上の断片を組み合わせて、文字候補を生成する。この過程において、候補検出部３６は、外接の矩形の横幅Ｌが予め定められた長さ（Ｌｍａｘ）以下となる全てのパターンで１個以上の断片を組み合わせて、文字候補を生成する。 FIG. 8 is a diagram illustrating a configuration of character candidate data. The candidate detection unit 36 generates a character candidate by combining one or more pieces arranged in a row. In this process, the candidate detection unit 36 generates a character candidate by combining one or more fragments in all patterns in which the lateral width L of the circumscribed rectangle is equal to or less than a predetermined length (Lmax).

候補記憶部３８は、文字候補を表す文字候補データを記憶する。文字候補データは、図８に示すように、生成した文字候補の個数を示す値と、文字候補マトリクス（詳細後述）と、文字候補の個数分の文字候補レコードを格納する配列を含む。文字候補レコードのそれぞれは、それぞれの文字候補と一対一で対応する。 The candidate storage unit 38 stores character candidate data representing character candidates. As shown in FIG. 8, the character candidate data includes a value indicating the number of generated character candidates, a character candidate matrix (described later in detail), and an array for storing character candidate records for the number of character candidates. Each character candidate record has a one-to-one correspondence with each character candidate.

それぞれの文字候補レコードは、対応する文字候補の位置を示す情報と、対応する文字候補の始点番号および終点番号（詳細後述）と、文字候補の形状を示す二値画像と、認識候補エントリを含む認識候補の配列（詳細後述）を含む。本例において、文字候補の位置を示す情報は、対応する文字候補の左右の端のＸ座標および上下の端のＹ座標であり、二値画像上での当該文字候補の外接矩形を示す。文字候補の形状を示す二値画像は、当該文字候補の外接矩形内で当該文字候補上の画素を黒画素とし、残りを白画素とした画像である。認識候補エントリは、認識部４２により値が設定され、候補検出部３６では値は設定されない。 Each character candidate record includes information indicating the position of the corresponding character candidate, a start point number and an end point number (described later in detail) of the corresponding character candidate, a binary image indicating the shape of the character candidate, and a recognition candidate entry. Includes recognition candidate sequences (described in detail below). In this example, the information indicating the position of the character candidate is the X coordinate of the left and right ends and the Y coordinate of the upper and lower ends of the corresponding character candidate, and indicates the circumscribed rectangle of the character candidate on the binary image. A binary image showing the shape of a character candidate is an image in which the pixels on the character candidate are black pixels and the rest are white pixels in the circumscribed rectangle of the character candidate. A value of the recognition candidate entry is set by the recognition unit 42, and no value is set by the candidate detection unit 36.

図９は、文字候補番号の一例を示す図である。それぞれの文字候補レコードは、配列のインデックスにより識別される。文字候補レコードのインデックスを、文字候補番号と呼ぶ。従って、それぞれの文字候補には、図９に示すように文字候補番号が対応付けられる。 FIG. 9 is a diagram illustrating an example of character candidate numbers. Each character candidate record is identified by an array index. The index of the character candidate record is called a character candidate number. Therefore, a character candidate number is associated with each character candidate as shown in FIG.

図１０は、文字候補の始点番号および終点番号の一例を示す図である。文字候補は、連続して並んだ１個以上の断片を組み合わせて生成される。このため文字候補は、元となった１個以上の断片の並びのうちの先頭の断片に対する断片番号と、最後の断片に対する断片番号に１を加算した値とのセットで、一意に識別することができる。 FIG. 10 is a diagram illustrating an example of the start point number and end point number of a character candidate. Character candidates are generated by combining one or more pieces arranged in series. For this reason, character candidates are uniquely identified by a set of a fragment number for the first fragment in the sequence of one or more fragments as a source and a value obtained by adding 1 to the fragment number for the last fragment. Can do.

本実施形態では、先頭の断片に対する断片番号を、その文字候補の始点番号と呼び、最後の断片に対する断片番号に１を加算した値を、その文字候補の終点番号と呼ぶ。従って、それぞれの文字候補には、図１０に示すように、始点番号および終点番号が対応付けられる。なお、始点番号および終点番号は、文字候補の区切り位置を表すことから、始点番号および終点番号の両者をまとめて位置番号とも呼ぶ。 In this embodiment, the fragment number for the first fragment is called the start point number of the character candidate, and the value obtained by adding 1 to the fragment number for the last fragment is called the end point number of the character candidate. Accordingly, each character candidate is associated with a start point number and an end point number as shown in FIG. Note that the start point number and the end point number represent character character delimiter positions, and thus both the start point number and the end point number are collectively referred to as position numbers.

図１１は、文字候補マトリクスの一例を示す図である。文字候補マトリクスは、図１１に示すように、始点番号を第１インデックス、終点番号を第２インデックスとする文字候補番号の二次元配列である。文字候補マトリクスは、文字候補レコードの生成の開始前に、全てのエントリを−１に設定することで初期化される。そして、候補検出部３６は、文字候補を作成する毎に、文字候補マトリクスの対応するエントリに文字候補番号を書き込む。 FIG. 11 is a diagram illustrating an example of a character candidate matrix. As shown in FIG. 11, the character candidate matrix is a two-dimensional array of character candidate numbers having a start point number as a first index and an end point number as a second index. The character candidate matrix is initialized by setting all entries to -1 before starting the generation of character candidate records. Each time the candidate detection unit 36 creates a character candidate, it writes the character candidate number in the corresponding entry of the character candidate matrix.

図１２は、文字認識辞書の構成を示す図である。文字認識辞書記憶部４０は、予め作成された文字認識辞書を記憶する。文字認識辞書は、図１２に示すように、辞書エントリの個数を示す値と、辞書エントリを格納する配列を含む。 FIG. 12 is a diagram showing the configuration of the character recognition dictionary. The character recognition dictionary storage unit 40 stores a character recognition dictionary created in advance. As shown in FIG. 12, the character recognition dictionary includes a value indicating the number of dictionary entries and an array for storing dictionary entries.

それぞれの辞書エントリは、文字コードと、予め定められたＤ_ｓｕｂ個の基底ベクトルを含む。基底ベクトルは、文字コードに対応する文字を表す部分空間の特徴ベクトルである。特徴ベクトルは、一例として、対応する文字の二値画像を縦方向および横方向に予め任意に定めた個数で分割し、分割した領域のそれぞれの黒画素の個数の比率を求め、求めた一連の比率を特徴ベクトルの要素とすることで算出される。 Each dictionary entry includes a character code and predetermined D _sub basis vectors. A base vector is a feature vector of a subspace representing a character corresponding to a character code. As an example, the feature vector is obtained by dividing a binary image of a corresponding character by a predetermined number in the vertical direction and the horizontal direction, obtaining a ratio of the number of black pixels in each divided area, It is calculated by using the ratio as an element of the feature vector.

図１３は、認識候補の配列の構成を示す図である。文字候補レコードに格納される認識候補の配列は、図１３に示すように、予め定められたＮ_ｃａｎｄ個の認識候補エントリを含む。それぞれの認識候補エントリは、文字コードと、類似度を含む。 FIG. 13 is a diagram showing the configuration of recognition candidate sequences. The recognition candidate array stored in the character candidate record includes predetermined N _cand recognition candidate entries as shown in FIG. Each recognition candidate entry includes a character code and a similarity.

認識部４２は、文字候補のそれぞれに対して文字認識をして、認識結果の候補の文字である少なくとも１つの認識候補を生成する。本実施形態においては、認識部４２は、それぞれの文字候補レコードに対して、予め定められたＮ_ｃａｎｄ個の認識候補エントリを生成して、認識候補の配列に書き込む。 The recognition unit 42 performs character recognition for each of the character candidates, and generates at least one recognition candidate that is a candidate character of the recognition result. In the present embodiment, the recognition unit 42 generates N _cand recognition candidate entries determined in advance for each character candidate record, and writes them in the recognition candidate array.

より具体的には、認識部４２は、対応する文字候補レコードに含まれる二値画像から特徴ベクトルを抽出し、文字認識辞書のそれぞれの辞書エントリに格納された基底ベクトルと部分空間法により照合して類似度を算出する。認識部４２は、類似度が上位Ｎ_ｃａｎｄ個の辞書エントリのそれぞれについて、その辞書エントリに格納された文字コードを抽出し、抽出した文字コードと算出した類似度を含む認識候補エントリを生成する。そして、認識部４２は、生成したＮ_ｃａｎｄ個の認識候補エントリを対応する文字候補レコードの認識候補の配列に書き込む。さらに、認識部４２は、それぞれの文字候補レコードの認識候補の配列に含まれる認識候補エントリを、類似度の降順で整列する。 More specifically, the recognition unit 42 extracts a feature vector from a binary image included in the corresponding character candidate record, and collates it with a base vector stored in each dictionary entry of the character recognition dictionary by a subspace method. To calculate the similarity. The recognizing unit 42 extracts the character code stored in the dictionary entry for each of the top N _cand dictionary entries having the similarity, and generates a recognition candidate entry including the extracted character code and the calculated similarity. Then, the recognition unit 42 _writes the generated N _cand recognition candidate entries in the recognition candidate array of the corresponding character candidate record. Furthermore, the recognition unit 42 arranges the recognition candidate entries included in the recognition candidate arrays of the respective character candidate records in descending order of similarity.

図１４は、知識辞書の構成を示す図である。知識辞書記憶部４４は、設計者等が予め作成した知識辞書を記憶する。 FIG. 14 is a diagram illustrating a configuration of the knowledge dictionary. The knowledge dictionary storage unit 44 stores a knowledge dictionary created in advance by a designer or the like.

本実施形態において、知識辞書は、認識対象の文字列をモデル化した決定性有限オートマトンである。本実施形態では、決定性有限オートマトンである知識辞書を、ＤＦＡαとも呼ぶ。ＤＦＡαは、例えば、設計者が認識対象の文字列を正規表現で記述し、その正規表現を決定性有限オートマトンに変換することで生成される。 In the present embodiment, the knowledge dictionary is a deterministic finite automaton that models a character string to be recognized. In the present embodiment, a knowledge dictionary that is a deterministic finite automaton is also referred to as DFAα. For example, DFAα is generated by a designer describing a character string to be recognized in a regular expression and converting the regular expression into a deterministic finite automaton.

ＤＦＡαは、図１４に示すように、状態数を示す値と、状態の数分の状態レコードを格納する状態配列と、エッジの数分のエッジレコードを格納するエッジ配列を含む。 As shown in FIG. 14, DFAα includes a value indicating the number of states, a state array for storing state records for the number of states, and an edge array for storing edge records for the number of edges.

それぞれの状態レコードは、ＤＦＡαに含まれるそれぞれの状態と一対一に対応し、状態は状態レコードの番号すなわち状態番号で一意に識別される。なお、状態配列は０オリジンであり、従って状態番号が０の状態は開始状態である。それぞれの状態レコードは、受理状態フラグと、エッジ配列内のエッジレコードへのポインタと、エッジレコードの要素数を含む。 Each state record has a one-to-one correspondence with each state included in DFAα, and the state is uniquely identified by the number of the state record, that is, the state number. It should be noted that the state array is 0 origin, and therefore the state with the state number 0 is the start state. Each status record includes an acceptance status flag, a pointer to the edge record in the edge array, and the number of elements in the edge record.

受理状態フラグは、当該状態が受理状態であるか否かを示す。受理状態フラグは、一例として、１の場合に受理状態であることを示し、０の場合に受理状態ではないことを示す。 The acceptance state flag indicates whether or not the state is an acceptance state. As an example, the acceptance status flag indicates that it is in an acceptance state when it is 1, and indicates that it is not in an acceptance state when it is 0.

エッジレコードへのポインタは、エッジ配列内における、当該状態から出て行くエッジの集合の格納位置を示す。エッジレコードの要素数は、当該状態から出て行くエッジの個数を表す。エッジレコードへのポインタおよび要素数により、当該状態から出て行く全てのエッジに対応するエッジレコードを特定することができる。 The pointer to the edge record indicates the storage position of the set of edges that exit from the state in the edge array. The number of elements in the edge record represents the number of edges that exit from the state. The edge record corresponding to all the edges going out from the state can be specified by the pointer to the edge record and the number of elements.

それぞれのエッジレコードは、ＤＦＡαに含まれるそれぞれのエッジと一対一に対応する。それぞれのエッジレコードは、遷移先の状態番号と、コードを含む。 Each edge record has a one-to-one correspondence with each edge included in DFAα. Each edge record includes a transition destination state number and a code.

遷移先の状態番号は、当該エッジによる遷移先の状態を特定する状態番号を表す。 The state number of the transition destination represents a state number that specifies the state of the transition destination by the edge.

コードは、当該エッジにより表される遷移を起こす入力記号を表す。本実施形態においては、コードには、文字を表す文字コードが格納される。従って、ＤＦＡαでは、認識候補の文字を表す文字コードにより、ある状態から他の状態への遷移が起きる。 The code represents the input symbol that causes the transition represented by the edge. In the present embodiment, a character code representing a character is stored in the code. Therefore, in DFAα, a transition from one state to another state occurs due to a character code representing a recognition candidate character.

図１５は、禁則辞書の構成を示す図である。禁則辞書記憶部５０は、設計者等が予め作成した禁則辞書を記憶する。 FIG. 15 is a diagram showing a configuration of the prohibition dictionary. The prohibition dictionary storage unit 50 stores a prohibition dictionary created in advance by a designer or the like.

本実施形態において、禁則辞書は、禁止対象の文字列をモデル化した決定性有限オートマトンである。禁止対象の文字列は、入力が禁止される文字列、入力がありえない文字列および不自然な文字列等であり、例えば設計者により定義される。本実施形態では、決定性有限オートマトンである禁則辞書を、ＤＦＡβとも呼ぶ。 In the present embodiment, the prohibition dictionary is a deterministic finite automaton that models a prohibited character string. The prohibited character strings are character strings that are prohibited from being input, character strings that cannot be input, unnatural character strings, and the like, and are defined by, for example, a designer. In this embodiment, the prohibition dictionary that is a deterministic finite automaton is also called DFAβ.

ＤＦＡβは、図１５に示すように、状態数を示す値と、状態の数分の状態レコードを格納する状態配列と、エッジの数分のエッジレコードを格納するエッジ配列を含む。禁則辞書を構成するそれぞれの要素は、図１４に示した知識辞書（ＤＦＡα）と同一であるので詳細な説明は省略する。 As shown in FIG. 15, DFAβ includes a value indicating the number of states, a state array for storing state records for the number of states, and an edge array for storing edge records for the number of edges. Since each element constituting the forbidden dictionary is the same as the knowledge dictionary (DFAα) shown in FIG. 14, detailed description thereof will be omitted.

ＤＦＡβは、設計者が禁止対象の文字列を正規表現で記述し（モデル化し）、その正規表現を第一の非決定性有限オートマトンに変換し、第一の非決定性有限オートマトンを第二の非決定性有限オートマトンに変換し、第二の非決定性有限オートマトンを決定性オートマトンに変換することで得られる。 In DFAβ, a designer describes a prohibited character string in a regular expression (modeled), converts the regular expression into a first nondeterministic finite automaton, and converts the first nondeterministic finite automaton into a second nondeterministic It is obtained by converting to a finite automaton and converting the second non-deterministic finite automaton to a deterministic automaton.

図１６は、第一の非決定性有限オートマトンの一例を示す図である。図１７は、第一の非決定性有限オートマトンを変換することで得られる第二の非決定性有限オートマトンの一例を示す図である。図１６に示すオートマトンでは、開始状態から受理状態までの経路上の入力記号列が、禁止対象の文字列を表す。 FIG. 16 is a diagram illustrating an example of a first nondeterministic finite automaton. FIG. 17 is a diagram illustrating an example of a second non-deterministic finite automaton obtained by converting the first non-deterministic finite automaton. In the automaton shown in FIG. 16, the input symbol string on the path from the start state to the acceptance state represents a character string to be prohibited.

第二の非決定性有限オートマトンは、図１６に示すような第一の非決定性有限オートマトンを、次の手続きで変換することにより得られる。まず、第一の非決定性有限オートマトンに、開始状態から開始状態へと戻り、全ての文字（コード）が入力記号として割り当てられた自己ループのエッジが追加される。続いて、自己ループが追加されたオートマトンに、開始状態を除く全ての状態のそれぞれから開始状態へと戻り、空記号εが入力記号として割り当てられた空エッジ（ε遷移）を追加することで図１７に示すような第二の非決定性有限オートマトンが得られる。 The second non-deterministic finite automaton is obtained by converting the first non-deterministic finite automaton as shown in FIG. 16 by the following procedure. First, the first nondeterministic finite automaton is returned from the start state to the start state, and an edge of a self-loop in which all characters (codes) are assigned as input symbols is added. Subsequently, the automaton with the added self-loop returns to the start state from each of the states other than the start state, and adds an empty edge (ε transition) to which the empty symbol ε is assigned as an input symbol. A second non-deterministic finite automaton as shown in 17 is obtained.

すなわち、第二の非決定性有限オートマトンは、図１７に示すように、開始状態から受理状態までの経路上の入力記号列が、禁止対象の文字列を表す。さらに、第二の非決定性有限オートマトンは、開始状態から開始状態へと戻り全ての文字が入力記号として割り当てられた自己ループのエッジと、開始状態を除く全ての状態のそれぞれから開始状態へと戻り空記号が入力記号として割り当てられた空エッジを含む。 That is, in the second nondeterministic finite automaton, as shown in FIG. 17, the input symbol string on the path from the start state to the accepting state represents a prohibited character string. In addition, the second nondeterministic finite automaton returns from the start state to the start state and returns to the start state from each of the states other than the start state and the edge of the self-loop where all characters are assigned as input symbols. An empty symbol contains an empty edge assigned as an input symbol.

このように構成した第二の非決定性有限オートマトンを決定性有限オートマトンに変換することで得られるＤＦＡβは、禁止対象の文字列をモデル化したものとなっており、禁止対象の文字列を含む文字列を入力した場合に、受理状態に確実に遷移するように構成されている。 DFAβ obtained by converting the second non-deterministic finite automaton configured as described above into a deterministic finite automaton is a model of a prohibited character string, and includes a character string including the prohibited character string. It is configured so that a transition to the accepting state is surely made when the is input.

なお、正規表現から非決定性有限オートマトンまたは決定性有限オートマトンを生成する方法、非決定性有限オートマトンから決定性有限オートマトンを生成する方法は、例えば、Ａ．Ｖ．エイホ，Ｒ．セシィ，Ｊ．Ｄ．ウルマン著、原田健一訳、コンパイラＩ，初版１９８６、ｐｐ．１３４−１７２等に記載されている。 Note that a method for generating a nondeterministic finite automaton or a deterministic finite automaton from a regular expression, and a method for generating a deterministic finite automaton from a nondeterministic finite automaton are described in, for example, A. V. Aiho, R.A. Cessie, J.H. D. Written by Ullman, translated by Kenichi Harada, Compiler I, first edition 1986, pp. 134-172 and the like.

図１８は、照合結果データの構成を示す図である。照合部４６は、文字候補に含まれる認識候補のそれぞれを、先頭から順次に知識辞書と照合して、入力画像の記入枠内に記載されていると推測される文字列と知識辞書を照合して得られる照合結果を生成する。そして、照合部４６は、生成した照合結果を、照合結果記憶部４８に書き込む。これとともに、禁則処理部５２は、照合結果により特定される文字列を、禁則辞書と照合して、禁則辞書の決定性有限オートマトンで受理される文字列を表す照合結果を照合結果記憶部４８から削除する。 FIG. 18 is a diagram illustrating a configuration of the collation result data. The collation unit 46 collates each of the recognition candidates included in the character candidates with the knowledge dictionary sequentially from the top, and collates the character string estimated to be described in the entry frame of the input image with the knowledge dictionary. The verification result obtained is obtained. Then, the matching unit 46 writes the generated matching result in the matching result storage unit 48. At the same time, the forbidden processing unit 52 collates the character string specified by the collation result with the forbidden dictionary, and deletes the collation result representing the character string accepted by the deterministic finite automaton of the forbidden dictionary from the collation result storage unit 48. To do.

照合結果記憶部４８は、照合結果データを記憶する。照合結果データは、それぞれの位置番号毎に、照合結果の個数と、照合結果の配列を含む。 The verification result storage unit 48 stores verification result data. The collation result data includes the number of collation results and an array of collation results for each position number.

照合結果の個数は、当該位置番号に関連付けられた照合結果の個数を表す。照合結果の配列は、当該位置番号に関連付けられた照合結果を格納する。それぞれの照合結果は、状態番号αと、状態番号βと、スコアと、コードと、位置番号および照合結果の番号のペアを含む。また、それぞれの照合結果は、格納先の配列が関連付けられた位置番号および、格納先の配列内での配列要素としての番号のペアで一意に識別される。以降では、照合結果の格納先の配列が関連付けられた位置番号を「照合結果が関連付けられた位置番号」、照合結果の格納先の配列内での配列要素としての番号を「照合結果の番号」と呼称する。 The number of matching results represents the number of matching results associated with the position number. The collation result array stores the collation result associated with the position number. Each collation result includes a pair of a state number α, a state number β, a score, a code, a position number, and a collation result number. Each collation result is uniquely identified by a pair of a position number associated with the storage destination array and a number as an array element in the storage destination array. From now on, the position number associated with the collation result storage array is referred to as "position number associated with the collation result", and the number as the array element in the collation result storage array is referred to as "collation result number". It is called.

状態番号αは、知識辞書（ＤＦＡα）の状態を表す。すなわち、状態番号αは、先頭の認識候補から当該位置の認識候補までのそれぞれの文字に応じて、ＤＦＡαを開始状態から順次に遷移させた場合に到達する状態を示す。 The state number α represents the state of the knowledge dictionary (DFAα). That is, the state number α indicates a state reached when DFAα is sequentially shifted from the start state in accordance with each character from the first recognition candidate to the recognition candidate at the position.

状態番号βは、禁則辞書（ＤＦＡβ）の状態を表す。すなわち、状態番号βは、当該照合結果により特定される文字列に含まれるそれぞれの文字に応じて、ＤＦＡβを開始状態から順次に遷移させた場合に到達する状態を示す。 The state number β represents the state of the forbidden dictionary (DFAβ). That is, the state number β indicates a state reached when DFA β is sequentially shifted from the start state in accordance with each character included in the character string specified by the collation result.

スコアは、先頭の認識候補から当該位置の認識候補までのそれぞれに対応付けられた類似度を累積した値を表す。すなわち、スコアは、先頭の認識候補から当該位置の認識候補までの文字列の尤もらしさを表す。コードは、当該位置の認識候補の文字を表す文字コードである。 The score represents a value obtained by accumulating the degrees of similarity associated with the recognition candidates at the head to the recognition candidates at the position. That is, the score represents the likelihood of the character string from the first recognition candidate to the recognition candidate at the position. The code is a character code representing a recognition candidate character at the position.

位置番号および照合結果の番号のペアは、先頭から当該位置まで１個ずつ文字候補をたどりながら文字候補の認識候補を入力記号としてＤＦＡαを遷移させながら照合結果を生成していく過程における、直前の照合結果が関連付けられた位置番号および、直前の照合結果の番号を表す。位置番号および照合結果の番号のペアは、結果抽出部５４が認識結果の文字列を抽出する際に参照される。 The pair of the position number and the number of the matching result is obtained immediately after the character candidate is traced one by one from the head to the corresponding position while the matching result is generated while the DFAα is transitioned using the character candidate recognition candidate as the input symbol. It represents the position number associated with the collation result and the number of the previous collation result. The pair of the position number and the collation result number is referred to when the result extraction unit 54 extracts the character string of the recognition result.

図１９は、照合処理を示すフロー図である。図２のステップＳ４に示した照合処理の詳細について図１９を参照して説明する。 FIG. 19 is a flowchart showing the matching process. Details of the collation process shown in step S4 of FIG. 2 will be described with reference to FIG.

まず、ステップＳ１１において、照合部４６は、照合結果データを初期化する。具体的には、照合部４６は、照合結果データの全ての位置番号について、照合結果の個数を０に設定するとともに、照合結果の配列を空にする。 First, in step S11, the collation unit 46 initializes collation result data. Specifically, the collation unit 46 sets the number of collation results to 0 for all the position numbers of the collation result data and empties the collation result array.

続いて、ステップＳ１２において、照合部４６は、位置番号０に関連付けて、新たな１つの照合結果を生成する。新たな１つの照合結果は、状態番号αおよび状態番号βが０に、スコアが０に、位置番号および照合結果の番号が−１に、コードが−１に設定される。続いて、ステップＳ１３において、照合部４６は、位置番号０に関連付けられた照合結果の個数を１に設定する。続いて、ステップＳ１４において、照合部４６は、位置番号を表す変数Ｐｓｔに０を代入する。 Subsequently, in step S12, the collation unit 46 generates a new collation result in association with the position number 0. In the new collation result, the state number α and the state number β are set to 0, the score is set to 0, the position number and the collation result number are set to −1, and the code is set to −1. Subsequently, in step S13, the matching unit 46 sets the number of matching results associated with the position number 0 to 1. Subsequently, in step S14, the collation unit 46 substitutes 0 for a variable Pst representing the position number.

続いて、ステップＳ１５において、照合部４６は、Ｐｓｔが、Ｐｓｔｍａｘ以下であるか否かを判断する。Ｐｓｔｍａｘは、最後の位置番号Ｐｅｄから１を減じた値である。照合部４６は、ＰｓｔがＰｓｔｍａｘ以下である場合（ステップＳ１５の真）、処理をステップＳ１６に進める。 Subsequently, in step S15, the collation unit 46 determines whether or not Pst is equal to or less than Pstmax. Pstmax is a value obtained by subtracting 1 from the last position number Ped. When Pst is equal to or less than Pstmax (true in step S15), the collation unit 46 advances the process to step S16.

ステップＳ１６において、照合部４６は、禁則処理部５２を呼び出す。ステップＳ１６において、禁則処理部５２は、位置番号Ｐｓｔに関連付けられたそれぞれの照合結果について、禁則辞書を用いて禁則辞書探索処理を実行する。これにより、禁則処理部５２は、禁止対象の文字列を含む文字列と知識辞書を照合して得られた照合結果を削除することができる。なお、禁則辞書探索処理の詳細については、図２２および図２３を参照して後述する。 In step S 16, the collation unit 46 calls the prohibition processing unit 52. In step S 16, the prohibition processing unit 52 executes a prohibition dictionary search process using the prohibition dictionary for each matching result associated with the position number Pst. Thereby, the prohibition processing unit 52 can delete the collation result obtained by collating the character string including the character string to be prohibited with the knowledge dictionary. Details of the prohibition dictionary search process will be described later with reference to FIGS.

続いて、ステップＳ１７において、照合部４６は、Ｐｓｔに関連付けられた照合結果を、スコアが最上位からＮｐｒ番目までに絞り込む。すなわち、照合部４６は、スコアがＮｐｒ番目より低い照合結果を削除する。 Subsequently, in step S 17, the matching unit 46 narrows down the matching results associated with Pst from the highest score to the Npr-th score. That is, the matching unit 46 deletes the matching result whose score is lower than the Nprth.

続いて、ステップＳ１８において、照合部４６は、Ｐｓｔに関連付けられたそれぞれの照合結果に対して、知識辞書を用いて知識辞書探索処理を実行する。これにより、照合部４６は、Ｐｓｔより後ろの位置番号に関連付けた新たな照合結果を生成することができる。なお、知識辞書探索処理については、図２０および図２１を参照して後述する。 Subsequently, in step S18, the collation unit 46 executes a knowledge dictionary search process using the knowledge dictionary for each collation result associated with Pst. Thereby, the collation part 46 can produce | generate the new collation result linked | related with the position number after Pst. The knowledge dictionary search process will be described later with reference to FIGS. 20 and 21.

続いて、ステップＳ１９において、照合部４６は、Ｐｓｔに１を加算する。照合部４６は、ステップＳ１９を終了すると、処理をステップＳ１５に戻す。そして、照合部４６は、ＰｓｔがＰｓｔｍａｘを超えるまで、ステップＳ１６からステップＳ１９の処理を繰り返す。 Subsequently, in step S19, the collation unit 46 adds 1 to Pst. The collation part 46 will return a process to step S15, after complete | finishing step S19. And the collation part 46 repeats the process of step S16 to step S19 until Pst exceeds Pstmax.

ＰｓｔがＰｓｔｍａｘ以下ではなくなった場合（ステップＳ１５の偽）、照合部４６は、処理をステップＳ２０に進める。ステップＳ２０において、照合部４６は、禁則処理部５２を呼び出す。ステップＳ２０において、禁則処理部５２は、最後の位置番号Ｐｅｄに関連付けられたそれぞれの照合結果について、禁則辞書を用いて禁則辞書探索処理を実行する。これにより、禁則処理部５２は、禁止対象の文字列を知識辞書と照合した結果得られる照合結果を削除することができる。そして、照合部４６は、ステップＳ２０の処理を終えると、本フローを終了する。 When Pst is no longer equal to or less than Pstmax (No in step S15), the collation unit 46 advances the process to step S20. In step S 20, the collation unit 46 calls the prohibition processing unit 52. In step S20, the prohibition processing unit 52 performs a prohibition dictionary search process using the prohibition dictionary for each matching result associated with the last position number Ped. Thereby, the prohibition processing unit 52 can delete the collation result obtained as a result of collating the character string to be prohibited with the knowledge dictionary. And the collation part 46 complete | finishes this flow, after finishing the process of step S20.

図２０は、知識辞書探索処理を示すフローチャートである。図２１は、知識辞書探索処理でのデータアクセスの流れの一例を示す。 FIG. 20 is a flowchart showing the knowledge dictionary search process. FIG. 21 shows an example of the flow of data access in the knowledge dictionary search process.

図２０および図２１を参照しながら、図１９のステップＳ１８の知識辞書探索処理を説明する。まず、ステップＳ３１において、照合部４６は、照合結果データを参照し、Ｐｓｔに関連付けられた全ての照合結果を列挙する。 The knowledge dictionary search process in step S18 of FIG. 19 will be described with reference to FIGS. First, in step S31, the collation unit 46 refers to the collation result data and lists all the collation results associated with Pst.

続いて、ステップＳ３２において、照合部４６は、文字候補データの配列内の文字候補レコードを参照し、Ｐｓｔを始点位置とする全ての文字候補を列挙する。照合部４６は、文字候補マトリクスにおける始点番号がＰｓｔに一致する全てのエントリを走査し、−１以外の文字候補の番号を収集することで、Ｐｓｔを始点位置とする全ての文字候補を列挙することができる。 Subsequently, in step S32, the collation unit 46 refers to the character candidate records in the character candidate data array, and lists all the character candidates having Pst as the starting point position. The collating unit 46 scans all entries whose starting point numbers match Pst in the character candidate matrix and collects all character candidates having Pst as the starting point position by collecting the numbers of character candidates other than -1. be able to.

続いて、照合部４６は、ステップＳ３２で列挙した全ての文字候補レコードのそれぞれに対して、ステップＳ３４〜ステップＳ４８の処理を実行する（ステップＳ３３とステップＳ４９との間のループ処理）。以降ではこのループ処理における処理対象の文字候補レコードに対応する文字候補を「文字候補Ｃｃ」と称する。 Subsequently, the collation unit 46 performs the processing of step S34 to step S48 for each of all the character candidate records listed in step S32 (loop processing between step S33 and step S49). Hereinafter, the character candidate corresponding to the character candidate record to be processed in this loop processing is referred to as “character candidate Cc”.

ステップＳ３４において、照合部４６は、文字候補Ｃｃに対応する文字候補レコードの認識候補の配列を参照し、当該文字候補の全ての認識候補エントリを列挙する。 In step S34, the collation unit 46 refers to the recognition candidate array of the character candidate record corresponding to the character candidate Cc, and lists all recognition candidate entries of the character candidate.

続いて、照合部４６は、ステップＳ３４で列挙した全ての認識候補エントリのそれぞれに対して、ステップＳ３６〜ステップＳ４７の処理を実行する（ステップＳ３５とステップＳ４８との間のループ処理）。以降ではこのループ処理における処理対象の認識候補エントリに対応する認識候補を「認識候補Ｃｒ」と称する。 Subsequently, the collation unit 46 performs the processing of Step S36 to Step S47 for each of all the recognition candidate entries listed in Step S34 (loop processing between Step S35 and Step S48). Hereinafter, the recognition candidate corresponding to the recognition candidate entry to be processed in this loop processing is referred to as “recognition candidate Cr”.

続いて、照合部４６は、ステップＳ３１で列挙した、Ｐｓｔに関連付けられた全ての照合結果のそれぞれに対して、ステップＳ３７〜ステップＳ４６の処理を実行する（ステップＳ３６とステップＳ４７との間のループ処理）。以降ではこのループ処理における処理対象の照合結果を「照合結果Ｍｐ」と称する。 Subsequently, the collation unit 46 performs the processing of step S37 to step S46 on each of all the collation results associated with Pst listed in step S31 (loop between step S36 and step S47). processing). Hereinafter, the verification result to be processed in this loop process is referred to as “matching result Mp”.

ステップＳ３７において、照合部４６は、知識辞書（ＤＦＡα）を参照して、照合結果Ｍｐに含まれる状態番号αに対応する状態レコードを列挙する。 In step S37, the collation unit 46 refers to the knowledge dictionary (DFAα) and enumerates state records corresponding to the state number α included in the collation result Mp.

続いて、ステップＳ３８において、照合部４６は、ステップＳ３７で列挙した状態レコードに含まれるエッジレコードへのポインタおよびエッジレコードの要素数により、状態番号αの状態から出て行くエッジを表すエッジレコードの格納された範囲を特定することで、状態番号αの状態から出て行くエッジを表す全てのエッジレコードを列挙する。 Subsequently, in step S38, the collation unit 46 creates an edge record representing an edge that exits from the state of the state number α based on the pointer to the edge record included in the state record enumerated in step S37 and the number of elements of the edge record. By specifying the stored range, all edge records representing the edges going out from the state of the state number α are listed.

続いて、照合部４６は、ステップＳ３８で列挙した全てのエッジレコードのそれぞれに対して、ステップＳ４０〜ステップＳ４５の処理を実行する（ステップＳ３９とステップＳ４６との間のループ処理）。以降ではこのループ処理における処理対象のエッジレコードを「エッジレコードＥｒ」と称する。 Subsequently, the collation unit 46 performs the processing of Step S40 to Step S45 for each of all the edge records listed in Step S38 (loop processing between Step S39 and Step S46). Hereinafter, the edge record to be processed in this loop processing is referred to as “edge record Er”.

ステップＳ４０において、照合部４６は、認識候補Ｃｒの認識候補エントリに設定された文字コードと、エッジレコードＥｒに設定された文字コードとが一致するか否かを判断する。一致しない場合（ステップＳ４０のＮｏ）、照合部４６は、次のエッジレコードに処理を移し、ステップＳ４０からの処理を繰り返す。一致する場合（ステップＳ４０のＹｅｓ）、照合部４６は、処理をステップＳ４１に進める。 In step S40, the collation unit 46 determines whether or not the character code set in the recognition candidate entry of the recognition candidate Cr matches the character code set in the edge record Er. If they do not match (No in step S40), the collation unit 46 moves the process to the next edge record and repeats the process from step S40. If they match (Yes in step S40), the collation unit 46 advances the process to step S41.

ステップＳ４１において、照合部４６は、文字候補Ｃｃの文字候補レコードの終点位置に関連付けて新しい照合結果Ｍｎを生成して、照合結果データに書き込む。 In step S41, the collation unit 46 generates a new collation result Mn in association with the end point position of the character candidate record of the character candidate Cc, and writes it in the collation result data.

続いて、ステップＳ４２において、照合部４６は、新しい照合結果Ｍｎに状態番号αとして、エッジレコードＥｒに設定された状態番号（遷移先の状態番号）を設定する。また、照合部４６は、新しい照合結果Ｍｎに状態番号βとして−１を設定する。 Subsequently, in step S42, the collation unit 46 sets the state number (transition destination state number) set in the edge record Er as the state number α in the new collation result Mn. The collation unit 46 sets −1 as the state number β in the new collation result Mn.

続いて、ステップＳ４３において、照合部４６は、新しい照合結果Ｍｎにコードとして、認識候補Ｃｒの認識候補エントリに設定された文字コードを設定する。 Subsequently, in step S43, the collation unit 46 sets the character code set in the recognition candidate entry of the recognition candidate Cr as a code for the new collation result Mn.

続いて、ステップＳ４４において、照合部４６は、新しい照合結果Ｍｎに位置番号として、照合結果Ｍｐが関連付けられた位置番号Ｐｓｔを設定する。また、照合部４６は、新しい照合結果Ｍｎに照合結果の番号として、照合結果Ｍｐの番号を格納する。 Subsequently, in step S44, the collation unit 46 sets the position number Pst associated with the collation result Mp as the position number for the new collation result Mn. The collation unit 46 stores the number of the collation result Mp as the collation result number in the new collation result Mn.

続いて、ステップＳ４５において、照合部４６は、新しい照合結果Ｍｎにスコアとして、照合結果Ｍｐに格納されたスコアと、認識候補Ｃｒの認識候補エントリに格納された類似度を加算した値を設定する。 Subsequently, in step S45, the matching unit 46 sets a value obtained by adding the score stored in the matching result Mp and the similarity stored in the recognition candidate entry of the recognition candidate Cr as a score to the new matching result Mn. .

ステップＳ４６において、照合部４６は、全てのエッジレコードについて、ステップＳ４０〜ステップＳ４５の処理を終えると、ループを抜けて処理をステップＳ４７に進める。 In step S46, the collation unit 46 exits from the loop and proceeds to step S47 after completing the process of steps S40 to S45 for all edge records.

ステップＳ４７において、照合部４６は、Ｐｓｔに関連付けられた全ての照合結果について、ステップＳ３７〜ステップＳ４６の処理を終えると、ループを抜けて処理をステップＳ４８に進める。 In step S47, the collation unit 46 exits from the loop and proceeds to step S48 after completing the processes of step S37 to step S46 for all the collation results associated with Pst.

ステップＳ４８において、照合部４６は、文字候補Ｃｃに対応する全ての認識候補エントリについて、ステップＳ３６〜ステップＳ４７の処理を終えると、ループを抜けて処理をステップＳ４９に進める。 In step S48, the collation unit 46 exits from the loop and proceeds to step S49 after completing the process of steps S36 to S47 for all the recognition candidate entries corresponding to the character candidate Cc.

そして、ステップＳ４９において、照合部４６は、全ての文字候補レコードについて、ステップＳ３４〜ステップＳ４８の処理を終えると、ループを抜けて、本フローを終了する。 In step S49, the collation unit 46 exits from the loop and ends the present flow after completing the processes in steps S34 to S48 for all the character candidate records.

このように照合部４６は、第１の文字候補の照合結果に、照合により到達した知識辞書（ＤＦＡα）の状態を示す番号（状態番号α）を書き込む。そして、照合部４６は、第１の文字候補に続く第２の文字候補を知識辞書（ＤＦＡα）と照合する際、第１の文字候補の照合結果に書き込まれた番号（状態番号α）に示される状態から第２の文字候補の認識候補による状態遷移に対応するエッジを辿ることで第２の文字候補を照合する。 As described above, the collation unit 46 writes the number (state number α) indicating the state of the knowledge dictionary (DFAα) reached by collation in the collation result of the first character candidate. Then, when the collation unit 46 collates the second character candidate following the first character candidate with the knowledge dictionary (DFAα), the collation unit 46 indicates the number (state number α) written in the collation result of the first character candidate. The second character candidate is collated by following the edge corresponding to the state transition by the recognition candidate of the second character candidate.

図２２は、禁則辞書探索処理を示すフローチャートである。図２３は、禁則辞書探索処理でのデータの流れを示す。 FIG. 22 is a flowchart showing the prohibition dictionary search process. FIG. 23 shows a data flow in the forbidden dictionary search process.

図２２および図２３を参照しながら、禁則辞書探索処理を説明する。まず、ステップＳ５１において、禁則処理部５２は、照合結果データを参照し、Ｐｓｔに関連付けられた全ての照合結果を列挙する。 The forbidden dictionary search process will be described with reference to FIGS. First, in step S51, the prohibition processing unit 52 refers to the collation result data and lists all the collation results associated with Pst.

続いて、禁則処理部５２は、ステップＳ５１で列挙した全ての照合結果のそれぞれに対して、ステップＳ５３〜ステップＳ６２の処理を実行する（ステップＳ５２とステップＳ６３との間のループ処理）。以降ではこのループ処理における処理対象の照合結果を「照合結果Ｍｔ」と称する。 Subsequently, the prohibition processing unit 52 executes the processing of step S53 to step S62 for each of all the matching results listed in step S51 (loop processing between step S52 and step S63). Hereinafter, the verification result to be processed in this loop process is referred to as “matching result Mt”.

ステップＳ５３において、禁則処理部５２は、照合結果Ｍｔに格納された位置番号および照合結果の番号により特定される直前の照合結果Ｍを取得する。 In step S53, the prohibition processing unit 52 acquires the matching result M immediately before specified by the position number and the matching result number stored in the matching result Mt.

続いて、ステップＳ５４において、禁則処理部５２は、禁則辞書（ＤＦＡβ）を参照して、直前の照合結果Ｍに格納された状態番号βに対応する状態レコードを取得する。 Subsequently, in step S54, the prohibition processing unit 52 refers to the prohibition dictionary (DFAβ), and acquires a state record corresponding to the state number β stored in the previous matching result M.

続いて、ステップＳ５５において、禁則処理部５２は、ステップＳ５４で取得した状態レコードに格納されたエッジレコードへのポインタおよびエッジレコードの要素数により、から、状態番号βの状態から出て行くエッジを表すエッジレコードが格納された範囲を特定することで、状態番号βの状態から出て行くエッジを表す全てのエッジレコードを列挙する。 Subsequently, in step S55, the prohibition processing unit 52 determines an edge that exits from the state of the state number β based on the pointer to the edge record stored in the state record acquired in step S54 and the number of elements of the edge record. By specifying the range in which the edge records to be represented are stored, all edge records representing the edges going out from the state of the state number β are listed.

続いて、ステップＳ５６とステップＳ５８との間のループ処理において、禁則処理部５２は、ステップＳ５５で列挙した全てのエッジレコードのそれぞれに対して、ステップＳ５７の判断処理を実行する。以降ではこのループ処理における処理対象のエッジレコードを「エッジレコードＥｔ」と称する。 Subsequently, in the loop process between step S56 and step S58, the prohibition processing unit 52 executes the determination process of step S57 for each of all the edge records listed in step S55. Hereinafter, the edge record to be processed in this loop processing is referred to as “edge record Et”.

ステップＳ５７において、禁則処理部５２は、処理対象の照合結果Ｍｔに格納された文字コードと、エッジレコードＥｔに格納された文字コードが一致するか否かを判断する。一致しない場合（ステップＳ５７のＮｏ）、禁則処理部５２は、次のエッジレコードの処理に移り、ステップＳ５７の処理を繰り返す。一致する場合（ステップＳ５７のＹｅｓ）、禁則処理部５２は、ステップＳ５６とステップＳ５８との間のループ処理を抜けて、処理をステップＳ５９に進める。 In step S57, the prohibition processing unit 52 determines whether or not the character code stored in the verification result Mt to be processed matches the character code stored in the edge record Et. If they do not match (No in step S57), the prohibition processing unit 52 proceeds to the next edge record process and repeats the process in step S57. If they match (Yes in step S57), the forbidden processing unit 52 exits the loop process between step S56 and step S58, and advances the process to step S59.

ステップＳ５９において、禁則処理部５２は、エッジレコードＥｔの遷移先の状態番号により特定される状態レコードＳｔの受理状態フラグを確認し、状態レコードＳｔに対応するＤＦＡβの状態が受理状態であるか否かを判断する。 In step S59, the prohibition processing unit 52 confirms the acceptance state flag of the state record St specified by the state number of the transition destination of the edge record Et, and whether or not the state of DFAβ corresponding to the state record St is an acceptance state. Determine whether.

受理状態である場合（ステップＳ５９のＹｅｓ）、ステップＳ６１において、禁則処理部５２は、照合結果Ｍｔを削除する。受理状態ではない場合（ステップＳ５９のＮｏ）、ステップＳ６０において、禁則処理部５２は、照合結果Ｍｔに状態番号βとして、エッジレコードＥｔの遷移先の状態番号を設定する。 When it is in the accepting state (Yes in step S59), in step S61, the prohibition processing unit 52 deletes the matching result Mt. When it is not in the accepting state (No in step S59), in step S60, the prohibition processing unit 52 sets the state number of the transition destination of the edge record Et as the state number β in the matching result Mt.

また、禁則処理部５２は、ステップＳ５６とステップＳ５８との間のループ処理において、全てのエッジレコードについて、文字コードが一致しない場合、処理をステップＳ６２に進める。ステップＳ６２において、禁則処理部５２は、照合結果Ｍｔに状態番号βとして、初期状態を表す０を設定する。 In the loop processing between step S56 and step S58, the prohibition processing unit 52 advances the processing to step S62 when the character codes do not match for all edge records. In step S62, the prohibition processing unit 52 sets 0 representing the initial state as the state number β in the matching result Mt.

ステップＳ６０、ステップＳ６１またはステップＳ６２の処理を終えると、禁則処理部５２は、次の照合結果について、ステップＳ５２から処理を繰り返す。 When the process of step S60, step S61, or step S62 is completed, the prohibition processing unit 52 repeats the process from step S52 for the next matching result.

そして、ステップＳ６３において、禁則処理部５２は、全ての照合結果について、ステップＳ５３〜ステップＳ６２の処理を終えると、ループを抜けて、本フローを終了する。 In step S63, the forbidden processing unit 52 exits the loop after completing the processing of step S53 to step S62 for all the matching results, and ends this flow.

このように禁則処理部５２は、第１の照合結果により特定される文字列を照合することにより到達した禁則辞書（ＤＦＡβ）の状態を示す番号（状態番号β）を第１の照合結果に書き込む。そして、禁則処理部５２は、第１の照合結果に続く第２の文字候補を知識辞書（ＤＦＡα）と照合して得られた第２の照合結果により特定される文字列を照合する際、第１の照合結果に書き込まれた番号に示されるＤＦＡβの状態から第２の文字候補の認識候補による状態遷移に対応するエッジを辿ることで、第２の照合結果により特定される文字列を照合する。 As described above, the prohibition processing unit 52 writes the number (state number β) indicating the state of the prohibition dictionary (DFAβ) reached by collating the character string specified by the first collation result into the first collation result. . Then, when the prohibition processing unit 52 collates the character string specified by the second collation result obtained by collating the second character candidate following the first collation result with the knowledge dictionary (DFAα), The character string specified by the second collation result is collated by following the edge corresponding to the state transition by the recognition candidate of the second character candidate from the state of DFAβ indicated by the number written in the collation result of 1. .

図２４は、結果抽出部で行われる結果抽出の処理の流れを示す図である。図２５は、結果抽出において参照されるデータとスタック上に積まれる文字コードの様子を示す図である。 FIG. 24 is a diagram illustrating a flow of result extraction processing performed by the result extraction unit. FIG. 25 is a diagram illustrating data referred to in the result extraction and a state of character codes stacked on the stack.

図２のステップＳ５に示した結果抽出は結果抽出部５４で行われる。以降では、結果抽出の詳細について図２４および図２５を参照して説明する。まず、ステップＳ７０において結果抽出部５４は、文字候補の個数が０であるか否か確認し、文字候補の個数が０の場合はステップＳ８４において認識結果の文字列を空文字列として本フローを終了する。文字候補の個数が０でない場合は、ステップＳ７１において、結果抽出部５４は、最後の位置番号Ｐｅｄに関連付けられた全ての照合結果を列挙した上で、ステップＳ７２以降の処理を実行する。 The result extraction shown in step S5 of FIG. Hereinafter, details of the result extraction will be described with reference to FIGS. 24 and 25. First, in step S70, the result extraction unit 54 checks whether or not the number of character candidates is 0. If the number of character candidates is 0, the flow ends with the recognition result character string as an empty character string in step S84. To do. If the number of character candidates is not 0, in step S71, the result extraction unit 54 enumerates all the collation results associated with the last position number Ped, and then executes the processing after step S72.

続いて、ステップＳ７２において、結果抽出部５４は、ステップＳ７１において列挙した照合結果のそれぞれについて、知識辞書（ＤＦＡα）から状態番号αに対応する状態レコードを取得して、受理状態フラグを確認する。 Subsequently, in step S72, the result extraction unit 54 acquires a state record corresponding to the state number α from the knowledge dictionary (DFAα) for each of the matching results listed in step S71, and confirms the acceptance state flag.

続いて、ステップＳ７３において、結果抽出部５４は、状態番号αに対応する状態が受理状態の照合結果があるかを判断する。以降では、状態番号αに対応する状態が受理状態の照合結果を「ＤＦＡαで受理状態の照合結果」と称する。ＤＦＡαで受理状態の照合結果がある場合には（ステップＳ７３のＹｅｓ）、ステップＳ７４において、結果抽出部５４は、ＤＦＡαで受理状態の照合結果のうち、スコアが最大の照合結果を照合結果Ｍｘとして選択する。ＤＦＡαで受理状態の照合結果が無い場合には（ステップＳ７３のＮｏ）、ステップＳ７５において、結果抽出部５４は、列挙した全ての照合結果のうち、スコアが最大の照合結果を照合結果Ｍｘとして選択する。 Subsequently, in step S73, the result extraction unit 54 determines whether there is a collation result in which the state corresponding to the state number α is an accepted state. Hereinafter, the collation result in which the state corresponding to the state number α is the accepting state is referred to as “the collation result of the accepting state with DFAα”. When there is a matching result in the accepted state in DFAα (Yes in step S73), in step S74, the result extracting unit 54 sets the matching result having the maximum score among the matching results in the accepted state in DFAα as the matching result Mx. select. When there is no collation result in the accepted state in DFAα (No in step S73), in step S75, the result extraction unit 54 selects the collation result having the maximum score as the collation result Mx among all the collation results listed. To do.

ステップＳ７４またはステップＳ７５の処理に続いて、ステップＳ７６において、結果抽出部５４は、位置番号を表す変数ｐに、選択した照合結果Ｍｘが関連付けられた位置番号ｐｘを代入する。また、結果抽出部５４は、照合結果の番号を表す変数ｍに、選択した照合結果Ｍｘの番号ｍｘを代入する。 Subsequent to step S74 or step S75, in step S76, the result extraction unit 54 substitutes the position number px associated with the selected matching result Mx for the variable p representing the position number. Further, the result extraction unit 54 substitutes the number mx of the selected matching result Mx into the variable m representing the number of the matching result.

続いて、ステップＳ７７において、結果抽出部５４は、ＦＩＬＯ（First In Last Out）メモリであるスタックを空にする。 Subsequently, in step S77, the result extraction unit 54 empties the stack which is a FILO (First In Last Out) memory.

続いて、ステップＳ７８において、ｐとｍとが指す照合結果のコードが−１であるかを判断する。ｐとｍとが指す照合結果のコードが−１ではない場合（ステップＳ７８の偽）、結果抽出部５４は、処理をステップＳ７９に進める。 Subsequently, in step S78, it is determined whether the collation result code indicated by p and m is -1. When the code of the collation result indicated by p and m is not −1 (false in step S78), the result extraction unit 54 advances the process to step S79.

ステップＳ７９において、結果抽出部５４は、ｐとｍとが指す照合結果に格納されているコードをスタックに積む。続いて、ステップＳ８０において、結果抽出部５４は、ｐにｐとｍとが指す照合結果に格納された位置番号を、ｍにｐとｍとが指す照合結果に格納された照合結果の番号を代入する。 In step S79, the result extraction unit 54 loads the code stored in the matching result indicated by p and m on the stack. Subsequently, in step S80, the result extraction unit 54 sets the position number stored in the matching result indicated by p and m to p, and the number of the matching result stored in the matching result indicated by p and m in p. substitute.

そして、結果抽出部５４は、ステップＳ８０の処理を終えると、処理をステップＳ７８に戻して、ｐとｍとが指す照合結果に格納されたコードが−１となるまで、ステップＳ７９とステップＳ８０の処理を繰り返す。これにより、結果抽出部５４は、図２５に示すように、文字列の末尾から順に文字コードを選択して、スタックに積み上げていくことができる。 And the result extraction part 54 complete | finishes the process of step S80, returns a process to step S78, and until the code stored in the collation result which p and m point to becomes -1, the process of step S79 and step S80 is carried out. Repeat the process. Thereby, as shown in FIG. 25, the result extraction unit 54 can select character codes in order from the end of the character string and accumulate them in the stack.

ｐとｍとが指す照合結果のコードが−１である場合（ステップＳ７８の真）、すなわち、位置番号が０に関連付けられた照合結果を指す場合には、結果抽出部５４は、処理をステップＳ８１に進める。ステップＳ８１において、結果抽出部５４は、メモリに格納された認識結果の文字列を空文字列に初期化する。 When the code of the matching result pointed to by p and m is −1 (true in step S78), that is, when the position number indicates the matching result associated with 0, the result extraction unit 54 performs the process. Proceed to S81. In step S81, the result extraction unit 54 initializes the character string of the recognition result stored in the memory to an empty character string.

続いて、ステップＳ８２において、スタックが空であるかを判断する。スタックが空ではない場合（ステップＳ８２の偽）、結果抽出部５４は、ステップＳ８３において、スタックのトップからコードを１つ取り出して、メモリに格納された認識結果の文字列の末尾に追加する。 Subsequently, in step S82, it is determined whether the stack is empty. If the stack is not empty (false in step S82), the result extraction unit 54 extracts one code from the top of the stack and adds it to the end of the character string of the recognition result stored in the memory in step S83.

ステップＳ８３の処理を終えると、結果抽出部５４は、処理をステップＳ８２に戻して、スタックが空になるまで、ステップＳ８３の処理を繰り返す。これにより、結果抽出部５４は、文字列の先頭から末尾までを生成することができる。 When the process of step S83 is completed, the result extraction unit 54 returns the process to step S82 and repeats the process of step S83 until the stack becomes empty. As a result, the result extraction unit 54 can generate the character string from the beginning to the end.

そして、結果抽出部５４は、スタックが空となった場合（ステップＳ８２の真）、本フローの処理を終了する。 Then, when the stack becomes empty (true in step S82), the result extraction unit 54 ends the process of this flow.

以上のように、本実施形態に係る認識装置１０は、禁止対象の文字列をモデル化した禁則辞書を用いて、禁止対象の文字列を含む文字列と知識辞書を照合して得られた照合結果を削除する。これにより、本実施形態に係る認識装置１０で、使用されることが無い文字列および使用することが禁止される文字列等を効率良く排除して、文字列を精度良く認識することができる。 As described above, the recognition apparatus 10 according to the present embodiment uses the prohibition dictionary that models the prohibited character string, and collates the character string including the prohibited character string with the knowledge dictionary. Delete the result. Thereby, the recognition apparatus 10 according to the present embodiment can efficiently eliminate character strings that are not used and character strings that are prohibited from being used, and recognize character strings with high accuracy.

（変形例）
変形例に係る認識装置１０は、知識辞書および照合結果データの形式と、照合部４６、禁則処理部５２および結果抽出部５４の働きが異なる点を除いては、図１から図２５を参照して説明した構成と同様である。以下、変形例に係る認識装置１０について、図１から図２５を参照して説明した構成との相違点を説明する。 (Modification)
The recognition apparatus 10 according to the modified example refers to FIG. 1 to FIG. 25 except that the knowledge dictionary and the format of the verification result data are different from the functions of the verification unit 46, the prohibition processing unit 52, and the result extraction unit 54. The configuration is the same as described above. Hereinafter, the recognition apparatus 10 according to the modification will be described with respect to differences from the configuration described with reference to FIGS.

図２６は、変形例に係る認識装置１０で認識結果として用いられる文字とそれらの文字の種別を表す記号の一例を示す図である。図２７は、変形例に係る認識装置１０で認識される文字列を文字の種別を表す記号の列として表した知識辞書の内容の一例を示す図である。 FIG. 26 is a diagram illustrating an example of characters used as recognition results in the recognition apparatus 10 according to the modification and symbols representing the types of those characters. FIG. 27 is a diagram illustrating an example of the contents of a knowledge dictionary in which a character string recognized by the recognition apparatus 10 according to the modification is represented as a symbol string representing a character type.

変形例に係る認識装置１０は、文字の種別を現す記号を並べた文字列をモデル化した知識辞書を用いる。文字の種別を現す記号は、一例として、１から９までの数字を示す記号「Ｎ」、０から９までの数字を現す記号「ｎ」、ハイフンを現す記号「−」、マンション名、アパート名、棟名に使われる文字を現す記号「Ｍ」を用いる。知識辞書は、図２７に示されるような、これらの記号を並べたワイルドカード文字列が含まれている。 The recognition apparatus 10 according to the modification uses a knowledge dictionary that models a character string in which symbols representing character types are arranged. For example, the symbols indicating the type of character include a symbol “N” indicating a number from 1 to 9, a symbol “n” indicating a number from 0 to 9, a symbol “−” indicating a hyphen, an apartment name, and an apartment name. The symbol “M” representing the letter used for the building name is used. The knowledge dictionary includes a wild card character string in which these symbols are arranged as shown in FIG.

図２８は、変形例に係る照合結果データの一例を示す図である。本変形例において、照合結果データに含まれる照合結果のそれぞれは、スコアｓと、置換え済み文字数ｃと、ワイルドカード文字列を部分的に認識結果で置き換えた文字列ｗとの組を含む。 FIG. 28 is a diagram illustrating an example of collation result data according to the modification. In this modification, each of the matching results included in the matching result data includes a set of a score s, the number of replaced characters c, and a character string w in which a wild card character string is partially replaced with a recognition result.

照合結果データは、位置番号のそれぞれに対応して、照合結果が格納される配列と、配列の要素数を含む。なお、図２８において、照合結果の文字列ｗのアンダーラインを示した文字は、ワイルドカードから認識結果に置き換えられた部分を示す。 The collation result data includes an array in which the collation result is stored and the number of elements of the array corresponding to each position number. Note that in FIG. 28, the character that indicates the underline of the character string w of the collation result indicates a portion in which the wild card is replaced with the recognition result.

本変形例において照合部４６は、先ず、知識辞書に含まれる文字列のそれぞれを複写し、位置番号０の照合結果の配列の要素に設定することで、位置番号０の照合結果の配列を初期化する。それぞれの照合結果には、スコアとして０、置換え済み文字数として０、文字列ｗとして知識辞書から複写した文字列が設定される。続いて、照合部４６は、位置番号Ｐｓｔを１からＰｓｔｍａｘまで順に１ずつ増加させながら、位置番号Ｐｓｔ＋１以降の照合結果を生成する。 In this modification, the collation unit 46 first copies each character string included in the knowledge dictionary and sets it as an element of the collation result array of position number 0, thereby initializing the collation result array of position number 0. Turn into. In each collation result, 0 is set as the score, 0 is the number of replaced characters, and a character string copied from the knowledge dictionary is set as the character string w. Subsequently, the collation unit 46 generates a collation result after the position number Pst + 1 while sequentially increasing the position number Pst by 1 from 1 to Pstmax.

本変形例において、照合部４６は、知識辞書探索処理において、照合結果の置換え済み文字数ｃを取得し、当該照合結果の文字列ｗのｃ番目の文字が文字候補Ｃｃの認識候補Ｃｒの文字コードと一致する場合は、文字候補Ｃｃの終点番号と等しい位置番号に関連付けて、新しい照合結果Ｍｎを生成する。新しい照合結果Ｍｎには、直前の照合結果のスコアに認識候補Ｃｒの類似度を加算したスコアと、置き換え済みの文字数ｃ＋１と、照合結果の文字列ｗのｃ番目の文字を認識候補Ｃｒに置き換えた文字列とが設定される。 In this modification, the collation unit 46 acquires the number c of replaced characters of the collation result in the knowledge dictionary search process, and the c-th character of the character string w of the collation result is the character code of the recognition candidate Cr of the character candidate Cc. Is matched with the position number equal to the end point number of the character candidate Cc, a new matching result Mn is generated. In the new collation result Mn, the score obtained by adding the similarity of the recognition candidate Cr to the score of the previous collation result, the number of replaced characters c + 1, and the c-th character in the character string w of the collation result are replaced with the recognition candidate Cr. Set to a character string.

本変形例において、禁則処理部５２は、禁則辞書探索処理において、照合結果の置換え済み文字数ｃを取得し、照合結果の文字列のｃ−１番目の文字までの部分文字列が禁則辞書に格納されたＤＦＡβで受理されるか否か確認する。そして、禁則処理部５２は、受理される場合、その照合結果を削除し、受理されない場合にはその照合結果を残存させる。 In this modification, the prohibition processing unit 52 acquires the number c of replaced characters in the matching result in the prohibition dictionary search process, and stores the partial character string up to the c-1th character in the matching result character string in the prohibition dictionary. It is confirmed whether or not it is accepted by the registered DFAβ. Then, the prohibition processing unit 52 deletes the matching result when it is accepted, and leaves the matching result when it is not accepted.

このように、認識装置１０は、認識対象の文字列をモデル化した知識辞書と文字候補を照合することで照合結果を得る方法であれば、どのような方法で文字列を照合してもよい。また、認識装置１０は、禁止対象の文字列を照合結果から検出する方法であれば、どのような方法で禁止対象の文字列を検出してもよい。 As described above, the recognition apparatus 10 may collate character strings by any method as long as it obtains a collation result by collating a knowledge dictionary that models a character string to be recognized with character candidates. . The recognition device 10 may detect the prohibited character string by any method as long as it is a method for detecting the prohibited character string from the collation result.

図２９は、実施形態に係る認識装置１０のハードウェア構成を示す図である。 FIG. 29 is a diagram illustrating a hardware configuration of the recognition apparatus 10 according to the embodiment.

認識装置１０は、プログラムを実行可能な一般的なコンピュータシステムにより実現することができる。認識装置１０は、一例として、ディスプレイ１１０と、キーボード１１２と、スキャナ１１４と、外部記憶装置１１６と、通信装置１１８と、コンピュータ１２０を備える。 The recognition apparatus 10 can be realized by a general computer system that can execute a program. As an example, the recognition device 10 includes a display 110, a keyboard 112, a scanner 114, an external storage device 116, a communication device 118, and a computer 120.

ディスプレイ１１０は、表示装置であり、認識した文字列等を表示する。キーボード１１２は、入力装置であり、ユーザからの操作を受け付けて情報を入力する。スキャナ１１４は、用紙等に記載された情報を読み取って入力画像等を取得する。外部記憶装置１１６は、ハードディスクドライブまたは光ディスクドライブ等であり、各種の情報を記憶する。通信装置１１８は、インターネット等を介して外部のコンピュータ等と情報を入出力し、例えば入力画像を外部から取得したり、文字列を外部へと出力したりする。 The display 110 is a display device and displays a recognized character string or the like. The keyboard 112 is an input device, and receives information from a user and inputs information. The scanner 114 obtains an input image or the like by reading information written on a sheet or the like. The external storage device 116 is a hard disk drive or an optical disk drive, and stores various types of information. The communication device 118 inputs and outputs information with an external computer or the like via the Internet or the like, and acquires, for example, an input image from the outside or outputs a character string to the outside.

コンピュータ１２０は、一例として、ＣＰＵ１２２と、入出力制御部１２４と、記憶装置１２６を有する。ＣＰＵ１２２、入出力制御部１２４および記憶装置１２６は、バス１２８により接続される。 As an example, the computer 120 includes a CPU 122, an input / output control unit 124, and a storage device 126. The CPU 122, the input / output control unit 124, and the storage device 126 are connected by a bus 128.

ＣＰＵ１２２は、プログラムを実行して認識装置１０の全体の制御をする。入出力制御部１２４は、ディスプレイ１１０、キーボード１１２、スキャナ１１４、外部記憶装置１１６および通信装置１１８等とのインターフェイスである。また、入出力制御部１２４は、バス１２８を介したデータ転送等も制御する。 The CPU 122 executes a program to control the entire recognition apparatus 10. The input / output control unit 124 is an interface with the display 110, the keyboard 112, the scanner 114, the external storage device 116, the communication device 118, and the like. The input / output control unit 124 also controls data transfer via the bus 128.

記憶装置１２６は、ＲＯＭ、ＲＡＭまたはハードディスクドライブ等を含む。記憶装置１２６では、同一のアドレス空間により、ＲＯＭ、ＲＡＭまたはハードディスクドライブ等の何れのデバイスに対してもアクセスが可能である。記憶装置１２６は、プログラム、入力画像、様式データ、辞書データ（文字認識辞書、知識辞書および禁則辞書）、および、作業データ（文字候補および照合結果）等を記憶する。これらのデータは、記憶装置を構成する何れのデバイス（ＲＯＭ、ＲＡＭおよびハードディスクドライブ）に記憶されていてもよい。また、これらのデータは、一部または全部が、外部記憶装置１１６、または、通信装置１１８を介してアクセスされるサーバ等に記憶されていてもよい。 The storage device 126 includes a ROM, a RAM, a hard disk drive, or the like. In the storage device 126, any device such as a ROM, a RAM, or a hard disk drive can be accessed by the same address space. The storage device 126 stores programs, input images, style data, dictionary data (character recognition dictionary, knowledge dictionary and prohibition dictionary), work data (character candidates and collation results), and the like. These data may be stored in any device (ROM, RAM, and hard disk drive) constituting the storage device. In addition, part or all of these data may be stored in a server or the like accessed via the external storage device 116 or the communication device 118.

本実施形態の認識装置１０で実行されるプログラムは、インストール可能な形式又は実行可能な形式のファイルでＣＤ−ＲＯＭ、フレキシブルディスク（ＦＤ）、ＣＤ−Ｒ、ＤＶＤ等のコンピュータで読み取り可能な記録媒体に記録されて提供される。また、本実施形態の認識装置１０で実行されるプログラムを、インターネット等のネットワークに接続されたコンピュータ上に格納し、ネットワーク経由でダウンロードさせることにより提供するように構成してもよい。また、本実施形態の認識装置１０で実行されるプログラムをインターネット等のネットワーク経由で提供または配布するように構成してもよい。 The program executed by the recognition apparatus 10 of the present embodiment is an installable or executable file, and is a computer-readable recording medium such as a CD-ROM, a flexible disk (FD), a CD-R, or a DVD. Recorded and provided. The program executed by the recognition apparatus 10 of the present embodiment may be configured to be stored by being stored on a computer connected to a network such as the Internet and downloaded via the network. Further, the program executed by the recognition apparatus 10 of the present embodiment may be configured to be provided or distributed via a network such as the Internet.

本実施形態の認識装置１０で実行されるプログラムは、上述した各部（入力部３０、候補検出部３６、認識部４２、照合部４６、禁則処理部５２、結果抽出部５４および出力部５６）を含むモジュール構成となっており、実際のハードウェアとしてはＣＰＵ１２２（プロセッサ）が上記記憶媒体からプログラムを読み出して実行することにより上記各部が主記憶装置上にロードされ、入力部３０、候補検出部３６、認識部４２、照合部４６、禁則処理部５２、結果抽出部５４および出力部５６が記憶装置１２６上に生成されるようになっている。なお、入力部３０、候補検出部３６、認識部４２、照合部４６、禁則処理部５２、結果抽出部５４および出力部５６は、一部または全部がハードウェアで構成されていてもよい。 The program executed by the recognition apparatus 10 of this embodiment includes the above-described units (the input unit 30, the candidate detection unit 36, the recognition unit 42, the collation unit 46, the prohibition processing unit 52, the result extraction unit 54, and the output unit 56). As the actual hardware, the CPU 122 (processor) reads the program from the storage medium and executes it to load the respective units onto the main storage device, and the input unit 30 and the candidate detection unit 36. The recognition unit 42, the collation unit 46, the prohibition processing unit 52, the result extraction unit 54, and the output unit 56 are generated on the storage device 126. The input unit 30, the candidate detection unit 36, the recognition unit 42, the collation unit 46, the prohibition processing unit 52, the result extraction unit 54, and the output unit 56 may be partially or entirely configured by hardware.

本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、請求の範囲に記載された発明とその均等の範囲に含まれる。 Although several embodiments of the present invention have been described, these embodiments are presented by way of example and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.

１０認識装置
３０入力部
３２入力画像記憶部
３４様式データ記憶部
３６候補検出部
３８候補記憶部
４０文字認識辞書記憶部
４２認識部
４４知識辞書記憶部
４６照合部
４８照合結果記憶部
５０禁則辞書記憶部
５２禁則処理部
５４結果抽出部
５６出力部
１１０ディスプレイ
１１２キーボード
１１４スキャナ
１１６外部記憶装置
１１８通信装置
１２０コンピュータ
１２２ＣＰＵ
１２４入出力制御部
１２６記憶装置
１２８バス DESCRIPTION OF SYMBOLS 10 Recognition apparatus 30 Input part 32 Input image memory | storage part 34 Style data memory | storage part 36 Candidate detection part 38 Candidate memory | storage part 40 Character recognition dictionary memory | storage part 42 Recognition part 44 Knowledge dictionary memory | storage part 46 Collation part 48 Collation result memory | storage part 50 Forbidden dictionary memory | storage Unit 52 prohibition processing unit 54 result extraction unit 56 output unit 110 display 112 keyboard 114 scanner 116 external storage device 118 communication device 120 computer 122 CPU
124 I / O controller 126 Storage device 128 Bus

Claims

A candidate detection unit that detects a character candidate that is a set of pixels estimated to include a character from the input image;
A recognition unit that recognizes each of the character candidates and generates at least one recognition candidate that is a candidate character of a recognition result;
Each of the at least one recognition candidate is collated with a knowledge dictionary obtained by modeling a character string to be recognized, and a collation result obtained by collating a character string estimated to be included in the input image with a knowledge dictionary is obtained. A verification unit to be generated;
Among the matching results, a prohibition processing unit for deleting a matching result obtained by matching a character string including a prohibited character string and a knowledge dictionary;
A recognition device comprising:

The matching unit generates a matching result including a score representing the likelihood of the corresponding character string,
The recognition apparatus according to claim 1, further comprising: a result extraction unit that selects one matching result based on the score from the matching result and extracts a character string specified by the selected matching result.

The recognition apparatus according to claim 1, wherein the knowledge dictionary is a finite automaton that models a character string to be recognized.

The knowledge dictionary is a first deterministic finite automaton;
The collation unit
Write a number indicating the state of the first deterministic finite automaton reached by collation to the collation result of the first character candidate,
When collating the second character candidate following the first character candidate with the first deterministic finite automaton, the second character candidate from the state indicated by the number written in the collation result of the first character candidate The recognition apparatus according to claim 3, wherein the second character candidate is collated by following an edge corresponding to the state transition by the recognition candidate.

The forbidden processing unit is obtained by collating the character string specified by the collation result with a forbidden dictionary that models the forbidden character string and collating a character string including the forbidden character string with a knowledge dictionary. The recognition apparatus according to claim 1, wherein the verification result is deleted.

The recognition apparatus according to claim 5, wherein the prohibition dictionary is a finite automaton that models a character string to be prohibited.

The forbidden dictionary is a second deterministic finite automaton;
The prohibition processing unit is
A number indicating the state of the second deterministic finite automaton reached by collating the character string specified by the first collation result is written to the first collation result;
When collating a character string specified by a second collation result obtained by collating the second character candidate following the first collation result with the second deterministic finite automaton, the first collation result The character string specified by the second collation result is collated by following the edge corresponding to the state transition by the recognition candidate of the second character candidate from the state indicated by the number written in The recognition device described.

The second deterministic finite automaton is
The recognition apparatus according to claim 7, wherein when a character string including a prohibited character string is input, the recognition apparatus is configured to transition to an acceptance state.

A candidate detection step of detecting a character candidate that is a set of pixels presumed to include a character from the input image;
A recognition step of recognizing each of the character candidates and generating at least one recognition candidate that is a candidate character of a recognition result;
Each of the at least one recognition candidate is collated with a knowledge dictionary obtained by modeling a character string to be recognized, and a collation result obtained by collating a character string estimated to be included in the input image with a knowledge dictionary is obtained. A matching step to generate,
Among the matching results, a prohibition processing step of deleting a matching result obtained by matching a character string including a prohibited character string and a knowledge dictionary;
A recognition method including:

A program for causing a computer to function as a recognition device,
In the computer,
A candidate detection step of detecting a character candidate that is a set of pixels presumed to include a character from the input image;
A recognition step of recognizing each of the character candidates and generating at least one recognition candidate that is a candidate character of a recognition result;
Each of the at least one recognition candidate is collated with a knowledge dictionary obtained by modeling a character string to be recognized, and a collation result obtained by collating a character string estimated to be included in the input image with a knowledge dictionary is obtained. A matching step to generate,
Among the matching results, a prohibition processing step of deleting a matching result obtained by matching a character string including a prohibited character string and a knowledge dictionary;
A program that executes