JPH0632087B2

JPH0632087B2 - Pattern recognition device

Info

Publication number: JPH0632087B2
Application number: JP60247045A
Authority: JP
Inventors: 斎司蔭山; 修国崎; 歳弘花野井
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1985-11-06
Filing date: 1985-11-06
Publication date: 1994-04-27
Anticipated expiration: 2009-04-27
Also published as: JPS62107389A

Description

[Detailed Description of the Invention]

[Field of the Invention]

The present invention relates to a pattern recognition device.

[Background of the Invention]

In character recognition, an input pattern is recognized by comparing it with standard patterns registered in memory in advance. This set of standard patterns is called a dictionary. As shown in FIG. 2, when handwritten kanji are recognized in the recognition unit 20 using a shared dictionary 21 (a set of standard patterns for recognizing character patterns written by unspecified writers), the number of target character categories runs to several thousand and the character shapes are complex. As a result, carefully written block-style characters can be read, but when everyday handwriting is to be read, either a sufficient recognition rate cannot be obtained, or the dictionary capacity (that is, cost) and the recognition time needed to obtain a sufficient recognition rate become enormous and impractical. The method shown in FIG. 3 has been proposed as a way of solving these problems. FIG. 4 shows examples of carefully written block-style characters and everyday handwriting.

The method shown in FIG. 3 uses a personal dictionary 31 (a set of standard patterns dedicated to one individual) as the dictionary. The personal dictionary 31 is built only from characters written by that individual. Compared with using a shared dictionary, this method raised the recognition rate and recognition speed and reduced the dictionary capacity, but it had the following drawbacks.

While the number of learned characters is still small, the personal dictionary contains almost none of the categories the individual uses. The personal dictionary is therefore unusable at that stage.

Even when a category in use is in the personal dictionary, the recognition rate is low if the number of learning samples for that category is small. See, for example, the learning curve of the recognition rate with a personal dictionary in FIG. 5. In that figure, the recognition rate at around one or two learning samples is lower than with the shared dictionary, which shows that the personal dictionary is unusable in that range.

The conventional methods are discussed in the following documents.

Seiichiro Naito, "Basic Study on Personal Handwritten Kanji Recognition," Technical Report of the Institute of Electronics and Communication Engineers of Japan, PRL81-94.

Mitsu Yoshimura, "On the Effectiveness of Personal Templates in Handwritten Character Recognition," Transactions of the Institute of Electronics and Communication Engineers of Japan, Vol. J66-D, No. 4, pp. 454-455.

[Object of the Invention]

An object of the present invention is to provide a pattern recognition device that eliminates the above drawbacks of the prior art.

[Outline of Invention]

To achieve the above object, the present invention provides a pattern recognition device with the configuration shown in FIG. 1.

In FIG. 1, dictionary 1 is the dictionary used for recognition. Dictionary 2 stores per-category statistics of the feature patterns written by a specific writer. While the specific writer has written few characters, or none at all, the shared dictionary is set in dictionary 1.

The dictionary replacement unit 8 replaces all of dictionary 1, or individual standard patterns belonging to dictionary 1, with all of dictionary 2 or with standard patterns belonging to dictionary 2. It does so as the specific writer writes more characters, learning progresses, and dictionary 2 (which is built only from character samples written by the specific writer) becomes more complete.

This device eliminates the drawbacks of the conventional approaches and provides the following effects.

As the number of characters written by the specific writer increases, dictionary 1 can be replaced gradually from the shared dictionary to a personal dictionary, automatically and without manual intervention. Recognition performance improves as the shared standard patterns in dictionary 1 decrease, and recognition speed improves as well.

For categories that the individual has never written, or has written only a few samples of, the shared standard patterns are used, so recognition performance comparable to a conventional OCR for unspecified writers (the method of FIG. 2) is obtained. That is, carefully written block-style characters can be read.

For categories that the individual has written sufficiently often, everyday handwriting can be read with high recognition performance, as with a conventional OCR for a specific writer (the method of FIG. 3).

The personal standard patterns in dictionary 1 and dictionary 2 are always pure, in that they are built only from samples written by the individual. Samples written by other people are never used to build them.

[Embodiments of the Invention]

Next, a first embodiment of the present invention is described using the configuration example of FIG. 1. The configuration example consists of a recognition unit 3, an answer category memory unit 4, a feature memory unit 5, a correction unit 6, a learning unit 7, a dictionary 1, a dictionary 2, a dictionary replacement unit 8, and a control unit 9.

First, a sequence of character patterns is input to the recognition unit 3 as an input pattern sequence. The recognition unit 3 converts the input pattern sequence into an electrical signal by photoelectric conversion, performs preprocessing, segmentation, feature extraction, and so on, and outputs a sequence of feature patterns segmented character by character.

For each of these feature patterns, the recognition unit 3 also evaluates its similarity to every standard pattern belonging to dictionary 1 and outputs an answer category based on the evaluation values. The sequence of feature patterns and the sequence of answer categories output by the recognition unit 3 are stored in the feature memory unit 5 and the answer category memory unit 4, respectively. In the correction unit 6, the user of the device checks and corrects the answer categories in the answer category memory unit 4. The learning unit 7 takes as input the feature patterns in the feature memory unit 5 and the corrected answer categories in the answer category memory unit 4, computes the average of the input feature patterns for each category, and outputs it to dictionary 2. The dictionary replacement unit 8 replaces standard patterns belonging to dictionary 1 with standard patterns belonging to dictionary 2, based on evaluation values for those standard patterns in dictionary 2.
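The recognition step described above can be illustrated with a minimal sketch. The text does not state which similarity measure the recognition unit 3 uses, so the negative Euclidean distance below is an assumption, as are the function name `recognize`, the dictionary layout (category mapped to a feature vector), and the two-dimensional toy features.

```python
import math

def recognize(feature, dictionary1):
    """Return the category whose standard pattern is most similar to `feature`.

    `dictionary1` maps a category to its standard pattern (a feature vector).
    """
    def similarity(a, b):
        # Assumed measure: negative Euclidean distance (larger means more similar).
        return -math.dist(a, b)

    # Evaluate the similarity to every standard pattern and pick the best one.
    return max(dictionary1, key=lambda cat: similarity(feature, dictionary1[cat]))

# Hypothetical toy dictionary with three categories.
dictionary1 = {"海": [1.0, 0.0], "山": [0.0, 1.0], "川": [1.0, 1.0]}
answer = recognize([0.9, 0.2], dictionary1)
```

Because every standard pattern in dictionary 1 is compared with the input, the cost of this step grows with the size of dictionary 1, which is consistent with the text's remark that recognition speed improves as dictionary 1 shrinks to the personal patterns.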

The features of the present invention are that dictionary 2 is provided in addition to dictionary 1, and that the dictionary replacement unit 8 is provided. Dictionary 2 stores per-category statistics of the feature patterns extracted from character patterns written by the specific user of the device. For example, suppose the learning unit 7 receives feature patterns f₁, f₂, ..., f_n1 whose character category is "海" (sea), feature patterns g₁, ..., g_n2 whose character category is "山" (mountain), and feature patterns h₁, h₂, ..., h_n3 whose character category is "川" (river). The learning unit 7 then outputs the averages f, g, and h as the standard patterns for the character categories "海", "山", and "川". Here a character in quotation marks denotes a character category, and f, g, and h denote the averages of the respective feature patterns, that is, f = (f₁ + f₂ + ... + f_n1)/n1, and likewise for g and h. The dictionary replacement unit 8 determines whether each standard pattern belonging to dictionary 2 is good enough to be used in the recognition dictionary 1 and, if so, performs the replacement one standard pattern at a time. Specifically, the dictionary replacement unit 8 executes the following procedure (called dictionary replacement method 1).
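As an illustration of the averaging performed by the learning unit 7, the sketch below computes a per-category mean feature vector and also keeps the number of learning samples, which the replacement methods use later. The function name `learn`, the entry keys `pattern` and `n_samples`, and the toy feature vectors are assumptions made for this example only.

```python
from collections import defaultdict

def learn(samples):
    """Build dictionary 2 from (corrected_category, feature_vector) pairs.

    Each entry holds the mean feature vector of the category and the number
    of learning samples that went into it.
    """
    sums = {}
    counts = defaultdict(int)
    for category, feature in samples:
        if category not in sums:
            sums[category] = [0.0] * len(feature)
        # Accumulate the element-wise sum of the feature vectors.
        sums[category] = [s + x for s, x in zip(sums[category], feature)]
        counts[category] += 1
    return {
        c: {"pattern": [s / counts[c] for s in sums[c]], "n_samples": counts[c]}
        for c in sums
    }

# Hypothetical samples: two for "海", one for "山".
samples = [("海", [1.0, 3.0]), ("海", [3.0, 5.0]), ("山", [0.0, 2.0])]
dictionary2 = learn(samples)
```

Only samples confirmed or corrected by the user enter `samples`, so every pattern in this structure comes from that one writer.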

(Dictionary replacement method 1)

(1) For each standard pattern belonging to dictionary 2, compute the following evaluation values: (a) whether the category of that standard pattern is in dictionary 1; (b) if it is, whether the standard pattern in dictionary 1 is a shared standard pattern or a personal standard pattern; (c) the number of learning samples; and (d) others.

(2) Based on all of the evaluation values (a) to (d) obtained in (1) above, or on any combination of them, add the standard patterns belonging to dictionary 2 to dictionary 1 or update dictionary 1 with them.

In this figure, the description table 2B of dictionary 2 lists, for each standard pattern, the evaluation values (a) to (d) computed in step (1) of dictionary replacement method 1.

FIGS. 7 and 8 show specific examples of dictionary replacement method 1. In both, the description table of dictionary 2 in FIG. 7(b) and FIG. 8(b) was computed from dictionary 2 itself and from dictionary 1 of FIG. 7(a) and FIG. 8(a) and then evaluated, after which dictionary 1 of FIG. 7(c) and FIG. 8(c) was output. As stated in Note 1 below FIG. 7(c) and FIG. 8(c), in FIGS. 7(a), (b), (c) and FIGS. 8(a), (b), (c), 共 denotes a shared standard pattern and 個 denotes a personal standard pattern. As stated in Note 2, θ₁ is the threshold on the number of learning samples that applies against a shared standard pattern, and θ₂ is the threshold on the number of learning samples that applies against a personal standard pattern. Specifically, θ₁ is used when dictionary 1 already holds a shared standard pattern (A₁) for a category: to replace it with the personal standard pattern (A₂) for the same category in dictionary 2, the number of learning samples of (A₂) is required to be θ₁ or more. θ₂ is used when dictionary 1 already holds a personal standard pattern (A₁) for a category: to replace it with the standard pattern (A₂) for the same category in dictionary 2, the number of learning samples of (A₂) is required to be θ₂ or more.
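Dictionary replacement method 1 with the two thresholds can be sketched as follows. This is an illustration under stated assumptions rather than the figures' exact logic: the entry layout (`pattern`, `kind`, `n_samples`) and the function name are hypothetical, only evaluation values (a) to (c) are used, and a category missing from dictionary 1 is added unconditionally because the text gives no threshold for that case.

```python
def replace_method1(dict1, dict2, theta1, theta2):
    """Per-pattern replacement of dictionary 1 entries by dictionary 2 entries.

    dict1 entries: {"pattern": [...], "kind": "shared" or "personal"}
    dict2 entries: {"pattern": [...], "n_samples": int}
    Returns a new dictionary 1; the inputs are not modified.
    """
    updated = dict(dict1)
    for category, entry in dict2.items():
        current = updated.get(category)          # evaluation value (a)
        if current is None:
            accept = True                        # assumption: always add new categories
        elif current["kind"] == "shared":        # evaluation value (b)
            accept = entry["n_samples"] >= theta1
        else:
            accept = entry["n_samples"] >= theta2
        if accept:
            updated[category] = {"pattern": entry["pattern"], "kind": "personal"}
    return updated

# Hypothetical dictionaries and thresholds.
dict1 = {
    "海": {"pattern": [1.0, 0.0], "kind": "shared"},
    "山": {"pattern": [0.0, 1.0], "kind": "personal"},
}
dict2 = {
    "海": {"pattern": [0.9, 0.1], "n_samples": 5},
    "山": {"pattern": [0.1, 0.8], "n_samples": 2},
    "川": {"pattern": [1.0, 1.0], "n_samples": 1},
}
result = replace_method1(dict1, dict2, theta1=3, theta2=4)
```

In this toy run the shared pattern for "海" is replaced (5 samples against θ₁ = 3), the personal pattern for "山" is kept (2 samples against θ₂ = 4), and "川" is added.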

Next, a second specific example is described. In this example, the dictionary replacement unit executes the following procedure (called dictionary replacement method 2).

(Dictionary replacement method 2)

(1) Determine whether dictionary 2 as a whole is sufficiently usable as a personal dictionary for the specific user of the device. Specifically, evaluate (a) whether the number of categories n_c is sufficiently large, (b) whether the average number of learning samples per standard pattern n_s is sufficiently large, (c) whether the total number of learning samples over all standard patterns n_t is sufficiently large, and (d) others. Whether dictionary 2 is sufficiently usable is determined from all of (a) to (d) or from any suitable combination of them. For example, dictionary 2 is judged sufficiently usable if n_c ≥ θ_c and n_s ≥ θ_s, where θ_c and θ_s are positive constants.

(2) If dictionary 2 is judged sufficiently usable in (1) above, replace all of dictionary 1 with dictionary 2 (dedicated to the specific user).
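A minimal sketch of dictionary replacement method 2, using the example criterion n_c ≥ θ_c and n_s ≥ θ_s from step (1). The function names and the entry layout (`pattern`, `kind`, `n_samples`) are assumptions for illustration; other combinations of (a) to (d) would be equally valid under the text.

```python
def dictionary2_is_sufficient(dict2, theta_c, theta_s):
    """Judge whether dictionary 2 as a whole can serve as a personal dictionary."""
    n_c = len(dict2)                                    # (a) number of categories
    n_t = sum(e["n_samples"] for e in dict2.values())   # (c) total learning samples
    n_s = n_t / n_c if n_c else 0.0                     # (b) average per standard pattern
    return n_c >= theta_c and n_s >= theta_s

def replace_method2(dict1, dict2, theta_c, theta_s):
    """Replace all of dictionary 1 with dictionary 2 when dictionary 2 is sufficient."""
    if dictionary2_is_sufficient(dict2, theta_c, theta_s):
        return {
            c: {"pattern": e["pattern"], "kind": "personal"}
            for c, e in dict2.items()
        }
    return dict1  # otherwise dictionary 1 is left unchanged

# Hypothetical data: two categories with 6 and 4 learning samples.
dict1 = {"川": {"pattern": [1.0, 1.0], "kind": "shared"}}
dict2 = {
    "海": {"pattern": [0.9, 0.1], "n_samples": 6},
    "山": {"pattern": [0.1, 0.8], "n_samples": 4},
}
replaced = replace_method2(dict1, dict2, theta_c=2, theta_s=5)
unchanged = replace_method2(dict1, dict2, theta_c=3, theta_s=5)
```

Note that the sufficiency test never reads dictionary 1, which matches the remark that no input from dictionary 1 to the dictionary replacement unit is needed for this method.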

Dictionary replacement method 2 can also be realized with the configuration example of FIG. 6. In this case, however, it differs from dictionary replacement method 1 in the following two points.

(i) The evaluation values (a) to (d) above, which apply to dictionary 2 as a whole, are computed and recorded in the description table of dictionary 2.

(ii) No input from dictionary 1 to the dictionary replacement unit is needed.

Next, a third specific example is described. In this example, the dictionary replacement unit performs processing that combines dictionary replacement methods 1 and 2. Specifically, as shown in FIG. 9, dictionary replacement method 2 is tried first. If dictionary 2 is judged sufficiently usable, all of dictionary 1 is replaced with all of dictionary 2, completing dictionary replacement method 2. If dictionary 2 is judged insufficient, dictionary replacement method 1 is applied.
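The control flow of this combined procedure can be sketched as below. To keep the example short and self-contained, the sufficiency test and the two replacement procedures are passed in as callables, and the stand-ins used in the demonstration are deliberately simplified (dictionary values are just the labels "shared" or "personal"). All names here are hypothetical.

```python
def replace_combined(dict1, dict2, is_sufficient, replace_whole, replace_per_pattern):
    """Try whole-dictionary replacement first, then fall back to per-pattern replacement."""
    if is_sufficient(dict2):
        # Method 2: dictionary 2 replaces dictionary 1 as a whole.
        return replace_whole(dict2)
    # Method 1: replace or add individual standard patterns only.
    return replace_per_pattern(dict1, dict2)

# Simplified stand-ins for the two methods (assumptions for this demo).
is_sufficient = lambda d2: len(d2) >= 2
replace_whole = lambda d2: {c: "personal" for c in d2}
replace_per_pattern = lambda d1, d2: {**d1, **{c: "personal" for c in d2}}

dict1 = {"海": "shared", "山": "shared", "川": "shared"}
small_dict2 = {"海": "personal"}
large_dict2 = {"海": "personal", "山": "personal"}

partial = replace_combined(dict1, small_dict2, is_sufficient, replace_whole, replace_per_pattern)
full = replace_combined(dict1, large_dict2, is_sufficient, replace_whole, replace_per_pattern)
```

With the small dictionary 2, the shared patterns for "山" and "川" remain available; with the large one, dictionary 1 ends up containing personal patterns only.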

Next, a fourth specific example is described. Suppose that, in specific example 2 or 3, the dictionary replacement unit has executed dictionary replacement method 2, so that all of dictionary 1 has been replaced with all of dictionary 2 and dictionary 1 has become a dictionary dedicated to the specific user (containing no shared standard patterns at all). In this example, from that state onward the dictionary replacement unit executes the following dictionary replacement method 3.

(Dictionary replacement method 3)

(1) Evaluate, as the change, the amount of dictionary 2 that has changed through learning (that is, the standard patterns that have been added or updated). The change is evaluated by computing the following evaluation values: (a) the increase in the number of categories, (b) the increase in the average number of learning samples per standard pattern, (c) the increase in the total number of learning samples over all standard patterns, and (d) others. The evaluation is based on all of them or on any suitable combination of them.

(2) If the change evaluated in (1) above is sufficiently large, the changed portion of dictionary 2 is assigned to dictionary 1. Here, assignment means adding to dictionary 1, or updating in dictionary 1, the standard patterns that were added or updated in dictionary 2.
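Dictionary replacement method 3 can be illustrated with the sketch below, which uses only evaluation value (c), the increase in the total number of learning samples, as the measure of change. The bookkeeping of per-category sample counts at the last synchronization (`last_counts`), the threshold name `theta_delta`, and the entry layout are assumptions introduced for this example.

```python
def replace_method3(dict1, dict2, last_counts, theta_delta):
    """Copy the changed part of dictionary 2 into dictionary 1 when the change is large.

    `last_counts` maps a category to its learning-sample count at the time
    dictionary 1 was last synchronized with dictionary 2.
    Returns (new dictionary 1, new last_counts); inputs are not modified.
    """
    # Standard patterns added or updated since the last synchronization.
    changed = [c for c, e in dict2.items() if e["n_samples"] != last_counts.get(c, 0)]
    # Evaluation value (c): increase in the total number of learning samples.
    delta_total = sum(dict2[c]["n_samples"] - last_counts.get(c, 0) for c in changed)
    if delta_total < theta_delta:
        return dict1, last_counts
    new_dict1 = dict(dict1)
    new_counts = dict(last_counts)
    for c in changed:
        new_dict1[c] = {"pattern": dict2[c]["pattern"], "kind": "personal"}
        new_counts[c] = dict2[c]["n_samples"]
    return new_dict1, new_counts

# Hypothetical state: "海" gained 2 samples and "川" is new with 2 samples.
dict1 = {"海": {"pattern": [1.0], "kind": "personal"}}
dict2 = {
    "海": {"pattern": [1.2], "n_samples": 7},
    "川": {"pattern": [0.5], "n_samples": 2},
}
last_counts = {"海": 5}
synced, counts = replace_method3(dict1, dict2, last_counts, theta_delta=3)
skipped, same_counts = replace_method3(dict1, dict2, last_counts, theta_delta=10)
```

Deferring the copy until the accumulated change reaches the threshold avoids rewriting dictionary 1 after every single learned sample.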

Dictionary replacement method 3 can also be realized with the configuration example of FIG. 6. In this case, however, it differs from dictionary replacement method 1 in the following two points.

(i) The changes (a) to (d) above, which apply to dictionary 2 as a whole, are computed and recorded in the description table of dictionary 2.

The two embodiments above can be implemented using a microprocessor, memory, a scanner, a microphone, a display, and the like.

The present invention can also be applied in each of the following cases.

(1) The above embodiments concern the case where the dictionary consists of one set of a standard pattern and a threshold per category. Using multiple sets of standard patterns per category can improve recognition performance further. The present invention is applicable to that case as well.

(2) In the above embodiments, all feature patterns obtained from the input patterns were used as learning samples. A dictionary can be built in the same way using any of the following as learning samples.

All of the above feature patterns, except those the user judges should not be learned.

The feature patterns that resulted in an error or a reject when the input patterns were recognized.

The feature patterns that resulted in an error or a reject, except those the user judges should not be learned.

The learning sample selection and judgment described above can be performed in the correction unit of FIG. 1.
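The learning sample selection options above can be sketched as a single filter. The record layout (`id`, `feature`, `recognized`, `corrected`), the convention that a reject is represented by `recognized` being `None`, and the parameter names are assumptions for illustration.

```python
def select_learning_samples(records, errors_only=False, excluded=frozenset()):
    """Choose which recognized characters become learning samples.

    records: list of dicts with keys "id", "feature", "recognized", "corrected".
    errors_only: keep only samples that were misrecognized or rejected.
    excluded: ids of samples the user judged should not be learned.
    Returns a list of (corrected_category, feature) pairs.
    """
    selected = []
    for r in records:
        if r["id"] in excluded:
            continue  # the user decided this sample should not be learned
        is_error_or_reject = r["recognized"] is None or r["recognized"] != r["corrected"]
        if errors_only and not is_error_or_reject:
            continue
        selected.append((r["corrected"], r["feature"]))
    return selected

# Hypothetical records: one correct, one error, one reject.
records = [
    {"id": 1, "feature": [1.0], "recognized": "海", "corrected": "海"},
    {"id": 2, "feature": [2.0], "recognized": "山", "corrected": "川"},
    {"id": 3, "feature": [3.0], "recognized": None, "corrected": "山"},
]
all_samples = select_learning_samples(records)
user_filtered = select_learning_samples(records, excluded={1})
errors = select_learning_samples(records, errors_only=True)
errors_filtered = select_learning_samples(records, errors_only=True, excluded={3})
```

The four calls correspond to learning from everything, from everything minus user exclusions, from errors and rejects only, and from errors and rejects minus user exclusions.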

(3) In the above embodiments, character patterns were the target of recognition. The present invention can also be applied to the recognition of patterns other than characters, such as speech patterns and graphic patterns.

The present invention can also be applied to any logically valid combination of (1) to (3).

[Effects of the Invention]

The present invention achieved the following effects (1) to (4).

(1) As the number of characters written by the specific writer increased, dictionary 1 could be replaced gradually from the shared dictionary to a personal dictionary, automatically and without manual intervention. Recognition performance improved as the shared standard patterns in dictionary 1 decreased, and recognition speed improved as well.

(2) For categories that the individual had never written, or had written only a few samples of, the shared standard patterns were used, so recognition performance comparable to a conventional OCR for unspecified writers (the method of FIG. 2) was obtained. That is, carefully written block-style characters could be read.

(3) For categories that the individual had written sufficiently often, everyday handwriting could be read with high recognition performance, as with a conventional OCR for a specific writer (the method of FIG. 3).

(4) The personal standard patterns in dictionary 1 and dictionary 2 were always pure, in that they were built only from samples written by the individual. Samples written by other people were never used to build them.

[Brief description of drawings]

FIG. 1 is a block diagram showing the configuration of one embodiment of the present invention; FIGS. 2 and 3 are block diagrams for explaining the prior art; FIG. 4 is an explanatory diagram showing examples of carefully written block-style characters and everyday handwriting; FIG. 5 is a graph showing an example of a learning curve of the recognition rate when a personal dictionary is used; FIG. 6 is a block diagram for explaining the dictionary replacement method; FIGS. 7 and 8 are diagrams for explaining specific examples of the dictionary replacement method; and FIG. 9 is a flowchart for explaining another specific example.

1: dictionary 1; 2: dictionary 2; 3: recognition unit; 4: answer category memory unit; 5: feature memory unit; 6: correction unit; 7: learning unit; 8: dictionary replacement unit; 9: control unit.

Continuation of front page

(56) References cited: JP-A-59-106085 (JP, A)

Claims

[Claims]

1. A pattern recognition apparatus comprising: a first dictionary, which is a set of first standard patterns each representing a category; a recognition unit that evaluates the similarity between a feature pattern cut out from an input pattern and each standard pattern belonging to the first dictionary, calculates and outputs an answer category corresponding to the input pattern based on the similarity evaluation values, and also outputs the feature pattern cut out from the input pattern; a correction unit that corrects the answer category output by the recognition unit; a learning unit that receives the corrected answer category and the feature pattern output from the recognition unit and outputs, for each category, the average value of the feature patterns corresponding to the answer category as a second standard pattern; a second dictionary into which the second standard patterns are input and added or updated; and a dictionary replacement unit that, when an evaluation value of the contents of the second dictionary exceeds a certain threshold, replaces the corresponding portion of the first dictionary with those contents or adds those contents to the first dictionary.

2. The pattern recognition apparatus according to claim 1, wherein the dictionary replacement unit performs the replacement of the first standard pattern with the second standard pattern, for each category of the second standard patterns, based on at least one of the following evaluation values: whether the category of the second standard pattern is in the first dictionary; if it is, whether the standard pattern in the first dictionary is a first standard pattern or has been replaced, added, or updated by a second standard pattern; and the number of learning samples.

3. The pattern recognition apparatus according to claim 1, wherein the dictionary replacement unit judges the sufficiency of the second dictionary as a personal dictionary dedicated to a specific user based on at least one of the evaluation values consisting of the number of categories, the average number of learning samples per standard pattern, and the total number of learning samples over all standard patterns, and, after that judgment, replaces the entire first dictionary with the entire second dictionary.

4. The pattern recognition apparatus according to claim 3, wherein, when the sufficiency of the second dictionary as a personal dictionary is judged not to be satisfied, the dictionary replacement unit performs the replacement of the first standard pattern with the second standard pattern, for each category of the second standard patterns, based on at least one of the following evaluation values: whether the category of the second standard pattern is in the first dictionary; if it is, whether the standard pattern in the first dictionary is a first standard pattern or a second standard pattern; and the number of learning samples.

5. The pattern recognition apparatus according to claim 3 or 4, wherein, after the second dictionary has satisfied the sufficiency as a personal dictionary and the entire first dictionary has been replaced with the entire second dictionary, the dictionary replacement unit evaluates, as a change, the amount of the second standard patterns changed by learning in the second dictionary, based on at least one of the evaluation values consisting of the increase in the number of categories, the increase in the average number of learning samples per standard pattern, and the increase in the total number of learning samples over all standard patterns, and, if the change is sufficiently large, adds the changed portion of the second dictionary to the first dictionary or updates the first dictionary with it.