JPH0324709B2

JPH0324709B2 -

Info

Publication number: JPH0324709B2
Application number: JP57130605A
Authority: JP
Inventors: Kunio Sakai; Yoshiaki Kurosawa
Original assignee: Tokyo Shibaura Electric Co Ltd
Current assignee: Toshiba Corp
Priority date: 1982-07-27
Filing date: 1982-07-27
Publication date: 1991-04-03
Also published as: JPS5922177A

Description

【発明の詳細な説明】〔発明の技術分野〕本発明は漢字を含む多数の文字を読取り対象と
して、安定に且つ効率良く大分類識別の為の特徴
を抽出することのできる実用性の高い文字読取り
装置に関する。[Detailed Description of the Invention] [Technical Field of the Invention] The present invention provides highly practical characters that can stably and efficiently extract features for broad classification identification by reading a large number of characters including kanji. Relating to a reading device.

[Technical background of the invention]

従来の文字読取り装置にあつては入力文字パタ
ーンの特徴を抽出し、辞書に予め登録されている
辞書パターンの特徴との照合を行ない、その照合
結果に従つて上記入力文字パターンを読取り認識
することが行なわれている。ところがこの文字認
識において漢字を含む多数の文字を読取り対象と
する場合、上記特徴の照合処理が非常に膨大な量
となり、処理効率が著しく悪くなる。そこで従来
では、上記認識処理に先立つて認識対象文字を複
数の概略的な特徴によつて大分類し、これによつ
て識別段階での照合文字数を低減して処理効率の
向上、処理速度の高速化を図ることが行なわれて
いる。この際、上記大分類に用いる特徴と候補文
字識別に用いる特徴とに同じものを用いることが
必要であるが、この特徴は手書文字等に起因する
文字パターンの変形や種々の雑音に対して十分安
定であることが必要である。 Conventional character reading devices extract features of an input character pattern, match them with features of dictionary patterns registered in advance in a dictionary, and read and recognize the input character pattern according to the matching results. is being carried out. However, when a large number of characters including Chinese characters are to be read in this character recognition, the amount of processing required to match the above-mentioned features becomes extremely large, and the processing efficiency deteriorates significantly. Conventionally, prior to the above recognition process, characters to be recognized are roughly classified based on a number of general characteristics, and this reduces the number of characters to be matched at the identification stage, improving processing efficiency and increasing processing speed. Efforts are being made to make this happen. At this time, it is necessary to use the same features as the features used for the above-mentioned major classification and the features used for candidate character identification, but this feature is necessary to prevent deformation of character patterns caused by handwritten characters and various noises. It needs to be sufficiently stable.

しかして、従来、このような文字パターンの大
分類に用いられる特徴としては、文字線の複雑さ
に着目したものや、文字周辺部の形状に着目した
もの、更には文字全体の粗い形状に着目したもの
等がある。尚、これらについては、例えば下記の
文献に詳しく紹介されている。 However, conventionally, the features used to broadly classify character patterns have focused on the complexity of the character lines, the shape of the peripheral part of the character, and even the rough shape of the entire character. There are things that have been done. In addition, these are introduced in detail in, for example, the following literature.

坂井、渡辺 “印刷漢字認識の現状” 情報処理、Vol.22No.4PP.274〜279（昭和56年
４月）森、坂井 “2000字種を100字／秒で読む印刷漢字OCRの
開発” 日経エレクトロニクス、1977年10月31日号
PP.102〜128 このような従来の特徴抽出は、例えば第１図に
示すように原文字パターンの周囲の形状に着目
し、その周辺部上下左右に矩形状に設けた走査領
域Ｕ、Ｄ、Ｌ、Ｒ内に存在する文字線を検出し、
その量を「０」、「１」、「２」なる３段階のレベル
に量子化してこれを特徴データとするものである
（四辺コード法）。また第２図に示すものは、原文
字パターンに接する外接辺ｕ、ｄ、ｌ、ｒに着目
し、これらの外接辺ｕ、ｄ、ｌ、ｒに接する文字
背景部の面積を求め、これを多次元の特徴ベクト
ルとして利用するものである（ペリフエラル・パ
ターン法）。前者（第１図）は文字線部に着目す
るのに対し、後者（第２図）は背景部面積に着目
する点を異にしているが、いずれも文字パターン
の周囲における面積的数量を特徴としていると云
える。 Sakai, Watanabe “Current status of printed kanji recognition” Information Processing, Vol. 22 No. 4 PP. 274-279 (April 1981) Mori, Sakai “Development of printed kanji OCR that reads 2000 character types at 100 characters/second” Nikkei Electronics, October 31, 1977 issue
PP.102-128 Such conventional feature extraction focuses on the shape of the periphery of the original character pattern, as shown in Fig. 1, for example, and scans rectangular scanning areas U, D, Detect character lines existing in L and R,
The amount is quantized into three levels of "0", "1", and "2" and used as feature data (four-sided code method). In addition, the one shown in Figure 2 focuses on circumscribed sides u, d, l, and r that touch the original character pattern, calculates the area of the character background that touches these circumscribed sides u, d, l, and r, and calculates this by It is used as a multidimensional feature vector (peripheral pattern method). The difference is that the former (Figure 1) focuses on the character line area, while the latter (Figure 2) focuses on the area of the background area, but both are characterized by the area quantity around the character pattern. It can be said that this is true.

また文字パターン全体の複雑さに着目して、文
字線の長さを縦方向および横方向に分解して求
め、これらの値を文字の大きさで正規化してその
平均的な文字線密度（複雑指数）を求める方法も
知られている（特公昭53−29416号）。 In addition, focusing on the complexity of the entire character pattern, the length of the character line is divided into vertical and horizontal directions, and these values are normalized by the character size to calculate the average character line density (complex There is also a known method for calculating the index (Special Publication No. 53-29416).

[Problems with background technology]

ところが、これらの特徴抽出法は、活字文字等
のようにそのパターンが規格化されている場合に
は非常に有効であるが、手書文字のように文字と
しての特徴を有しながらも大きく変形しているよ
うな場合には極めて不安定である。しかも、喩え
活字文字であつても、文字パターンに欠けやかす
れが存在する場合、その特徴抽出は甚だ不安定な
ものとなる。換言すれば文字パターンの変形や
種々の雑音に対して常に安定に特徴抽出を行ない
得ないと云う問題を有している。 However, these feature extraction methods are very effective when the pattern is standardized, such as printed characters, but when the pattern is standardized, such as handwritten characters, they are significantly deformed even though they have the characteristics of characters. In such cases, it is extremely unstable. Furthermore, even if the characters are printed, if the character pattern is chipped or faded, feature extraction becomes extremely unstable. In other words, there is a problem in that feature extraction cannot always be performed stably against deformation of character patterns and various noises.

これらの問題を解決するべく、本発明者らは先
に特願昭56−208306号、特願昭57−46749号等に、
新たな特徴抽出法を提唱している。しかし、如何
にして相互に独立で、且つ雑音に対する所謂ふる
まいが相補的な特徴を簡便に抽出するかと云う点
で、新たな解決すべき課題が残されている。 In order to solve these problems, the present inventors previously published Japanese Patent Application No. 56-208306, Japanese Patent Application No. 57-46749, etc.
A new feature extraction method is proposed. However, a new problem remains to be solved in terms of how to easily extract features that are mutually independent and complementary in behavior with respect to noise.

[Purpose of the invention]

本発明はこのような事情を考慮してなされたも
ので、その目的とするところは、手書きされた漢
字を含む多数の文字を読取り対象として、入力さ
れた原文字パターンの大分類識別に必要な特徴を
安定に且つ効率良く抽出することのできる実用性
の高い文字読取り装置を提供することにある。 The present invention was made in consideration of these circumstances, and its purpose is to read a large number of characters, including handwritten kanji, and to identify the major classifications of input original character patterns. It is an object of the present invention to provide a highly practical character reading device that can extract features stably and efficiently.

[Summary of the invention]

本発明は２値量子化されて記憶された原文字パ
ターンの上下左右４方向の外接辺からなる外接枠
を検出し、上記外接枠に囲まれる領域を上下方向
および左右方向にそれぞれ２分してなる上半分、
下半分、左半分および右半分の各部分領域におけ
る上記原文字パターンの文字線の長さをそれぞれ
求め、これらの文字線の長さの前記外接枠に対す
る相対値から前記原文字パターンの部分的複雑さ
を特徴情報として抽出して前記原文字パターンを
大分類識別するようにしたものである。 The present invention detects a circumscribing frame consisting of circumscribing sides in four directions (top, bottom, right, left, and right) of an original character pattern that has been binary quantized and stored, and divides the area surrounded by the circumscribed frame into two in the vertical and horizontal directions, respectively. The upper half,
The lengths of the character lines of the original character pattern in each of the lower half, left half, and right half partial regions are determined, and the partial complexity of the original character pattern is calculated from the relative values of the lengths of these character lines with respect to the circumscribing frame. The character pattern is extracted as feature information and the original character pattern is broadly classified and identified.

また同時に上記部分的複雑さの情報に加えて、
前記各部分領域における文字背景部の重心位置ま
たは面積の情報を求め、あるいは上記文字背景部
の文字輪郭線の長さの外接枠に対する相対値から
求められる平均的凹凸量の情報を総合的に用いて
前記原文字パターンを大分類識別するようにした
ものである。 At the same time, in addition to the above partial complexity information,
Information on the center of gravity position or area of the character background portion in each of the partial regions is obtained, or information on the average unevenness amount obtained from the relative value of the length of the character contour line of the character background portion with respect to the circumscribing frame is comprehensively used. The original character pattern is roughly classified and identified.

〔Effect of the invention〕

かくして本発明によれば、原文字パターンの外
接枠によつて囲まれる領域を上下左右に２分して
なる上半分、下半分、左半分および右半分の各部
分領域における原文字パターンの文字線の長さの
外接枠に対する相対値から、上記原文字パターン
の部分的複雑さを求めて、大分類識別の為の情報
とするので、文字パターンの大きさの変化や位置
のずれなどの種々の雑音に対して十分安定にその
特徴情報を得ることができる。これ故、入力パタ
ーンの大分類識別を安定且つ効率良く行なうこと
ができ、文字認識処理の著しい向上を図り得る。
また漢字等の複雑な文字パターンであつても、ま
た手書文字を対象とする場合であつてもさらには
活字文字を対象とする場合であつても上記特徴は
以下に述べる簡単な方法によつて容易に得ること
ができるので文字認識処理において実用性の高い
顕著な効果を奏し得る。 Thus, according to the present invention, the character lines of the original character pattern in each of the upper half, lower half, left half, and right half partial regions formed by dividing the region surrounded by the circumscribing frame of the original character pattern into two vertically and horizontally. The partial complexity of the above-mentioned original character pattern is determined from the relative value of the length with respect to the circumscribed frame, and this is used as information for major classification identification. The feature information can be obtained with sufficient stability against noise. Therefore, it is possible to stably and efficiently perform broad classification identification of input patterns, and it is possible to significantly improve character recognition processing.
In addition, even when dealing with complex character patterns such as kanji, handwritten characters, and even printed characters, the above features can be achieved using the simple method described below. Since it can be easily obtained, it can have a highly practical and remarkable effect in character recognition processing.

更には上述した各部分領域の部分的複雑さに加
えて文字背景部の重心位置や面積の情報、更には
局部的凹凸量の情報を用いて原文字パターンの大
分類識別を行うので、相互に独立で、且つ雑音に
対する所謂ふるまいが相補的な原文字パターンの
特徴を簡易に、且つ適確に抽出し得るので、その
大分類識別を精度良く、しかも安定に行なうこと
が可能となる。これ故、種々の変形を伴う多種の
文字をそれぞれ適確に認識して読取り入力するこ
とができる等の実用上絶大なる効果が奏せられ
る。 Furthermore, in addition to the partial complexity of each partial region mentioned above, information on the center of gravity position and area of the character background, and information on the amount of local unevenness are used to perform general classification identification of the original character pattern. Since features of the original character pattern that are independent and complementary in behavior to noise can be easily and accurately extracted, it is possible to accurately and stably identify the major classifications. Therefore, great practical effects can be achieved, such as being able to accurately recognize and read and input a wide variety of characters with various deformations.

[Embodiments of the invention]

以下、図面を参照して本発明の一実施例につい
て説明する。 An embodiment of the present invention will be described below with reference to the drawings.

第３図ａ〜ｄは実施例装置における原文字パタ
ーンの部分的複雑さである平均線数の特徴抽出処
理について示すものであり、ここでは手書き入力
された「仕」なる漢字パターンが示される。この
文字パターンは、光電変換されたのち２値量子化
されてパターン記憶装置に記憶されて与えられる
ものである。このようにして与えられる原文字パ
ターンに対して、その特徴抽出処理は先ず上記原
文字パターンを囲む上下左右４方向の外接辺ｕ、
ｄ、ｌ、ｒからなる外接枠を検出することが行な
われる。上記外接辺ｕ、ｄ、ｌ、ｒは文字パター
ンを走査検出し、その文字線の位置座標を相互に
比較することによつて行なわれ、文字パターンの
最左端に接する左外接辺ｌ、同じく最右端に接す
る右外接辺ｒ、そして最上端に接する上外接辺ｕ
と最下端に接する下外接辺ｄをそれぞれ検出する
ことによつて求められる。またこの外接枠の大き
さは、上記上下の外接辺ｕ、ｄの距離Ｖと、左右
の外装辺ｌ、ｒの距離Ｈとしてそれぞれ求められ
る。 FIGS. 3a to 3d show feature extraction processing of the average number of lines, which is the partial complexity of the original character pattern, in the apparatus of the embodiment, and here a handwritten kanji pattern for "shi" is shown. This character pattern is subjected to photoelectric conversion, then binary quantized, and stored in a pattern storage device. For the original character pattern given in this way, the feature extraction process first begins with the circumscribed edges u in four directions surrounding the original character pattern, up, down, left and right,
A circumscribing frame consisting of d, l, and r is detected. The above circumscribed edges u, d, l, and r are determined by scanning and detecting the character pattern and comparing the position coordinates of the character lines with each other. The right circumscribed edge r touches the right edge, and the upper circumscribed edge u touches the top edge.
and the lower circumscribed side d touching the bottom end. The size of this circumscribing frame is determined as the distance V between the upper and lower circumscribing sides u and d, and the distance H between the left and right exterior sides l and r.

しかるのち、上記外接枠で囲まれる領域を上下
方向に２等分して上半分の部分領域Ｕと下半分の
部分領域Ｄとを定義し、同様に上記領域を左右方
向に２等分して左半分の部分領域Ｌと右半分の部
分領域Ｒとをそれぞれ定義する。そして、各部分
領域Ｕ、Ｄ、Ｌ、Ｒにおける文字パターンの文字
線の長さをそれぞれ求める。この文字線の長さ
は、第３図ａ中文字パターンの局部領域を○印で
指示するように斜め方向、縦方向および横方向に
分解して捕えることができ、第３図ｂ〜ｄにそれ
ぞれ示すように２×２画素からなる微小領域の画
素データ配列から、その単位長をls、lt、ｌ、ly
として方向別にその出現数を計数することによつ
て求められる。即ち、左半分の部分領域Ｌにおけ
る文字線長L_Lは、同部分領域Ｌの文字パターン
を走査して上記微小領域におけるパターンを順次
検査し、 L′_L＝１／２｛Σlx＋Σls／２＋Σlt｝として求められる。同様にして右半分の部分領域
Ｒでは L′_R＝１／２｛Σlx＋Σls／２＋Σlt｝として求められ、上半分および下半分の部分領域
Ｕ、Ｄでは L′_U＝１／２｛Σly＋Σls／２＋Σlt｝ L′_D＝１／２｛Σly＋Σls／２＋Σlt｝として求められる。しかして、これらの文字線長
L_L、L_R、L_U、L_Dの各外接辺ｌ、ｒ、ｕ、ｄに対
する相対値は、前記外接枠の大きさを正規化係数
として、 L_L＝L′_L／Ｈ／２＝１／Ｈ｛Σlx＋Σls／２＋Σlt｝ L_R＝１／Ｈ｛Σlx＋Σls／２＋Σlt｝ L_U＝L′_U／Ｖ／２＝１／Ｖ｛Σly＋Σls／２＋Σlt｝ L_D＝１／Ｖ｛Σly＋Σls／２＋Σlt｝としてそれぞれ求められる。このようにして求め
られる各部分領域Ｌ、Ｒ、Ｕ、Ｄにおける文字線
の長さL_L、L_R、L_U、L_Dは物理的には左半分およ
び右半分の各領域にそれぞれ横線成分が何本存在
するか、また上半分および下半分の各領域にそれ
ぞれ縦線成分が何本存在するかと云う平均線数を
表わすものとなる。つまり、これらの情報は各部
分領域における文字パターンの局部的複雑さを示
すことになる。 After that, the area surrounded by the circumscribed frame is divided into two in the vertical direction to define an upper half partial area U and a lower half partial area D, and the above area is similarly divided into two in the left and right direction. A left half partial region L and a right half partial region R are respectively defined. Then, the lengths of the character lines of the character pattern in each of the partial areas U, D, L, and R are determined. The length of this character line can be determined by dividing the local area of the middle character pattern in Figure 3a into the diagonal, vertical and horizontal directions as indicated by circles, and Figures 3b to d show the length of the character line. As shown, from the pixel data array of a micro area consisting of 2 × 2 pixels, the unit length is ls, lt, l, ly.
It is obtained by counting the number of occurrences in each direction. That is, the character line length L L in the left half partial area _L is determined by scanning the character pattern in the same partial area L and sequentially inspecting the patterns in the minute area, as L' _L = 1/2 {Σlx + Σls/2 + Σlt}. Desired. Similarly, for the right half partial region R, L' _R = 1/2 {Σlx + Σls/2 + Σlt}, and for the upper and lower half partial regions U and D, L' _U = 1/2 {Σly + Σls/2 + Σlt} It is obtained as L' _D = 1/2 {Σly + Σls/2 + Σlt}. However, these character line lengths
The relative values of L _L , L _R , L _U , and L _D with respect to each of the circumscribed sides l, r, u, and d are determined by using the size of the circumscribed frame as a normalization coefficient, L _L =L' _L /H/2= 1/H{Σlx+Σls/2+Σlt} L _R =1/H{Σlx+Σls/2+Σlt} L _U =L' _U /V/2=1/V{Σly+Σls/2+Σlt} L _D =1/V{Σly+Σls/2+Σlt} Each is required. Physically, the character line lengths L _L , L R , L U , and L _D in each partial area L, _R , _U , and D obtained in this way are horizontal line components in each of the left and right half areas. It represents the average number of lines, that is, how many vertical line components are present in each region of the upper half and the lower half. In other words, this information indicates the local complexity of the character pattern in each partial area.

かくして、このような各領域の局部的複雑さの
情報に従えば、例えば漢字特有の偏や傍や冠等の
特徴情報を得ることができ、その認識処理の前段
階としての大分類識別の為の情報として有効に利
用することが可能となる。特に、このような情報
を用いれば、手書入力された原文字パターンが、
偏と傍とからなる漢字か、また冠と脚からなる漢
字か、更にはそれ以外のものであるかを容易に且
つ精度良く認識することができるので、その実用
的利点は非常に高い。 In this way, by following information on the local complexity of each region, it is possible to obtain characteristic information such as the bias, sides, and crowns unique to kanji, which can be used for broad classification identification as a preliminary step to recognition processing. It becomes possible to use the information effectively as information. In particular, if such information is used, the original character pattern input by hand can be
Since it is possible to easily and accurately recognize whether a Chinese character consists of ``bia'' and ``side'', a kanji consisting of a crown and a foot, or something else, its practical advantage is very high.

ところで本装置では、上述した部分領域におけ
る局部的な複雑さの情報に加えて、各部分領域に
おける文字背景部の重心位置の情報（面積）や、
文字輪郭線から求められる平均的凹凸量の情報を
も併用して入力文字パターンの大分類識別を行な
うように構成している。 By the way, in this device, in addition to the information on the local complexity in the partial area mentioned above, information on the center of gravity position (area) of the character background part in each partial area,
Information on the average amount of unevenness obtained from the character contours is also used to broadly classify input character patterns.

例えば文字背景部の重心位置は各外接辺ｕ、
ｄ、ｌ、ｒを基準位置（基準線）として、各領域
Ｕ、Ｄ、Ｌ、Ｒを各外接辺ｕ、ｄ、ｌ、ｒから文
字パターン中心に向う方向にそれぞれ走査し、文
字線に到達する迄の走査線長を求めて計算され
る。但し、上記走査の最大長を例えば長さｌ（可
変）として文字パターンの大きさに応じて規定す
るようにすることが信号処理上好ましい。即ち、
これらを走査線長の総和で示される各外接辺ｕ、
ｄ、ｌ、ｒにそれぞれ対応した文字パターンの背
景部の面積Ｓを求める。例えば外接辺ｌにおける
背景部の面積Ｓは、第４図に示すように最大長
Ｈ／２に規定された走査線にて外接辺ｌを基準位
置として文字パターンの中心に向う方向に走査
し、その繰和を求める。但し、走査線が文字線に
到達したとき、その走査は終了し、また文字線に
到達しなかつたときには前記長さｌに達したとき
にその走査を終了とする。そして、このようにし
て求められた面積Ｓで示される領域の各点を
（ｘ、ｙ）として、領域Ｓの重心位置Ｇ（Ｘ、Ｙ）
を次のようにして求める。 For example, the center of gravity of the character background is each circumscribed edge u,
Using d, l, and r as reference positions (reference lines), scan each area U, D, L, and R from each circumscribed side u, d, l, and r in the direction toward the center of the character pattern to reach the character line. It is calculated by finding the scanning line length up to that point. However, from the viewpoint of signal processing, it is preferable that the maximum length of the scanning is defined as, for example, a length l (variable) depending on the size of the character pattern. That is,
Each circumscribed edge u, which is represented by the sum of the scanning line lengths,
The area S of the background portion of the character pattern corresponding to d, l, and r is determined. For example, the area S of the background portion on the circumscribed side l is determined by scanning in the direction toward the center of the character pattern with the circumscribed side l as a reference position using a scanning line defined to have a maximum length H/2 as shown in FIG. Find its recurrence. However, when the scanning line reaches the character line, the scanning ends, and if the scanning line does not reach the character line, the scanning ends when the length l is reached. Then, assuming that each point of the area indicated by the area S obtained in this way is (x, y), the center of gravity position G (X, Y) of the area S is
Find it as follows.

Ｘ＝∬_sｘ・S_(x X=∬ _s x・S _(x

Claims

[Scope of Claims] 1. Means for photoelectrically converting and inputting the original character pattern, means for binary quantizing and storing the input original character pattern, and four directions of the stored original character pattern, top, bottom, left and right. means for detecting a circumscribed frame consisting of circumscribed sides; means for determining the length of each character line of the original character pattern; means for detecting the partial complexity of the original character pattern from the relative value of the length of these character lines with respect to the circumscribing frame; A character recognition device comprising means for broadly classifying and identifying the original character pattern according to the information of the original character pattern. 2 The length of the character line in each partial area is 2×2
2. The character reading device according to claim 1, wherein the length of the boundary line of the original character pattern in a minute region of pixels is determined by integrating the lengths of the boundary lines for each direction of the boundary line. 3. A means for photoelectrically converting and inputting the original character pattern, a means for binary quantizing and storing the inputted original character pattern, and a circumscription consisting of the circumscribed sides of the stored original character pattern in four directions: top, bottom, left, and right. means for detecting a frame, and character lines of the original character pattern in each of the upper half, lower half, left half, and right half partial regions obtained by dividing the region surrounded by the circumscribed frame into two in the vertical direction and the horizontal direction, respectively. means for determining the length of each of the character lines, means for detecting the partial complexity of the original character pattern from the relative value of the length of these character lines with respect to the circumscribed frame, and the position of the center of gravity of the character background in each of the partial areas. or means for determining the relative value of the area value with respect to the circumscribing frame, and means for detecting the character outline of the character background portion in each of the partial areas, and determining the relative value of the character outline length with respect to the circumscribing frame for the original character. means for determining an average amount of unevenness around the pattern; and information on at least one of the average amount of unevenness, the center of gravity position of the character background part, or its area, and the information on the partial complexity to form the original character pattern. 1. A character reading device comprising: means for identifying major categories.