JPH0139156B2

JPH0139156B2 -

Info

Publication number: JPH0139156B2
Application number: JP56151051A
Authority: JP
Inventors: Michiaki Nakanishi
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1981-09-24
Filing date: 1981-09-24
Publication date: 1989-08-18
Also published as: JPS5852783A

Description

【発明の詳細な説明】本発明は文字認識に用いる文字特徴の抽出方式
に関し、従来法によつては隠れてしまう部分の特
徴抽出を可能にしようとするものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a method for extracting character features used in character recognition, and is intended to enable feature extraction of portions that are hidden by conventional methods.

文字認識においては、文字を左方から見た特
徴、上方から見た特徴を求め、文字認識の資料に
供するということが行なわれる。例えば第１図に
示す数字３を例にとると、手書きした該数字３を
ラインスキヤナなどで走査して２値ビデオ信号を
得、一旦メモリに格納し、該メモリから数字３の
ビデオ信号を切出し（一般には他の手書き文字等
と共に走査されて該メモリに格納されているか
ら）、左から右へ走査する各水平を走査上で最初
に０（背景）から１（文字）に変換する点の集まり
を求め、それを左方から見た該数字３の輪郭特徴
とする。この論理では線分l₁，l₂，l₃が左方から
見た文字の特徴（こゝでは、左方特徴という）と
なる。同様に上から下へ走査する各垂直走査線上
で最初に０から１に変る点の集りを求めると第２
図の線分l₄，l₅が得られ、これが上方から見た文
字の特徴（上方特徴）となる。これらの左方特
徴、上方特徴および同様にして求めた右方特徴、
下方特徴は抽出が容易で、しかも数字０，１，２
……などの単純なパターンの文字に対してはよい
特徴データとなり、文字認識に効果的に利用でき
る。例えば上記の線分l₁とl₃の間に入り込んだ線
分l₂があるというものは０〜９の数字では３以外
にはなく、従つて対象は０〜９のいずれかという
ことであればこの特徴１つだけでも数字３の認識
が可能である。しかし、陰に隠れて抽出されない
特徴も多々ある。例えば左方特徴では、大きく凹
んでいる部分P₁，P₂などは抽出されておらず、
上方特徴では抽出されるのはいわば屋根又は庇に
なつている部分のみでその下の全部が隠れてしま
つて抽出されない。従つて漢字はおろかカナなど
の少し複雑なパターンの文字では特徴抽出不充分
となつて、他のものと分離、識別できない文字が
発生する。 In character recognition, characteristics of characters viewed from the left and from above are determined and used as data for character recognition. For example, taking the number 3 shown in Figure 1 as an example, the handwritten number 3 is scanned with a line scanner to obtain a binary video signal, temporarily stored in memory, and the video signal of number 3 is cut out from the memory ( (generally scanned and stored in the memory along with other handwritten characters, etc.), each horizontal scan from left to right is a collection of points that first convert from 0 (background) to 1 (character) on the scan. , and use it as the outline feature of the number 3 seen from the left. According to this logic, line segments l ₁ , l ₂ , and l ₃ are the features of the character viewed from the left (herein referred to as left features). Similarly, if we find the first set of points that change from 0 to 1 on each vertical scanning line scanning from top to bottom, the second
Line segments l ₄ and l ₅ in the diagram are obtained, and these become the features of the character viewed from above (upper features). These left features, upper features, and right features obtained in the same way,
The downward features are easy to extract, and the numbers 0, 1, 2
This provides good feature data for characters with simple patterns such as ..., and can be effectively used for character recognition. For example, there is a line segment l ₂ that intersects between the above line segments l ₁ and l ₃ , but there is no other number than 3 in the range 0 to 9, so the target is either 0 to 9. It is possible to recognize the number 3 using only one characteristic of cigarettes. However, there are many features that are hidden in the background and are not extracted. For example, in the left feature, large concave parts P ₁ and P ₂ are not extracted,
In the case of upper features, only the part that forms a roof or eaves is extracted, and everything below it is hidden and not extracted. Therefore, the feature extraction is insufficient for characters with slightly complex patterns such as kanji and kana, and some characters cannot be separated or identified from other characters.

そこで本発明は陰に隠れる部分の特徴抽出も行
なえるようにして正確、迅速な文字認識を可能に
しようとするものである。即ち本発明は文字パタ
ーンを水平又は垂直方向に走査し、背景と文字と
の白黒変化点の集りとして、該文字パターンの輪
郭を構成する線分を取出し、それを文字認識に用
いる文字特徴とする特徴抽出方式において、各水
平、垂直走査線方向における、１番目を除く奇数
番目の白黒変化点の集りとして、走査方向で最初
に現われる文字パターン部分の陰に隠れた文字パ
ターン部分の輪郭をなす線分を抽出し、それを前
記文字特徴とすることを特徴とするが、次に図面
を参照しながらこれを詳細に説明する。 Therefore, the present invention aims to enable accurate and quick character recognition by making it possible to extract features from hidden parts. That is, the present invention scans a character pattern in the horizontal or vertical direction, extracts line segments that make up the outline of the character pattern as a collection of black-and-white transition points between the background and characters, and uses these as character features for character recognition. In the feature extraction method, a line forming the outline of a character pattern part hidden behind the character pattern part that first appears in the scanning direction is a collection of odd-numbered black and white transition points other than the first in each horizontal and vertical scanning line direction. The present invention is characterized in that the characters are extracted and used as the character features, which will be described in detail below with reference to the drawings.

前記の線分l₁，l₂……は次のようにして認識さ
れる。即ち走査線l₀，l₁，l₂はＹ座標が異なるか
らそのＹ座標別に、０→反転を生じる最初の点の
座標（Ｘ座標）を求めると（Y₀，０），（Y₁，
０），（Y₂，０），（Y₃，X₃），（Y₄，X₄）……の如
きデータ群が得られる。こゝで０は“０→１反転
を生じる点は無い”を示すが、このようなものも
単純にXi（ｉ＝０，１，２……）で示すと、上記
データは（Xi，Yi）で表わされる。これらのデ
ータはメモリのYiアドレスにXiデータを記録す
るという方法をとると、処理が容易である。即ち
アドレスカウンタを逐次＋１してY₀，Y₁，Y₂…
…アドレスのデータX₀，X₁，X₂……を読出し、
ΔX＝X_i+1−Xiを求めてみると、線分l₁，l₂，l₃に
おいては差分ΔXは小さいからこの事を以つて連
続した線分であると判定できる。またΔX＜０な
ら左肩下りの線分、ΔX＞０なら右肩下りの線分
と言える。線分l₁とl₂，l₂とl₃の境ではΔXは突然
大になる。このような場合線分は途切れている、
少なくとも水平走査線と平行な線分で接続されて
いるに過ぎないと言える。線分の端を決定するの
はこの不連続点と、Xi−０からある値を持つよ
うになつた点である。線分l₁の始端は後者、終端
は前者であり、線分l₂は両端が前者、線分l₃は始
端が前者、終端が後者である。そして線分l₂のよ
うに両端が不連続点ということは、数字０〜９の
ように１つにつながつたものにおいては、両端に
文字パターン部分がある、両端が閉じていると言
える。これに対して線分l₁，l₃は両端が開放して
いると言える。両端クローズの線分を持つという
ことは、前述のように数字３の大きな特徴点であ
る。 The line segments l ₁ , l ₂ . . . are recognized as follows. That is, since the scanning lines l ₀ , l ₁ , and l ₂ have different Y coordinates, finding the coordinates (X coordinates) of the first point where 0 → inversion occurs for each Y coordinate, (Y ₀ , 0), (Y ₁ ,
0), (Y ₂ , 0), (Y ₃ , X ₃ ), (Y ₄ , X ₄ )... are obtained. Here, 0 indicates "there is no point that causes 0 → 1 inversion", but if such a point is also simply expressed as Xi (i = 0, 1, 2...), the above data becomes (Xi, Yi ). These data can be easily processed by recording the Xi data at the Yi address in the memory. That is, the address counter is sequentially incremented by 1 and Y ₀ , Y ₁ , Y ₂ , etc.
… Read address data X ₀ , X ₁ , X ₂ …,
When determining ΔX=X _i+1 −Xi, the difference ΔX is small for line segments l ₁ , l ₂ , and l _{3 ,} so it can be determined from this that they are continuous line segments. Also, if ΔX<0, it can be said to be a line segment going down the left shoulder, and if ΔX>0, it can be said to be a line segment going down the right shoulder. At the boundary between line segments l ₁ and l ₂ and l ₂ and l ₃ , ΔX suddenly becomes large. In this case, the line segment is broken,
It can be said that they are connected by line segments parallel to at least the horizontal scanning line. It is this discontinuity point and the point that takes on a certain value from Xi-0 that determines the end of the line segment. The starting end of the line segment _l1 is the latter, and the ending is the former, the both ends of the line segment _l2 are the former, and the starting end of the line segment _l3 is the latter. The fact that both ends are discontinuous points, such as the line segment l ₂ , means that when numbers 0 to 9 are connected, it can be said that there are character pattern portions at both ends, and both ends are closed. On the other hand, both ends of line segments l ₁ and l ₃ can be said to be open. As mentioned above, having a line segment with both ends closed is a major feature of the number 3.

ところで数字３には凹みP₁，P₂があるから、
これをも検出すると、数字３の特徴を一層よりよ
く抽出したことになる。この凹み部分は線分l₁，
l₃の陰になつているので抽出できなかつたもので
あるが、抽出論理を「最初の０→１変化点」では
なく「３番目の（一般化すれば奇数番目の）０→
１変化点」とすると、陰になつた部分を抽出でき
る。即ち第３図に示すように凹んだ部分P₁を通
る走査線l_aについて０―１反転をみるとそれは
Q₁，Q₂，Q₃，Q₄の４点であり、陰れた部分l₆の
輪郭を定める反転点Q₃は３番目である。この
「３番目の反転点」の論理で線分l₆，l₇を抽出で
き、これと第１図の方式つまり「１番目の反転
点」の論理で求めた線分l₂を合せると凹部の最深
部まで入り込んだ線分l₈が得られる。かゝる線分
l₈を用いると数字「３」の認識は一層確実、容易
になる。即ちこの線分のΔXを求めるとそれは
正、負、正、負と変り、数字３の特徴をよく表わ
している。かゝる線分l₈と前記線分l₁，l₃を組み
合せる、即ち垂直方向では線分l₁，l₈，l₃の順で
存在し、そして水平方向では線分l₁，l₃の右方に
あり両端が該線分l₁，l₃と重なる線分l₈があると
いう論理では、相当乱暴に手書きしたものでも数
字３を他のものと分離、識別できる。 By the way, the number 3 has depressions P ₁ and P ₂ , so
If this is also detected, the feature of number 3 will have been extracted even better. This concave part is line segment l ₁ ,
l It could not be extracted because it is in the shadow of ₃ , but the extraction logic is not "the first 0 → 1 change point" but "the third (odd numbered if generalized) 0 →
1 change point", the shaded part can be extracted. That is, as shown in Fig. 3, if we look at the 0-1 inversion of the scanning line _l a passing through the concave part P ₁ , it is
There are four points Q ₁ , Q ₂ , Q ₃ , and Q ₄ , and the inversion point Q ₃ that defines the contour of the shaded portion l ₆ is the third. Line segments l ₆ and l ₇ can be extracted using the logic of this "third reversal point", and by combining these with the line segment l ₂ obtained using the method shown in Figure 1, that is, the logic of the "first reversal point", a concave portion is formed. A line segment l ₈ that penetrates to the deepest part of is obtained. line segment
l Recognizing the number ``3'' becomes more reliable and easier when using ₈ . That is, when we calculate ΔX of this line segment, it changes from positive to negative to positive to negative, which clearly represents the characteristics of the number 3. Combining the line segment l ₈ with the line segments l _{1 and} l ₃ , that is, in the vertical direction, the line segments l ₁ , l ₈ , and l ₃ exist in this order, and in the horizontal direction, the line segments l ₁ and l Based on the logic that there is a line segment l ₈ to the right of ₃ whose ends overlap with the line segments l ₁ and l ₃ , the numeral 3 can be separated and identified from others even if it is handwritten in a very rough manner.

第４図は手書きの「チ」、第５図は手書きの
「テ」の例を示す。これらの相違点は突出部Ｒが
あるか否かが唯一の識別ポイントというケースも
珍らしくない。しかしこれは上部の文字パターン
部分Ｂに隠れているので、第１図の最初の変化点
という論理では抽出できない。これに対して３番
目の変化点、特に垂直走査線における３番目の変
化点という論理を用いると線分l₉，l₁₀が抽出でき
る（両端部は、「最初の変化点」で抽出したも
の）。線分l₉，l₁₀が抽出できれば、「Ｙ座標変化が
一様か」で（垂直方向をＸとする）、突出部Ｒの
有無をチエツクでき、ひいてはテとチの識別が可
能になる。まぎらわしい字は多々あり、例えば片
仮名のウとカ、クとウ、ミとシ、リとン、ニとン
なども乱暴に書かれた場合に単純な左方特徴、上
方特徴などでは識別しにくいものである。これら
も「奇数番目」の論理で陰になる部分を抽出する
と、又はそれと単純な左方特徴等と組合せると識
別可能となることを期待できる。 FIG. 4 shows an example of a handwritten "chi", and FIG. 5 shows an example of a handwritten "te". It is not uncommon for these differences to be determined by the presence or absence of the protrusion R. However, since this is hidden in the upper character pattern portion B, it cannot be extracted using the logic of the first change point in FIG. On the other hand, if we use the logic of the third change point, especially the third change point in the vertical scanning line, line segments l ₉ and l ₁₀ can be extracted (both ends are the ones extracted at the "first change point"). ). If the line segments l ₉ and l ₁₀ can be extracted, the presence or absence of the protrusion R can be checked based on whether the change in the Y coordinate is uniform (with the vertical direction being X), and it becomes possible to identify the tip and the tip. There are many confusing characters, such as katakana ``u'' and ``ka'', ``ku'' and ``u'', ``mi'' and ``shi'', ``riton'', and ``niton'', which are difficult to distinguish using simple left and top features when written in a rough manner. It is something. These can also be expected to be identified by extracting the shadowed parts using the "odd-numbered" logic, or by combining it with a simple left-hand feature, etc.

手書き数字および仮名の認識においては特徴と
して例えば1000種など多数を用い、これらで50段
程度のトリー回路を構成し、それを迹つていくこ
とにより文字認識を行なう。本発明により抽出す
る特徴もその１つに加えて使用されるものであ
る。なお特徴が適切であれば比較的少数の段を迹
るだけで結果を得ることができ、文字認識速度を
上げることができる。この点、本発明方式は甚だ
有効である。 In the recognition of handwritten numbers and kana, a large number of features, such as 1000 types, are used, and these constitute a tree circuit of about 50 stages, and character recognition is performed by walking through the tree circuit. The features extracted by the present invention are also used in addition to one of them. Note that if the features are appropriate, results can be obtained by passing through a relatively small number of stages, and character recognition speed can be increased. In this respect, the method of the present invention is extremely effective.

[Brief explanation of drawings]

第１図および第２図は従来法の説明図、第３図
〜第５図は本発明法の説明図である。図面でl₁〜l₁₀は線分、l₆，l₇，l₉，l₁₀は陰に隠れ
た文字パターン部分の線分である。 1 and 2 are explanatory diagrams of the conventional method, and FIGS. 3 to 5 are explanatory diagrams of the method of the present invention. In the drawing, l ₁ to _{l 10} are line segments, and l ₆ , l ₇ , l ₉ , and l ₁₀ are line segments of hidden character pattern parts.

Claims

[Claims] 1. Scanning a character pattern horizontally or vertically,
In a feature extraction method that extracts line segments constituting the outline of a character pattern as a collection of black-and-white transition points between the background and characters, and uses them as character features for character recognition, 1 in each horizontal and vertical scanning line direction is used. A line segment forming the outline of a character pattern part hidden behind a character pattern part that appears first in the scanning direction is extracted as a collection of odd-numbered black-and-white change points excluding the character pattern part, and this is used as the character feature. A character feature extraction method used for character recognition.