JPH0139156B2 - - Google Patents
Info
- Publication number
- JPH0139156B2 JPH0139156B2 JP56151051A JP15105181A JPH0139156B2 JP H0139156 B2 JPH0139156 B2 JP H0139156B2 JP 56151051 A JP56151051 A JP 56151051A JP 15105181 A JP15105181 A JP 15105181A JP H0139156 B2 JPH0139156 B2 JP H0139156B2
- Authority
- JP
- Japan
- Prior art keywords
- character
- features
- line
- character pattern
- extracted
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/18—Extraction of features or characteristics of the image
- G06V30/182—Extraction of features or characteristics of the image by coding the contour of the pattern
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Character Discrimination (AREA)
Description
【発明の詳細な説明】
本発明は文字認識に用いる文字特徴の抽出方式
に関し、従来法によつては隠れてしまう部分の特
徴抽出を可能にしようとするものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a method for extracting character features used in character recognition, and is intended to enable feature extraction of portions that are hidden by conventional methods.
文字認識においては、文字を左方から見た特
徴、上方から見た特徴を求め、文字認識の資料に
供するということが行なわれる。例えば第1図に
示す数字3を例にとると、手書きした該数字3を
ラインスキヤナなどで走査して2値ビデオ信号を
得、一旦メモリに格納し、該メモリから数字3の
ビデオ信号を切出し(一般には他の手書き文字等
と共に走査されて該メモリに格納されているか
ら)、左から右へ走査する各水平を走査上で最初
に0(背景)から1(文字)に変換する点の集まり
を求め、それを左方から見た該数字3の輪郭特徴
とする。この論理では線分l1,l2,l3が左方から
見た文字の特徴(こゝでは、左方特徴という)と
なる。同様に上から下へ走査する各垂直走査線上
で最初に0から1に変る点の集りを求めると第2
図の線分l4,l5が得られ、これが上方から見た文
字の特徴(上方特徴)となる。これらの左方特
徴、上方特徴および同様にして求めた右方特徴、
下方特徴は抽出が容易で、しかも数字0,1,2
……などの単純なパターンの文字に対してはよい
特徴データとなり、文字認識に効果的に利用でき
る。例えば上記の線分l1とl3の間に入り込んだ線
分l2があるというものは0〜9の数字では3以外
にはなく、従つて対象は0〜9のいずれかという
ことであればこの特徴1つだけでも数字3の認識
が可能である。しかし、陰に隠れて抽出されない
特徴も多々ある。例えば左方特徴では、大きく凹
んでいる部分P1,P2などは抽出されておらず、
上方特徴では抽出されるのはいわば屋根又は庇に
なつている部分のみでその下の全部が隠れてしま
つて抽出されない。従つて漢字はおろかカナなど
の少し複雑なパターンの文字では特徴抽出不充分
となつて、他のものと分離、識別できない文字が
発生する。 In character recognition, characteristics of characters viewed from the left and from above are determined and used as data for character recognition. For example, taking the number 3 shown in Figure 1 as an example, the handwritten number 3 is scanned with a line scanner to obtain a binary video signal, temporarily stored in memory, and the video signal of number 3 is cut out from the memory ( (generally scanned and stored in the memory along with other handwritten characters, etc.), each horizontal scan from left to right is a collection of points that first convert from 0 (background) to 1 (character) on the scan. , and use it as the outline feature of the number 3 seen from the left. According to this logic, line segments l 1 , l 2 , and l 3 are the features of the character viewed from the left (herein referred to as left features). Similarly, if we find the first set of points that change from 0 to 1 on each vertical scanning line scanning from top to bottom, the second
Line segments l 4 and l 5 in the diagram are obtained, and these become the features of the character viewed from above (upper features). These left features, upper features, and right features obtained in the same way,
The downward features are easy to extract, and the numbers 0, 1, 2
This provides good feature data for characters with simple patterns such as ..., and can be effectively used for character recognition. For example, there is a line segment l 2 that intersects between the above line segments l 1 and l 3 , but there is no other number than 3 in the range 0 to 9, so the target is either 0 to 9. It is possible to recognize the number 3 using only one characteristic of cigarettes. However, there are many features that are hidden in the background and are not extracted. For example, in the left feature, large concave parts P 1 and P 2 are not extracted,
In the case of upper features, only the part that forms a roof or eaves is extracted, and everything below it is hidden and not extracted. Therefore, the feature extraction is insufficient for characters with slightly complex patterns such as kanji and kana, and some characters cannot be separated or identified from other characters.
そこで本発明は陰に隠れる部分の特徴抽出も行
なえるようにして正確、迅速な文字認識を可能に
しようとするものである。即ち本発明は文字パタ
ーンを水平又は垂直方向に走査し、背景と文字と
の白黒変化点の集りとして、該文字パターンの輪
郭を構成する線分を取出し、それを文字認識に用
いる文字特徴とする特徴抽出方式において、各水
平、垂直走査線方向における、1番目を除く奇数
番目の白黒変化点の集りとして、走査方向で最初
に現われる文字パターン部分の陰に隠れた文字パ
ターン部分の輪郭をなす線分を抽出し、それを前
記文字特徴とすることを特徴とするが、次に図面
を参照しながらこれを詳細に説明する。 Therefore, the present invention aims to enable accurate and quick character recognition by making it possible to extract features from hidden parts. That is, the present invention scans a character pattern in the horizontal or vertical direction, extracts line segments that make up the outline of the character pattern as a collection of black-and-white transition points between the background and characters, and uses these as character features for character recognition. In the feature extraction method, a line forming the outline of a character pattern part hidden behind the character pattern part that first appears in the scanning direction is a collection of odd-numbered black and white transition points other than the first in each horizontal and vertical scanning line direction. The present invention is characterized in that the characters are extracted and used as the character features, which will be described in detail below with reference to the drawings.
前記の線分l1,l2……は次のようにして認識さ
れる。即ち走査線l0,l1,l2はY座標が異なるか
らそのY座標別に、0→反転を生じる最初の点の
座標(X座標)を求めると(Y0,0),(Y1,
0),(Y2,0),(Y3,X3),(Y4,X4)……の如
きデータ群が得られる。こゝで0は“0→1反転
を生じる点は無い”を示すが、このようなものも
単純にXi(i=0,1,2……)で示すと、上記
データは(Xi,Yi)で表わされる。これらのデ
ータはメモリのYiアドレスにXiデータを記録す
るという方法をとると、処理が容易である。即ち
アドレスカウンタを逐次+1してY0,Y1,Y2…
…アドレスのデータX0,X1,X2……を読出し、
ΔX=Xi+1−Xiを求めてみると、線分l1,l2,l3に
おいては差分ΔXは小さいからこの事を以つて連
続した線分であると判定できる。またΔX<0な
ら左肩下りの線分、ΔX>0なら右肩下りの線分
と言える。線分l1とl2,l2とl3の境ではΔXは突然
大になる。このような場合線分は途切れている、
少なくとも水平走査線と平行な線分で接続されて
いるに過ぎないと言える。線分の端を決定するの
はこの不連続点と、Xi−0からある値を持つよ
うになつた点である。線分l1の始端は後者、終端
は前者であり、線分l2は両端が前者、線分l3は始
端が前者、終端が後者である。そして線分l2のよ
うに両端が不連続点ということは、数字0〜9の
ように1つにつながつたものにおいては、両端に
文字パターン部分がある、両端が閉じていると言
える。これに対して線分l1,l3は両端が開放して
いると言える。両端クローズの線分を持つという
ことは、前述のように数字3の大きな特徴点であ
る。 The line segments l 1 , l 2 . . . are recognized as follows. That is, since the scanning lines l 0 , l 1 , and l 2 have different Y coordinates, finding the coordinates (X coordinates) of the first point where 0 → inversion occurs for each Y coordinate, (Y 0 , 0), (Y 1 ,
0), (Y 2 , 0), (Y 3 , X 3 ), (Y 4 , X 4 )... are obtained. Here, 0 indicates "there is no point that causes 0 → 1 inversion", but if such a point is also simply expressed as Xi (i = 0, 1, 2...), the above data becomes (Xi, Yi ). These data can be easily processed by recording the Xi data at the Yi address in the memory. That is, the address counter is sequentially incremented by 1 and Y 0 , Y 1 , Y 2 , etc.
… Read address data X 0 , X 1 , X 2 …,
When determining ΔX=X i+1 −Xi, the difference ΔX is small for line segments l 1 , l 2 , and l 3 , so it can be determined from this that they are continuous line segments. Also, if ΔX<0, it can be said to be a line segment going down the left shoulder, and if ΔX>0, it can be said to be a line segment going down the right shoulder. At the boundary between line segments l 1 and l 2 and l 2 and l 3 , ΔX suddenly becomes large. In this case, the line segment is broken,
It can be said that they are connected by line segments parallel to at least the horizontal scanning line. It is this discontinuity point and the point that takes on a certain value from Xi-0 that determines the end of the line segment. The starting end of the line segment l1 is the latter, and the ending is the former, the both ends of the line segment l2 are the former, and the starting end of the line segment l3 is the latter. The fact that both ends are discontinuous points, such as the line segment l 2 , means that when numbers 0 to 9 are connected, it can be said that there are character pattern portions at both ends, and both ends are closed. On the other hand, both ends of line segments l 1 and l 3 can be said to be open. As mentioned above, having a line segment with both ends closed is a major feature of the number 3.
ところで数字3には凹みP1,P2があるから、
これをも検出すると、数字3の特徴を一層よりよ
く抽出したことになる。この凹み部分は線分l1,
l3の陰になつているので抽出できなかつたもので
あるが、抽出論理を「最初の0→1変化点」では
なく「3番目の(一般化すれば奇数番目の)0→
1変化点」とすると、陰になつた部分を抽出でき
る。即ち第3図に示すように凹んだ部分P1を通
る走査線laについて0―1反転をみるとそれは
Q1,Q2,Q3,Q4の4点であり、陰れた部分l6の
輪郭を定める反転点Q3は3番目である。この
「3番目の反転点」の論理で線分l6,l7を抽出で
き、これと第1図の方式つまり「1番目の反転
点」の論理で求めた線分l2を合せると凹部の最深
部まで入り込んだ線分l8が得られる。かゝる線分
l8を用いると数字「3」の認識は一層確実、容易
になる。即ちこの線分のΔXを求めるとそれは
正、負、正、負と変り、数字3の特徴をよく表わ
している。かゝる線分l8と前記線分l1,l3を組み
合せる、即ち垂直方向では線分l1,l8,l3の順で
存在し、そして水平方向では線分l1,l3の右方に
あり両端が該線分l1,l3と重なる線分l8があると
いう論理では、相当乱暴に手書きしたものでも数
字3を他のものと分離、識別できる。 By the way, the number 3 has depressions P 1 and P 2 , so
If this is also detected, the feature of number 3 will have been extracted even better. This concave part is line segment l 1 ,
l It could not be extracted because it is in the shadow of 3 , but the extraction logic is not "the first 0 → 1 change point" but "the third (odd numbered if generalized) 0 →
1 change point", the shaded part can be extracted. That is, as shown in Fig. 3, if we look at the 0-1 inversion of the scanning line l a passing through the concave part P 1 , it is
There are four points Q 1 , Q 2 , Q 3 , and Q 4 , and the inversion point Q 3 that defines the contour of the shaded portion l 6 is the third. Line segments l 6 and l 7 can be extracted using the logic of this "third reversal point", and by combining these with the line segment l 2 obtained using the method shown in Figure 1, that is, the logic of the "first reversal point", a concave portion is formed. A line segment l 8 that penetrates to the deepest part of is obtained. line segment
l Recognizing the number ``3'' becomes more reliable and easier when using 8 . That is, when we calculate ΔX of this line segment, it changes from positive to negative to positive to negative, which clearly represents the characteristics of the number 3. Combining the line segment l 8 with the line segments l 1 and l 3 , that is, in the vertical direction, the line segments l 1 , l 8 , and l 3 exist in this order, and in the horizontal direction, the line segments l 1 and l Based on the logic that there is a line segment l 8 to the right of 3 whose ends overlap with the line segments l 1 and l 3 , the numeral 3 can be separated and identified from others even if it is handwritten in a very rough manner.
第4図は手書きの「チ」、第5図は手書きの
「テ」の例を示す。これらの相違点は突出部Rが
あるか否かが唯一の識別ポイントというケースも
珍らしくない。しかしこれは上部の文字パターン
部分Bに隠れているので、第1図の最初の変化点
という論理では抽出できない。これに対して3番
目の変化点、特に垂直走査線における3番目の変
化点という論理を用いると線分l9,l10が抽出でき
る(両端部は、「最初の変化点」で抽出したも
の)。線分l9,l10が抽出できれば、「Y座標変化が
一様か」で(垂直方向をXとする)、突出部Rの
有無をチエツクでき、ひいてはテとチの識別が可
能になる。まぎらわしい字は多々あり、例えば片
仮名のウとカ、クとウ、ミとシ、リとン、ニとン
なども乱暴に書かれた場合に単純な左方特徴、上
方特徴などでは識別しにくいものである。これら
も「奇数番目」の論理で陰になる部分を抽出する
と、又はそれと単純な左方特徴等と組合せると識
別可能となることを期待できる。 FIG. 4 shows an example of a handwritten "chi", and FIG. 5 shows an example of a handwritten "te". It is not uncommon for these differences to be determined by the presence or absence of the protrusion R. However, since this is hidden in the upper character pattern portion B, it cannot be extracted using the logic of the first change point in FIG. On the other hand, if we use the logic of the third change point, especially the third change point in the vertical scanning line, line segments l 9 and l 10 can be extracted (both ends are the ones extracted at the "first change point"). ). If the line segments l 9 and l 10 can be extracted, the presence or absence of the protrusion R can be checked based on whether the change in the Y coordinate is uniform (with the vertical direction being X), and it becomes possible to identify the tip and the tip. There are many confusing characters, such as katakana ``u'' and ``ka'', ``ku'' and ``u'', ``mi'' and ``shi'', ``riton'', and ``niton'', which are difficult to distinguish using simple left and top features when written in a rough manner. It is something. These can also be expected to be identified by extracting the shadowed parts using the "odd-numbered" logic, or by combining it with a simple left-hand feature, etc.
手書き数字および仮名の認識においては特徴と
して例えば1000種など多数を用い、これらで50段
程度のトリー回路を構成し、それを迹つていくこ
とにより文字認識を行なう。本発明により抽出す
る特徴もその1つに加えて使用されるものであ
る。なお特徴が適切であれば比較的少数の段を迹
るだけで結果を得ることができ、文字認識速度を
上げることができる。この点、本発明方式は甚だ
有効である。 In the recognition of handwritten numbers and kana, a large number of features, such as 1000 types, are used, and these constitute a tree circuit of about 50 stages, and character recognition is performed by walking through the tree circuit. The features extracted by the present invention are also used in addition to one of them. Note that if the features are appropriate, results can be obtained by passing through a relatively small number of stages, and character recognition speed can be increased. In this respect, the method of the present invention is extremely effective.
第1図および第2図は従来法の説明図、第3図
〜第5図は本発明法の説明図である。
図面でl1〜l10は線分、l6,l7,l9,l10は陰に隠れ
た文字パターン部分の線分である。
1 and 2 are explanatory diagrams of the conventional method, and FIGS. 3 to 5 are explanatory diagrams of the method of the present invention. In the drawing, l 1 to l 10 are line segments, and l 6 , l 7 , l 9 , and l 10 are line segments of hidden character pattern parts.
Claims (1)
背景と文字との白黒変化点の集りとして、該文字
パターンの輪郭を構成する線分を取出し、それを
文字認識に用いる文字特徴とする特徴抽出方式に
おいて、 各水平、垂直走査線方向における、1番目を除
く奇数番目の白黒変化点の集りとして、走査方向
で最初に現われる文字パターン部分の陰に隠れた
文字パターン部分の輪郭をなす線分を抽出し、そ
れを前記文字特徴とすることを特徴とした文字認
識に用いる文字特徴の抽出方式。[Claims] 1. Scanning a character pattern horizontally or vertically,
In a feature extraction method that extracts line segments constituting the outline of a character pattern as a collection of black-and-white transition points between the background and characters, and uses them as character features for character recognition, 1 in each horizontal and vertical scanning line direction is used. A line segment forming the outline of a character pattern part hidden behind a character pattern part that appears first in the scanning direction is extracted as a collection of odd-numbered black-and-white change points excluding the character pattern part, and this is used as the character feature. A character feature extraction method used for character recognition.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP56151051A JPS5852783A (en) | 1981-09-24 | 1981-09-24 | Feature extraction system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP56151051A JPS5852783A (en) | 1981-09-24 | 1981-09-24 | Feature extraction system |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JPS5852783A JPS5852783A (en) | 1983-03-29 |
| JPH0139156B2 true JPH0139156B2 (en) | 1989-08-18 |
Family
ID=15510220
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP56151051A Granted JPS5852783A (en) | 1981-09-24 | 1981-09-24 | Feature extraction system |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JPS5852783A (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2626854B2 (en) * | 1992-01-31 | 1997-07-02 | 大和ハウス工業株式会社 | Column-column joint structure |
-
1981
- 1981-09-24 JP JP56151051A patent/JPS5852783A/en active Granted
Also Published As
| Publication number | Publication date |
|---|---|
| JPS5852783A (en) | 1983-03-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6055323A (en) | Face image processing system | |
| JPH0546591B2 (en) | ||
| US4105998A (en) | Pattern recognition processing system | |
| JPS62254282A (en) | Method and apparatus for separating overlapped pattern | |
| Gan et al. | A new approach to stroke and feature point extraction in Chinese character recognition | |
| JPH0139156B2 (en) | ||
| JP2677007B2 (en) | Line discrimination system | |
| JPH11250256A (en) | Graphic recognition processing method and recording medium recording the program | |
| JP2506071B2 (en) | Contour tracking device | |
| JPH0223904B2 (en) | ||
| JPS589471B2 (en) | link link | |
| JP2872757B2 (en) | Line direction determination device | |
| JPH08212292A (en) | Frame recognition device | |
| JPH0326878B2 (en) | ||
| JPH0142029B2 (en) | ||
| JP3084833B2 (en) | Feature extraction device | |
| JPH02166583A (en) | Character recognizing device | |
| JPS6116380A (en) | Area division method | |
| JPH04260980A (en) | Device for recognizing graphic | |
| JPS603676B2 (en) | Intersection extraction method | |
| JPS61125687A (en) | Pattern extraction method | |
| JPH04184584A (en) | Method and device for recognizing character | |
| JPH0522164B2 (en) | ||
| JPS603675B2 (en) | Character recognition method | |
| JPH0527908B2 (en) |