JPS5840782B2

JPS5840782B2 - Graphic processing method

Info

Publication number: JPS5840782B2
Application number: JP53053555A
Authority: JP
Inventors: 徹若原
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: NTT Inc
Priority date: 1978-05-04
Filing date: 1978-05-04
Publication date: 1983-09-07
Also published as: JPS54144829A

Description

【発明の詳細な説明】本発明は１図形処理力式、特に図形処理において、前処
理を経た入力図形を細線化、太め処理及びラベリングを
施す適当な操作により多値濃淡画像に変換し、この多値
画像化された図形に画像処理的手法を適用して識別を果
たすようにした図形処理方式に関するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention utilizes a graphic processing method, particularly graphic processing, to convert an input graphic that has undergone preprocessing into a multivalued grayscale image through appropriate operations such as thinning, thickening, and labeling. The present invention relates to a graphic processing method that performs identification by applying an image processing method to a graphic that has been converted into a multivalued image.

ここで言う図形処理の対象となる「図形」とは、広い意
味で構造を持った線図形であり識別されるべき幾つか或
いは多数のカテゴリーを有する線図形群を指す。The term "graphics" to be subjected to graphical processing herein refers to line figures that have a structure in a broad sense, and refers to a group of line figures that have several or many categories to be identified.

以下では対象を具体的に文字図形に限って従来技術を説
明する。In the following, the conventional technology will be explained specifically by limiting the target to characters and graphics.

ＯＣＲ技術の現状については１例えば森俊二：「文字認
識技術の研究」大型プロジェクトパターン情報処理シス
テム講演会論文集、１９７７年７月７，８日７〜３２頁
に優れた解説がある。An excellent explanation of the current state of OCR technology can be found in, for example, Shunji Mori: ``Research on Character Recognition Technology,'' Large Project Pattern Information Processing System Lecture Proceedings, July 7th and 8th, 1977, pp. 7-32.

そこに記されているように「重ね合せ」が文字認識技術
の基本原理であり１手法としてはｔｏｐｄｏｗｎ的で
印刷文字認識に適するテンプレート・マツチングとす、
ｏｔｔｏｍｕｐ的で手書き文字認識に適する構造解析
法に大別される。As stated there, "superposition" is the basic principle of character recognition technology, and one method is template matching, which is top-down and suitable for printed character recognition.
It is broadly divided into structural analysis methods that are ottom-up and suitable for handwritten character recognition.

印刷文字及び一部の手書き文字（英数字・カナ）ｆこつ
いてはＯＣＲ実用化の目途がついており（例えば上掲の
論文集参照）。There are prospects for practical use of OCR for printed characters and some handwritten characters (alphanumeric characters and kana) (for example, see the collection of papers listed above).

現在一番の難題は手書き漢字ＯＣＲである。Currently, the biggest challenge is handwritten kanji OCR.

手書き漢字は、多様な手書き変形に加え、図形として複
雑な構造をもちまたカテゴリーも非常に多数である。In addition to a variety of handwritten deformations, handwritten kanji have a complex graphical structure and a large number of categories.

こうした複雑な手書き線図形識別を構造解析法をこ依る
場合１輪郭解析法は直列処理であり線の切れ・接触に致
命的であり、背景解析法は輪郭の大域的特徴を捉えるこ
とができるが、いわば荒すぎることから、線解析法を採
る立場が従来の主流となっている。When relying on the structural analysis method to identify such complex handwritten line figures 1. The contour analysis method is a serial process and is fatal to line breakage and contact, while the background analysis method can capture the global characteristics of the contour. , so to speak, is too rough, so the conventional mainstream position has been to adopt the line analysis method.

線解析法では、まず入力文字図形は２値化して細線化さ
れて入力文字図形（例えば明朝体など）の線幅の影響を
なくし、次いで得られた心線図形から特徴を抽出する。In the line analysis method, the input character figure is first binarized and thinned to eliminate the influence of the line width of the input character figure (for example, Mincho font), and then features are extracted from the obtained heart line figure.

その手法には１文字線を逐次抽出していくものと１文字
線の端点、分岐点に着目していくものがある（前者につ
いては岩田：「手書き漢字の自動判定」第５１回電波
研研究発表会予稿、１９７６年、６２〜７１頁、後者に
ついては小川１平塚：「正読率が９５％以上の手書き
漢字読み取りシステム」日経エレクトロニクス、１９７
４年８月１２日号。There are two methods: one that sequentially extracts single character lines, and another that focuses on the end points and branching points of single character lines (for the former, see Iwata: ``Automatic recognition of handwritten kanji'', 51st Radio Research Institute Proceedings of the presentation, 1976, pp. 62-71, for the latter, Hiratsuka Ogawa: “Handwritten kanji reading system with a correct reading rate of over 95%” Nikkei Electronics, 197
August 12, 4th issue.

８２〜１１９頁）。(pp. 82-119).

しかし、次に示すような欠点をもち、解決の方向は未だ
明らかでない。However, it has the following drawbacks, and the solution is still unclear.

即ち■細線化処理での「ヒゲ」や「ｖ−＋Ｙ」現象につ
いては局所的処理では対処し峰ない。That is, (2) the "whiskers" and "v-+Y" phenomena in the thinning process cannot be dealt with by local processing.

■心線の傾きや直線性の判定が難しい。■It is difficult to judge the inclination and linearity of the core wire.

■ミクロ的に線を追跡しｂｏｔｔｏｍｕｐに特徴を抽
出するから有効な大分類法がない。■There is no effective general classification method because lines are traced microscopically and features are extracted from the bottom up.

■線の切れや接触に弱い。上記の欠点■〜■は、認識対
象が手書き漢字などの文字図形にとどまらず一般に構造
を持つ線図形群である場合に、構造解析の立場から線解
析法を適用シ認識しようとする際に生ずる本質的難点で
ある。■Vulnerable to wire breakage and contact. The above-mentioned drawbacks ■～■ arise when attempting to recognize the recognition target by applying the line analysis method from the standpoint of structural analysis when the recognition target is not only characters and figures such as handwritten kanji, but generally a group of line figures that have a structure. This is an essential difficulty.

本発明は、これらの欠点を解決するため、２値化された
入力図形をその幾何学的及び位相的特徴が反映された多
値濃淡画像に変換してから画像処理的手法で識別を行な
うという発想転換を企てたものであり、入力文字図形の
線幅の影響を無視すべく心線化（細線化）を行った上で
太め処理を行ってゆき場合によって生じるかも知れない
ヒゲなどによる影響を抑圧できる形で多値濃淡画像をつ
くるようにして、入力図形の手書き変形に対しても安定
でありかつ大分類も容易に処理手法に組み込めるという
特徴をもつ図形処理方式を提供することを目的としてい
る。In order to solve these drawbacks, the present invention converts a binarized input figure into a multivalued gray image that reflects its geometric and topological characteristics, and then performs identification using an image processing method. This is an attempt to change the way of thinking, and in order to ignore the influence of the line width of the input character shape, it performs core line (thinning) and then thickens it, which may cause the influence of whiskers etc. in some cases. It is an object of the present invention to provide a figure processing method that is stable even against handwritten deformation of input figures by creating a multivalued grayscale image in a form that can suppress the image processing, and is characterized in that large classification can be easily incorporated into the processing method. It is said that

本発明の場合、入力図形に前処理を施してからその構造
を反映した多値画像に変換するまでの図形処理方式Ａと
、画像に変換された入力図形を画像処理的手法に基き図
形識別を行なうまでの図形処理方式Ｂとの二段構成をも
っている。In the case of the present invention, a figure processing method A is used that preprocesses an input figure and converts it into a multivalued image that reflects its structure, and figure identification is performed on the input figure converted into an image based on an image processing method. It has a two-stage structure with graphic processing method B up to the point where it is executed.

本発明が対象とする図形は、先述したように、構造をも
ち多種カテゴリーから構成される線図形の集合一般であ
る。As mentioned above, the objects of the present invention are general collections of line figures that have a structure and are composed of various categories.

ここで具体的には手書き文字とくに手書き漢字を念頭に
置くが、上記のように広く適用されることは言うまでも
ない。Here, we specifically consider handwritten characters, especially handwritten kanji, but it goes without saying that it can be widely applied as described above.

以下（Ａ）（Ｂ）の順に説ツ明する。Explain in the following order (A) and (B). I will clarify.

処理（Ａ）入力図形に対して、ノイズ除法、大きさ・位置の正規化
、２値化の前処理を施すと、第１図のような２値図形が
得られる。Processing (A) When the input figure is subjected to preprocessing such as noise division, normalization of size and position, and binarization, a binary figure as shown in FIG. 1 is obtained.

次いでこの２値図形に細線化処理を施し線幅を「ｌ」に
規格化して、心線部分にラベル０を付けると第２図のよ
うになる。Next, this binary figure is subjected to line thinning processing, the line width is normalized to "l", and a label 0 is attached to the center line portion, resulting in a figure as shown in FIG.

得られた心線（ラベル０の領域）を核として太め処理を
施す。The obtained core wire (area labeled 0) is used as a core and thickened.

太め処理をｏｎｅ５ｔｅｐ実行する（線幅をｒｌＪず
つ太らす）毎に、得られた領域に１゜２．３．・・・と
順次ラベルを付けていく。Each time the thickening process is executed for 1 5 steps (thickening the line width by rlJ), the resulting area is expanded by 1°2.3. . . . and sequentially label them.

その様子を第３図に示す。The situation is shown in Figure 3.

図形背景である白地部分に全てラベルが付与された時点
で太め処理及びラベリング作業は完了する（第４図参照
）。The thickening process and labeling work are completed when labels have been applied to all the white parts that are the figure backgrounds (see FIG. 4).

ラベル０，１゜２．３．・・・で埋まった画面において
、ラベル値にの関数ｆ（ｋ）（ｆは単調非減少関数１例
・ｆ（ｋ）＝ｋ。Label 0,1゜2.3. In a screen filled with ..., a function f(k) for the label value (f is a monotonically non-decreasing function, for example, f(k) = k.

ｒ（ｋ）＝ｋ、ｆ（ｋ）＝ｅｘｌ迫のを濃淡
レベルとしてその点（画素）に付与することにより、も
との入力図形は多値濃淡画像に変換される。By assigning r(k)=k, f(k)=exl to that point (pixel) as a gradation level, the original input figure is converted into a multivalued gradation image.

なお関数形をｆ（ロ）−に以外に選ぶことは適当な重み
づけをすることに相当する点を注意しておく。Note that selecting a function form other than f(b)- corresponds to appropriate weighting.

以上説明した手続きによって得られた多値濃淡画像が入
力図形の構造を忠実１こ反映していることを先ず説明す
る。First, it will be explained that the multivalued gray image obtained by the procedure described above faithfully reflects the structure of the input figure.

第１１こ、細線化処理をこまって得られた心線部分（ラ
ベル０の領域）は入力図形の幾何学的特徴そのものであ
る。Eleventh, the core line portion (area labeled 0) obtained through the thinning process is the geometric feature of the input figure itself.

ここでの細線化処理はあくまで線幅を「ｌ」に規格化し
て入力図形が含む線の多少や図形全体としての複雑さ等
を定量化するための手段であり、線をミクロに追跡して
構造を抽出するための前処理ではないから「Ｖ→Ｙ」現
象等をこ神経質になる必要はないという点は注意される
べきである。The line thinning process here is just a means to standardize the line width to "l" and quantify the number of lines included in the input figure and the complexity of the figure as a whole. It should be noted that since this is not a preprocessing for extracting a structure, there is no need to be concerned about the "V→Y" phenomenon.

第２に、太め処理を利用した図形背景部分のラベリング
は入力図形の位相的特徴を表現するための手段になって
いる。Second, labeling of the graphic background using thickening processing is a means for expressing the topological features of the input graphic.

というのは、白地部分の各点に付与されたラベル値は、
その白点に一番近い心線部分からの距離になっているか
ら、心線部分に対する位置関係すなわち位相的特徴を反
映しているからである。This is because the label value given to each point on the white background is
This is because the distance is from the core portion closest to the white point, and reflects the positional relationship with respect to the core wire portion, that is, the topological characteristics.

第３に、最終的にこのラベル値の関数ｆを濃淡レベルと
し多値濃淡画像に変換すること１こより、もとの入力図
形がもつ幾何学的及び位相的特徴が同時に画像表現の中
に吸収される。Third, by finally converting the label value function f into a multi-valued grayscale image, the geometric and topological features of the original input figure are simultaneously absorbed into the image representation. be done.

この画像表現は１例えて言えば、心線部分を尾根とする
山脈を眺めるという王台に入力図形を３次元的に表現し
ていることに相当する。For example, this image representation corresponds to a three-dimensional representation of the input figure on a platform that looks at a mountain range with the central line as a ridge.

以上より明らかなように、入力図形がもつ構造を濃淡レ
ベルに還元した点に本発明の画期的な発想転換がある。As is clear from the above, there is an epoch-making change in the idea of the present invention in that the structure of the input figure is reduced to shading levels.

更に、この画像表現は重要な作用として手書き変形に対
して極めて安定であるという特長をもつ。Furthermore, an important feature of this image representation is that it is extremely stable against handwritten deformation.

何故ならば、成る画素に注目すると、２値表現の場合は
手書き変形（こよリレベルが０から１或いは１から０へ
というＦ有」「無」の変化を生ずるが、この多値表現の
場合には手書き変形が生じても線部分１こ近い濃淡レベ
ル間でレベル値変化を生ずるにとどまるからである。This is because, if we pay attention to the pixels that consist of pixels, in the case of binary representation, handwritten deformation (the level changes from 0 to 1 or from 1 to 0, with or without F), but in the case of multivalued representation, This is because, even if handwritten deformation occurs, the level value changes only between the light and shade levels close to one line portion.

処理Ｂ上記図形処理方式Ａに示した変換を入力図形に施して得
られる多値濃淡画像をまず大分類する。Processing B: First, the multivalued grayscale image obtained by applying the transformation shown in the above graphic processing method A to the input graphic is roughly classified.

ここで大分類に用いる特徴量は２つある。Here, there are two feature quantities used for major classification.

ともにラベリングが完了した時点での図形を利用する。Both use the shapes at the time labeling is completed.

１つは、ラベルに領域に属する画素数をＰｋと定義し、
Ｐｋの絶対数とｋによる変化パターンである。One is to define the number of pixels belonging to the region in the label as Pk,
This is a change pattern depending on the absolute number of Pk and k.

もう１つは、さらにラベル０，１，２．・・・ｋ領域に
属する画素総数をＡｋ（：ΣＰｉ）と定義ｉ＝１しておいて入力図形の複雑さを評価する量Ｐｋ２／Ａｋ
（ｋ＝０．１．２、・・・）（この量は（周長）２
／（面積）に相当する）である。The other one is further labeled 0, 1, 2 . ...The total number of pixels belonging to the k area is defined as Ak (:ΣPi), and the quantity Pk2/Ak is used to evaluate the complexity of the input figure.
(k=0.1.2,...) (This amount is (perimeter) 2
/(area)).

続いて大分類以降の識別処理即ち後述する如き識別処理
１こ移行する。Subsequently, the process moves to identification processing after major classification, that is, identification processing 1 as described later.

まずは多値画像表現（ＮＸＮ画素とする）から第１，２
．・・・、Ｎ行の順に各画素の濃淡レベル値を読み込ん
でＮ２次元ベクトルと読み換える。First, let's start with the first and second multivalued image representations (NXN pixels).
．． ..., the gray level values of each pixel are read in the order of N rows and read as N two-dimensional vectors.

各図形は画像表現での濃淡レベルを成分とするＮ２次元
のベクトル表現を得る。Each figure obtains an N2-dimensional vector representation whose components are gray levels in image representation.

成る１つの図形カテゴリーに属する様々なＮ２次元ベク
トルの標本集合からＫａｒｈｕｎｅｎＬｏｅｖｅ変換
（以下ＫＬ変換と略記）ｆこよって大きい固有値に属す
る固有ベクトルを求めると、この固有値、固有ベクトル
は注目する図形カテゴリーを特徴づける。From a sample set of various N2-dimensional vectors belonging to one figure category consisting of .

同一カテゴリー内１こおける様々な変形とくに手書き変
形を一種の雑音と見なした点が本質的である。The essential point is that various deformations within the same category, especially handwritten deformations, are regarded as a kind of noise.

認識対象とするすべての図形カテゴリーについて各々の
固有値、固有ベクトルを予め求めておいて、カテゴリー
未知の入力Ｎ２次元ベクトルが与えられると、どの固有
ベクトルを主成分として含むかを評価して、その最も多
く含まれた固有ベクトルが代表する図形カテゴリーに同
定して識別を完了する。The eigenvalues and eigenvectors of all figure categories to be recognized are calculated in advance, and when an input N2-dimensional vector of unknown category is given, which eigenvectors are included as principal components is evaluated, and the eigenvectors that contain the most The identification is completed by identifying the figure category represented by the eigenvector.

以上より明らかなように１本質的にはテンプレート・マ
ツチングの手法に帰着させるために、同一カテゴリー内
での様々な変形がそのカテゴリー毎に特有な振舞いを示
すという点に注目して、各カテゴリーの特徴をＫＬ変換
によって固有ベクトルの形で抽出するという手段を取っ
ている。As is clear from the above, in order to essentially reduce the method to template matching, we focus on the fact that various transformations within the same category exhibit unique behavior for each category. The method is to extract features in the form of eigenvectors using KL transformation.

第５図は本発明の図形処理方式の一実施例構成を示す。FIG. 5 shows the configuration of an embodiment of the graphic processing method of the present invention.

図中の符号１０は線図形入力部であって例えばフライン
グ・スポット・スキャナによる図形走査を行なうもの、
１１は前処理部であって雑音除去や正規化や２値化など
を行なうもの、１２は細線化処理部であって第２図に示
す如く線幅Ｉｌｌの細線化を行なうもの、１３は太め処
理兼ラベリング処理部であって第３図や第４図に示す如
き太め処理とラベリングを行なうもの、１４は大分類処
理部、１５は大分類用特徴量の辞書メモリ、１６は多値
化処理部、１Ｔは識別処理部、１８は辞書パターン・メ
モリを表わしている。Reference numeral 10 in the figure is a line figure input unit, which performs figure scanning using, for example, a flying spot scanner;
11 is a preprocessing unit that performs noise removal, normalization, and binarization, 12 is a thinning processing unit that thins the line width Ill as shown in FIG. 2, and 13 is a thicker line. A processing and labeling processing unit that performs thick processing and labeling as shown in Figs. 3 and 4; 14 is a large classification processing unit; 15 is a dictionary memory for large classification features; 16 is a multi-value processing 1T represents an identification processing section, and 18 represents a dictionary pattern memory.

処理部１０１１をへて周知の如く、前処理を行なわれた
第１図図示の如き２値図形が得られる。After passing through the processing section 1011, a binary figure as shown in FIG. 1 is obtained which has undergone preprocessing as is well known.

該２個図形は、細線化処理部１２によって第２図図示の
如く線幅「ｌ」の図形に変換される。The two figures are converted by the thinning processing unit 12 into a figure with a line width of "l" as shown in FIG.

太め処理兼ラベリング処理部１３は、第２図図示の図形
に順次線幅「ｌ」に相当する幅づつ太め処理を行ない、
第３図および第４図図示の如く順次ラベルを付与してゆ
く。The thickening processing and labeling processing unit 13 sequentially performs thickening processing on the figure shown in FIG. 2 by a width corresponding to the line width "l",
Labels are sequentially applied as shown in FIGS. 3 and 4.

大分類処理部１４は、上記（処理Ｂ」に示した如く、特
微量即ち（ｉ）Ｐｓｃの絶対数とｋによる変化パターン
、または（ｉｌ）評価量Ｐｋ２／Ａｋにもとづいて、入
力図形の大分類を行なう。As shown in "Processing B" above, the major classification processing unit 14 determines the size of the input figure based on the characteristic quantity, that is, (i) the change pattern depending on the absolute number of Psc and k, or (il) the evaluation quantity Pk2/Ak. Perform classification.

即ち、メモリー５上に各認識対象図形１こ対応した上記
（ｉ）または（１１）の情報が格納されている。That is, the information (i) or (11) above corresponding to each figure to be recognized is stored in the memory 5.

そして、認識すべき入力図形ｆこついて上ｇ山）または
（１１）の特徴量を抽出し、該特徴量とメモリー５から
順次読出される特徴量と照合し、入力図形が属する候補
図形を抽出する。Then, extract the feature quantity of the input figure to be recognized (f) or (11), compare the feature quantity with the feature quantity sequentially read from the memory 5, and extract the candidate figure to which the input figure belongs. do.

即ち大分類する。多値化処理部１６は、上記「処理Ｂ」
に示したＮ２次元ベクトルを得る処理を行なう。In other words, it is roughly classified. The multivalue processing unit 16 performs the above-mentioned "processing B"
The process of obtaining the N2-dimensional vector shown in is performed.

そして識別処理部１７は、上記大分類されて候補として
抽出された図形に対応してメモリ１８から読出されてく
る辞書パターンと、上記多値化処理部１６からの出力と
照合される。Then, the identification processing section 17 compares the output from the multi-value processing section 16 with the dictionary pattern read out from the memory 18 corresponding to the figures that have been roughly classified and extracted as candidates.

なお、第５図の構成部１３．１４，１５に関する技術的
態様について以下に補足する。Note that technical aspects regarding the components 13, 14, and 15 in FIG. 5 will be supplemented below.

■ 太め処理兼ラベリング処理部１３ｆこ関して。■ Regarding the thick processing and labeling processing section 13f.

距離変換については１例えばＡ、Ｒｏｓｅｎｆｅｌｄ
。For distance transformations, 1 e.g. A, Rosenfeld
.

Ａ、Ｃ，Ｋａｋ：ＤｉｇｉｔａｌＰｉｃｔｕｒｅＰｒ
ｏｃｅｓｓｉｎｇ”、ｌ）ｐ、３５２−３５７゜Ａｃａ
ｄｅｍｉｃＰｒｅｓｓ、１９７６に解説がある。A, C, Kak: Digital Picture Pr
ocessing”, l)p, 352-357゜Aca
There is an explanation in demic Press, 1976.

ここでは、メツシュ表現での２画素（ｉ、Ｄ。Here, two pixels (i, D.

（ｋ、Ａ）間の距離としてｄ（（ｉｔＪ）（ｋｔｌ））＝ｒｎａｘ（
Ｉｔｋ’ｌ。As the distance between (k, A), d((it J) (k t l)) = rnax(
It k'l.

１ｊ−ｌ＋）を用いた場合の距離変換（太め兼ラベリン
グ処理）について実施例を示す。An example will be shown regarding distance conversion (thick labeling processing) when using 1j-l+).

まず。２値の原パターン（サイズＮＸＮメツシュ）を次
のｈ（ｉ、ｊ）に変換する。first. The binary original pattern (size NXN mesh) is converted into the following h(i, j).

０：文字心線部 ”（°・ｊ）−や、つありイ但し１Ｍは距離変換値の上限上記のパターンｈ（ｉ、ｊ）Ｆこ対し、２×２メツシユ
の論理マスクを用いて、（ｉ）左上隅から右下隅に向っ
て右方向走査するマスク走査（以下＼走査という）と、
（ｉｉ）右下隅から左上隅１こ向って左方向に走査す
るマスク走査（以下＼走査という）と、（ｉｉｉ）左
下隅から右上隅に向って右方向走査するマスク走査（以
下／走査という）と、ＧＶＩ右上隅から左下隅に向って
左方向に走査するマスク走査（以下ｌ走査という）を順
次行って距離変換を完了する。0: Character center line part" (°・j) (i) Mask scanning (hereinafter referred to as \scanning) that scans in the right direction from the upper left corner to the lower right corner,
(ii) Mask scanning that scans from the lower right corner to the upper left corner to the left (hereinafter referred to as \scan); (iii) Mask scanning that scans from the lower left corner to the upper right corner in the right direction (hereinafter referred to as /scan) Then, mask scans (hereinafter referred to as l-scans) that scan leftward from the upper right corner of the GVI toward the lower left corner are sequentially performed to complete the distance conversion.

各１走査、＼走査、／走査、Ｚ走査によって得られるパ
ターンを順次ｇ１ｔｇ２．ｇａ、ｇ＜とする。The patterns obtained by each one scan, \scan, /scan, and Z scan are sequentially g1t g2. Let ga, g<.

ｇ＋（ｉｔｊ）が最終的に得られる距離変換場である。g+(itj) is the distance transformation field finally obtained.

各走査方向についての変換内容を以下に示す。The conversion contents for each scanning direction are shown below.

（１）＼走査 ■ 大分類用特徴量の辞書メモリ一部１５に関して。(1)\scan ■ Concerning part 15 of the dictionary memory for major classification features.

認識対象図形である各カテゴリーθについて。For each category θ that is a figure to be recognized.

そのカテゴリーの平均パターンに太め兼ラベリング処理
までを施した図形から、各ラベルｋｆごつき、画素数Ｐｋ（θ）およびＡｋ（θ）＝ Σ ｐＨ（０）
１＝０を算出して、２つの特微量ベクトル（とも（こＭ次元）を生成してメモリーに格納しておく。From the figure obtained by thickening and labeling the average pattern of that category, each label kf roughness, number of pixels Pk (θ) and Ak (θ) = Σ pH (0)
Calculate 1=0, generate two feature vectors (with M dimensions), and store them in memory.

■ 大分類処理部１４に関して。■ Regarding the major classification processing section 14.

入力線図形１こ太め兼ラベリング処理までを施した図形
から、各ラベルｋにつき。For each label k from the input line figure that has been made thicker and has been subjected to labeling processing.

を算出して。Calculate.

次元）２つの特微量ベクトル（ともにＭを生成する。dimension) Two feature vectors (both M generate.

犬鋒類用特微量メモリ一部１５より、順次カテゴリーθ
昏こ関する特徴’ｆｉＰ（４、Ｌ（θ）を呼び出し１例
えば、２ベクトルａ、ｂ間の類似度を用いて。Category θ is sequentially selected from part 15 of the special amount memory for dogs.
Call the coma-related feature 'fiP(4, L(θ))1, for example, using the similarity between two vectors a and b.

ψ２但し、ψ１、ψ２は閾イ直を満たすカテゴリーθを候補図形として選出する。ψ2 However, ψ1 and ψ2 are equal to the threshold A category θ that satisfies the following is selected as a candidate figure.

即ち大分類処理が行われる。以上説明したように１本発
明は、複雑な構造をもち多種カテゴリーを含む線図形群
を認識対象とする場合、従来の構造解析の立場での線解
析法では免れないｂｏｔｔｏｍｕｐ的性格、細線化処
理の抱える難点、及び変形とくに手書き変形に対する不
安定性を一挙に解決し、第１に、２値入力図形の幾何学
的及び位相的特徴を反映する多値画像表現を求めたもの
であり、第２１こ画像として扱うことによって大分類及
び識別を一貫した画像処理の流れの中に実現したもので
ある。That is, major classification processing is performed. As explained above, 1. When a line figure group having a complex structure and including various categories is to be recognized, the bottom-up character and thinning that are inevitable with the conventional line analysis method from the viewpoint of structural analysis are realized. This method solves the difficulties of processing and the instability with respect to deformation, especially handwritten deformation, at once.Firstly, it seeks a multivalued image representation that reflects the geometric and topological features of the binary input figure. By treating the images as 21 images, major classification and identification are realized in a consistent flow of image processing.

本発明は図形処理方式として汎用的性格を持つ点が大き
な利点であり、入力図形の構造は自づと多値画像表現の
中に反映され１画像表現の中にある構造に関する情報が
ＫＬ変換によって抽出され識別に用いられるという流れ
は認識対象に依存しない。The great advantage of the present invention is that it is versatile as a graphic processing method, and the structure of the input graphic is naturally reflected in the multivalued image representation, and the information regarding the structure in one image representation is processed by KL transformation. The flow of extraction and use for identification does not depend on the recognition target.

具体的な応用分野としては１手書き文字とくｆこ手書き
漢字の認識がある。Specific applications include recognition of handwritten characters and handwritten kanji.

その場合は、多様な手書き変形が画像表現１こおける雑
音としてＫＬ変換によって有効に吸収される。In that case, various handwritten deformations are effectively absorbed as noise in the image representation by the KL transformation.

[Brief explanation of drawings]

第１図は入力図形にノイズ除去１泣置・大きさの正規化
、２値化の前処理を施した結果を表わす図（入力図形と
しては手書き漢字の「太」を用いである）、第２図はこ
の２値図形１こ細線化処理を施し、得られた心線部分子
こラベル０を付けたところを示す図、第３図はこの心線
部分を出発点として太め処理を実行し、ｏｎｅ５ｔ
ｅｐ毎に得られる領域に順次１，２，３．・・・とラベ
ルを付けていく様子を示す図、第４図は枠内がすべてラ
ベルで埋まり、太め処理及びラベリング作業が完了した
時点を表わす図、第５図は本発明の一実施例構成を示す
。図中、１０は線図形入力部、１１は前処理部。１２は細線化処理部、１３は太め処理兼ラベリング処理
部、１４は大分類処理部、１５は大分類用特徴量の辞書
メモリ、１６は多値化処理部、１Ｔは識別処理部、１８
は辞書パターン・メモリを表わす。Figure 1 shows the results of preprocessing of noise removal, size normalization, and binarization on the input figure (the handwritten kanji ``tai'' is used as the input figure). Figure 2 shows the binary figure being thinned by 1 and the resulting core line is given a molecular label of 0, and Figure 3 is the result of thickening processing using this core line as a starting point. , one 5t
1, 2, 3, etc. in the area obtained for each ep. . . . Figure 4 is a diagram showing how the frame is completely filled with labels and the thickening process and labeling work have been completed. Figure 5 is a diagram showing the configuration of an embodiment of the present invention. shows. In the figure, 10 is a line figure input section, and 11 is a preprocessing section. 12 is a thinning processing section, 13 is a thickening processing and labeling processing section, 14 is a large classification processing section, 15 is a dictionary memory for large classification feature values, 16 is a multivalue processing section, 1T is an identification processing section, 18
represents a dictionary pattern memory.

Claims

[Scope of Claims] 1 In figure processing 1, an input figure subjected to preprocessing is converted into a core line with a line width of "1", and a center line part is extracted from which the influence of the line width of the input figure itself is eliminated. and a thinning processing unit that attaches a label O to the core wire portion. Using the label Q area obtained in the thinning processing section as a starting point, the width is
A thick processing/label processing section is installed which performs thickening processing in increments of 1" and labels the areas obtained for each thickening process sequentially as 1, 2, 3, etc., and receives an input figure. After filling a certain frame consisting of labels with labels, convert the original input figure into a multi-valued gray image by using a monotonically non-decreasing function f(k) for the label value as the gray level at each point, and perform the conversion. A graphic processing method characterized in that a recognition process is performed on a multivalued grayscale image based on the multivalued grayscale image. 2 The above recognition process uses the figures to which the above labels have been given to determine the number of pixels Pk belonging to each label area and Ak=-Σ
Using Pi, the change in Pk due to kf is roughly classified into two quantities: i=0 and/or (Pk)2/Ak, and then the Karhun method is applied to the multivalued image representation.
2. The graphic processing method according to claim 1, wherein eigenvectors belonging to large eigenvalues are obtained by en-Loive transformation, and the eigenvalues and eigenvectors for each graphic category are used to identify the figure.