JPH0821059B2

JPH0821059B2 - Unnecessary data removal method for character contour generation

Info

Publication number: JPH0821059B2
Application number: JP63168106A
Authority: JP
Inventors: 太郎 ▲高▼本
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1988-07-06
Filing date: 1988-07-06
Publication date: 1996-03-04
Anticipated expiration: 2011-03-04
Also published as: JPH0217588A

Description

【発明の詳細な説明】〔目次〕概要産業上の利用分野従来の技術発明が解決しようとする課題課題を解決するための手段作用実施例発明の効果〔概要〕文字認識を行うために必要な文字輪郭を生成する際
に、文字を囲む文字枠内の不要イメージデータを効率良
く消去する文字輪郭生成方式に関し、文字認識率を向上させると共に、処理効率を高めるこ
とを目的とし、文字像から得られるベクトルに外接する外接四辺形か
ら、所定の大きさを拡大した第１の文字枠と第１の文字
枠から更に所定の大きさを拡大した第２の文字枠を作成
し、第２の文字枠内のイメージデータから、文字輪郭情
報としての輪郭テーブルを生成する文字輪郭生成方式で
あって、第２の文字枠内に存在するイメージデータか
ら、輪郭テーブルを作成する輪郭テーブル作成手段と、
輪郭テーブルから領域が連なるイメージデータを選別
し、夫々独立した領域を持つイメージデータに異なる値
のラベルを付与した後、夫々のラベル値毎のイメージデ
ータの領域の最大と最小の座標値を輪郭テーブルから求
めてラベル領域テーブルを作成するラベル付け処理手段
と、ラベル領域テーブルを参照し、一画素の領域が連な
るイメージデータと、Ｎ×Ｎ画素以下の領域を持つイメ
ージデータと、第１の文字枠の占める領域内より外側に
存在する領域の大きいイメージデータとを選別し、輪郭
テーブルから選別されたイメージデータの情報を消去す
る切出し処理手段とを設けて構成する。DETAILED DESCRIPTION [Table of Contents] Outline Industrial field of application Conventional technology Problems to be solved by the invention Means for solving the problems Action Example Effect of the invention [Outline] Necessary for performing character recognition A character contour generation method that efficiently erases unnecessary image data in a character frame that surrounds a character when generating a character contour, and aims to improve the character recognition rate and the processing efficiency. From the circumscribed quadrilateral circumscribing the vector to be created, a first character frame with a predetermined size enlarged and a second character frame with a predetermined size further enlarged from the first character frame are created. A character contour generation method for generating a contour table as character contour information from image data in a frame, which is a contour table for creating a contour table from image data existing in a second character frame. Bull making method,
After selecting image data with continuous areas from the contour table and assigning labels with different values to the image data having independent areas, the maximum and minimum coordinate values of the area of the image data for each label value are calculated in the contour table. Labeling processing means for creating a label area table obtained from the image data, image data in which one pixel area is continuous with reference to the label area table, image data having an area of N × N pixels or less, and a first character frame And a cutout processing unit that selects image data having a large area existing outside the area occupied by the area and deletes the information of the selected image data from the contour table.

[Industrial applications]

本発明は文字認識を行うために必要な文字輪郭を生成
する際に、当該文字を囲む文字枠内に存在する不要イメ
ージデータを効率良く除去する文字輪郭生成時の不要デ
ータ除去方式に関する。The present invention relates to an unnecessary data removal method at the time of generating a character contour that efficiently removes unnecessary image data existing in a character frame surrounding the character when generating a character contour necessary for character recognition.

文字記入位置が任意である図面等に記載された文字を
認識するためには、前処理として当該文字を囲む文字枠
を設けることが行われている。これは、一つの文字の領
域を決定し、この文字領域内のイメージデータから文字
輪郭を生成して、文字を認識するためである。In order to recognize a character written in a drawing or the like where a character entry position is arbitrary, a character frame surrounding the character is provided as a preprocessing. This is because the area of one character is determined, the character contour is generated from the image data in this character area, and the character is recognized.

ところで、この文字枠内には図面等の地汚れや読取装
置等から発生するノイズに基づくイメージデータや、他
の文字の一部がイメージデータとして侵入することがあ
り、このような不要イメージデータが文字枠内に存在し
ている場合、文字枠内のイメージデータから文字輪郭を
生成する時に、読取った文字とは異なった文字輪郭を生
成し、文字認識を誤る原因になるため、この不要イメー
ジデータを効率良く消去し、文字認識率を高めると共に
処理効率を向上させる必要がある。By the way, in this character frame, image data due to background stains such as drawings or noise generated by a reading device, or part of other characters may intrude as image data. If it exists in the character frame, when the character contour is generated from the image data in the character frame, a character contour different from the read character is generated, which may cause a character recognition error. It is necessary to delete the data efficiently to improve the character recognition rate and the processing efficiency.

[Conventional technology]

第９図は従来の技術を説明する図である。 FIG. 9 is a diagram illustrating a conventional technique.

第９図（ａ）に示す如く、文字枠としては、文字イメ
ージデータ50のベクトル化によって得られたベクトル51
に外接する外接四辺形52が考えられる。しかし、この外
接四辺形52からなる文字枠内のイメージデータのみを取
り出したのでは、第９図（ａ）から判明するように、当
該文字枠外にある文字イメージデータ50の一部が削られ
てしまう。As shown in FIG. 9 (a), the character frame is a vector 51 obtained by vectorizing the character image data 50.
A circumscribed quadrilateral 52 circumscribing to is possible. However, if only the image data in the character frame consisting of the circumscribed quadrangle 52 is taken out, as can be seen from FIG. 9 (a), a part of the character image data 50 outside the character frame is deleted. I will end up.

このため、ベクトル51の外接四辺形52を適量拡げた範
囲を文字認識対象領域とすることが行われている。例え
ば、ａ＝（文字イメージ線太さ＋α）×1/2 で計算されるａだけ外方向に拡げた第９図（ｂ）の53に
示す如き仮想文字枠を生成する。ここで、αは実験的に
定めるものである。そして、このようにして定めた仮想
文字枠53の範囲内のイメージデータから、文字輪郭を示
す輪郭テーブルを作成するが、この時第９図（ｂ）の斜
線を施した部分に示す如く、仮想文字枠53の中に、図面
の地汚れや読取装置等から発生するノイズに基づくイメ
ージデータや隣接する文字のイメージデータの一部が侵
入することがある。Therefore, a range in which the circumscribed quadrangle 52 of the vector 51 is expanded by an appropriate amount is set as a character recognition target area. For example, a virtual character frame as shown at 53 in FIG. 9 (b), which is expanded outward by a calculated by a = (character image line thickness + α) × 1/2, is generated. Here, α is experimentally determined. Then, a contour table showing the character contour is created from the image data within the range of the virtual character frame 53 determined in this way. At this time, as shown in the shaded portion of FIG. In the character frame 53, image data based on background stains on the drawing or noise generated by a reading device or the like, or part of image data of adjacent characters may enter.

輪郭テーブルを作成する場合、例えば第９図（ｃ）に
示す如く、仮想文字枠53のＸ方向の座標が１〜16であ
り、Ｙ方向の座標が１〜13であるとし、仮想文字枠53内
のイメージデータ，，をＹ方向に走査して、白画
素から黒画素に変化した点の座標値と、黒画素から白画
素に変化した点の座標値を求めると、第９図（ｄ）に示
す如き輪郭テーブルが求められる。When creating the contour table, for example, as shown in FIG. 9C, it is assumed that the virtual character frame 53 has coordinates 1 to 16 in the X direction and 1 to 13 in the Y direction. The image data in the inside are scanned in the Y direction, and the coordinate value of the point changed from the white pixel to the black pixel and the coordinate value of the point changed from the black pixel to the white pixel are obtained. A contour table as shown in is requested.

即ち、例えば、Ｘ座標が１の場合、Ｙ方向に走査する
と、画素の変化点が無く、Ｘ座標が２の場合、Ｙ方向に
走査すると、イメージデータの白画素から黒画素に変
化する点のＹ座標値は８であり、黒画素から白画素に変
化する点のＹ座標値は９である。同様に、Ｘ座標が５の
場合、Ｙ方向に走査すると、イメージデータの白画素
から黒画素に変化する点のＹ座標値は４であり、黒画素
から白画素に変化する点のＹ座標値は６であり、次のイ
メージデータの白画素から黒画素に変化する点のＹ座
標値は８であり、黒画素から白画素に変化する点のＹ座
標値は９である。That is, for example, when the X coordinate is 1, there is no pixel change point when scanned in the Y direction, and when the X coordinate is 2, when there is a point where the white pixel of the image data changes to a black pixel when scanned in the Y direction. The Y coordinate value is 8, and the Y coordinate value of the point where the black pixel changes to the white pixel is 9. Similarly, when the X coordinate is 5, when scanning in the Y direction, the Y coordinate value of the point of changing the white pixel to the black pixel of the image data is 4, and the Y coordinate value of the point of changing the black pixel to the white pixel. Is 6, the Y coordinate value of the point of changing the white pixel to the black pixel of the next image data is 8, and the Y coordinate value of the point of changing the black pixel to the white pixel is 9.

又、仮想文字枠53内のイメージデータをＸ方向に走査
して、白画素から黒画素に変化した点の座標値と、黒画
素から白画素に変化した点の座標値を求めると、第９図
（ｅ）に示す如き輪郭テーブルが求められる。Further, the image data in the virtual character frame 53 is scanned in the X direction, and the coordinate value of the point where the white pixel is changed to the black pixel and the coordinate value of the point where the black pixel is changed to the white pixel are obtained. The contour table as shown in FIG.

即ち、例えば、Ｙ座標が３の場合、Ｘ方向に走査する
と、イメージデータの白画素から黒画素に変化する点
のＸ座標値は14であり、黒画素から白画素に変化する点
のＸ座標値は15である。That is, for example, when the Y coordinate is 3, the X coordinate value of the point of changing the white pixel to the black pixel of the image data when scanning in the X direction is 14, and the X coordinate of the point of changing the black pixel to the white pixel. The value is 15.

同様に、Ｙ座標が４の場合、Ｘ方向に走査すると、イ
メージデータの白画素から黒画素に変化する点のＸ座
標値は５であり、黒画素から白画素に変化する点のＸ座
標値は７であり、再び白画素から黒画素に変化する点の
Ｘ座標値は８であり、黒画素から白画素に変化する点の
Ｘ座標値は12である。Similarly, when the Y coordinate is 4, the X coordinate value of the point at which the white pixel changes to the black pixel of the image data when scanning in the X direction is 5, and the X coordinate value of the point at which the black pixel changes to the white pixel. Is 7, the X coordinate value of the point at which the white pixel changes to the black pixel again is 8, and the X coordinate value of the point at which the black pixel changes to the white pixel is 12.

そして、イメージデータの白画素から黒画素に変化
する点のＸ座標値は14であり、黒画素から白画素に変化
する点のＸ座標値は16である。Then, the X coordinate value of the point of changing the white pixel to the black pixel of the image data is 14, and the X coordinate value of the point of changing the black pixel to the white pixel is 16.

このように、第９図（ｃ）の認識対象文字のイメージ
データの外に、ノイズによるイメージデータや、他
の文字のイメージデータが混入した状態で輪郭テーブ
ルを作成すると、第９図（ｄ）の及びに示す座標値
と第９図（ｅ）の及びに示す座標値の如く、認識対
象文字輪郭にとっては不要な座標値が混入する。従っ
て、この輪郭テーブルに基づき、文字輪郭を生成して文
字認識を行うと、文字認識が出来ないか、文字認識を誤
ることとなる。In this way, when the contour table is created in a state where image data due to noise and image data of other characters are mixed in addition to the image data of the recognition target character of FIG. 9C, when the contour table is created, FIG. Unnecessary coordinate values are mixed in for the recognition target character contour, such as the coordinate values shown in and of and the coordinate values shown in and of FIG. 9 (e). Therefore, if the character contour is generated and the character recognition is performed based on the contour table, the character recognition cannot be performed or the character recognition is erroneous.

このため、ノイズを除去する手段として、図面等を読
取る読取装置にマスク処理装置を付加して、ノイズに基
づくイメージデータを除去するようにしている。For this reason, as a means for removing noise, a mask processing device is added to a reading device for reading a drawing or the like to remove image data based on noise.

第10図はノイズ除去を説明する図である。 FIG. 10 is a diagram for explaining noise removal.

マスク処理装置は、例えば３×３画素以下のイメージ
データは除去するが、これ以上の大きさのイメージデー
タは除去しない。即ち、第10図（ａ）の枠の大きさが３
×３画素であるとすると、斜線で示す如く、この枠内に
入る大きさのイメージデータは消去されるが、第10図
（ｂ）（ｃ）の斜線部分の如く、この枠外にイメージデ
ータが出ていれば、第10図（ｃ）の如く、実際には書く
ことが出来ない幅が一画素で長さが４画素の如き細長い
イメージデータも除去することが出来ない。The mask processing device removes, for example, image data of 3 × 3 pixels or less, but does not remove image data of a size larger than this. That is, the size of the frame in FIG. 10 (a) is 3
Assuming that the number of pixels is × 3 pixels, the image data of a size that falls within this frame is erased as indicated by the diagonal lines, but the image data is outside this frame as indicated by the hatched portions in FIGS. 10 (b) and (c). If it appears, it is impossible to remove the slender image data such as one pixel in width and four pixels in length which cannot be actually written as shown in FIG. 10 (c).

[Problems to be Solved by the Invention]

上記の如く、従来は仮想文字枠内にノイズに基づく細
長いイメージデータや複数画素が集合したイメージデー
タが混入し、更に他の文字のイメージデータが混入した
りすると、不要なイメージデータの座標値が輪郭テーブ
ル内に混入するため、このような文字輪郭テーブルに基
づいて、文字認識を行った場合、文字認識率が低下する
という問題がある。As described above, conventionally, when slender image data based on noise or image data in which a plurality of pixels are gathered is mixed in a virtual character frame, and image data of another character is mixed, the coordinate value of unnecessary image data is changed. Since the characters are mixed in the contour table, when the character recognition is performed based on such a character contour table, there is a problem that the character recognition rate decreases.

本発明はこのような問題点に鑑み、文字輪郭テーブル
に混入した不要な座標値を効率良く除去して、文字認識
率を向上させると共に、処理効率を高めることを目的と
している。In view of such a problem, the present invention aims to efficiently remove unnecessary coordinate values mixed in the character contour table, improve the character recognition rate, and improve the processing efficiency.

[Means for solving the problem]

第１図は本発明の原理を説明する図である。 FIG. 1 is a diagram for explaining the principle of the present invention.

輪郭テーブル作成手段31には２値イメージデータと、
文字イメージデータから得られるベクトルの座標値が入
力され、輪郭テーブル作成手段31は２値イメージデータ
とベクトルの座標に基づき、文字のベクトルに外接する
外接四辺形を作成し、この外接四辺形より所定の大きさ
だけ拡大した第１の文字枠を作成し、更にこの第１の文
字枠より所定の大きさを拡大した第２の文字枠を作成
し、この第２の文字枠内のイメージデータから、第９図
（ｄ）又は第９図（ｅ）に示す如き輪郭テーブルを作成
する。The contour table creating means 31 stores binary image data,
The coordinate value of the vector obtained from the character image data is input, and the contour table creating means 31 creates a circumscribed quadrilateral circumscribing the character vector based on the binary image data and the coordinates of the vector, and the circumscribed quadrilateral is predetermined from the circumscribed quadrilateral. Create a first character frame that is enlarged by the size of, and then create a second character frame that is enlarged by a predetermined size from this first character frame. From the image data in this second character frame, , A contour table as shown in FIG. 9 (d) or FIG. 9 (e) is created.

この該輪郭テーブル作成手段31が作成した輪郭テーブ
ルから、ラベル付け処理手段32は領域が連なるイメージ
データを選別し、夫々独立した領域を持つイメージデー
タに異なる値のラベルを付与した後、夫々のラベル値毎
のイメージデータの領域の最大と最小の座標値を該輪郭
テーブルから求めてラベル領域テーブルを作成する。From the contour table created by the contour table creating means 31, the labeling processing means 32 selects image data in which regions are continuous, assigns different value labels to the image data having independent regions, and then labels each image data. A label area table is created by obtaining the maximum and minimum coordinate values of the area of the image data for each value from the contour table.

切出し処理手段33は、該ラベル付け処理手段32が作成
したラベル領域テーブルを参照し、各ラベル値毎のイメ
ージデータの領域が、一画素の領域が連なるような細長
いイメージデータと、Ｎ×Ｎ画素以下の領域を持つイメ
ージデータと、前記第１の文字枠の占める領域内より外
側に存在する領域の大きいイメージデータとを選別し、
該選別されたイメージデータに付与されたラベル値に基
づき、前記輪郭テーブルから該選別されたイメージデー
タの情報を消去する。The cut-out processing means 33 refers to the label area table created by the labeling processing means 32, and the area of the image data for each label value is elongated image data in which one pixel area is continuous and N × N pixels. Image data having the following areas and image data having a large area existing outside the area occupied by the first character frame are selected,
Information on the selected image data is deleted from the contour table based on the label value given to the selected image data.

[Action]

上記の如く構成することにより、輪郭テーブル作成手
段31は、文字のベクトルに外接する外接四辺形から、夫
々所定の大きさを拡大した第１と第２の文字枠を作成
し、この第２の文字枠内のイメージデータから、従来作
成していた２種類の輪郭テーブルの中の１種類の輪郭テ
ーブルを作成する。With the above-described configuration, the contour table creating means 31 creates the first and second character frames each having a predetermined size enlarged from the circumscribed quadrangle circumscribing the character vector, and the second character frame is created. From the image data in the character frame, one type of contour table out of the two types of contour tables conventionally created is created.

そして、ラベル付け処理手段32は、この輪郭テーブル
から夫々独立した領域を備えるイメージデータを選別
し、夫々にラベル値を付与した後、夫々のラベル値毎の
イメージデータ領域の最大と最小の座標値を求め、ラベ
ル領域テーブルを作成する。Then, the labeling processing means 32 selects image data having independent areas from the contour table, assigns a label value to each, and then sets the maximum and minimum coordinate values of the image data area for each label value. And create a label area table.

切出し処理手段33はこのラベル領域テーブルから、各
イメージデータの占有する領域が、一画素の連続する細
長いものと、Ｎ×Ｎ画素以下のものと、第１の文字枠の
占める領域より外に出ている領域の大きいイメージデー
タを選別する。From the label area table, the cut-out processing means 33 outputs the area occupied by each image data to the outside of the area occupied by the first character frame, that is, a continuous elongated one pixel, N × N pixels or less. Image data with large area is selected.

そして、この選別したイメージデータは不要イメージ
データと判定して、輪郭テーブルから削除するため、文
字輪郭を生成するのに必要なイメージデータのみを輪郭
テーブルに残すことが可能となる。Then, the selected image data is determined as unnecessary image data and deleted from the contour table, so that only the image data necessary for generating the character contour can be left in the contour table.

従って、この輪郭テーブルにより文字認識を行えば、
文字認識を誤らせる不要イメージデータが無いため、文
字認識率が向上すると共に、処理が容易であるため処理
効率を高めることが出来る。Therefore, if character recognition is performed using this contour table,
Since there is no unnecessary image data that causes character recognition error, the character recognition rate is improved and the processing efficiency is improved because the processing is easy.

〔Example〕

第２図は本発明の一実施例を説明するブロック図で、
第３図は第２図の動作を説明するフローチャートで、第
４図は仮想文字枠を説明する図で、第５図は輪郭テーブ
ルの一例を示す図で、第６図はラベルテーブルの一例を
示す図で、第７図はラベル領域テーブルの一例を示す図
で、第８図はラベル選別テーブルの一例を示す図であ
る。FIG. 2 is a block diagram illustrating an embodiment of the present invention.
FIG. 3 is a flow chart for explaining the operation of FIG. 2, FIG. 4 is a diagram for explaining a virtual character frame, FIG. 5 is a diagram showing an example of a contour table, and FIG. 6 is an example of a label table. FIG. 7 is a diagram showing an example of the label area table, and FIG. 8 is a diagram showing an example of the label selection table.

輪郭テーブル作成部34に端子Ａを経て、第９図（ａ）
の文字イメージデータ50に示す如き２値イメージデータ
と、該文字イメージデータの第９図（ａ）ベクトル51に
示す如きベクトルの座標値が入力する。輪郭テーブル作
成部34は、この２値イメージデータとベクトルの座標値
に基づき、第３図に示す如く、文字のベクトル外接四辺
形を作成する。FIG. 9 (a) via the terminal A to the contour table creating section 34.
The binary image data shown in the character image data 50 and the coordinate values of the vector shown in the vector 51 of FIG. 9 (a) of the character image data are input. The contour table creation unit 34 creates a vector circumscribed quadrilateral of the character as shown in FIG. 3 based on the binary image data and the coordinate values of the vector.

そして、第３図に示す如く、入力された文字イメージ
データの大きさ、向きを整える。即ち、文字イメージの
大きさ、向きを例えば所定の大きさの正立に整える。そ
して、第９図（ｂ）で説明したと同様にして、第４図に
示す如く、仮想文字枠53を作成する。そして、仮想文字
枠53から、例えば、更に2aだけ外方向に拡大した仮想文
字枠54を作成する。Then, as shown in FIG. 3, the size and orientation of the input character image data are adjusted. That is, the size and orientation of the character image are adjusted to erect up to a predetermined size, for example. Then, in the same manner as described with reference to FIG. 9B, the virtual character frame 53 is created as shown in FIG. Then, from the virtual character frame 53, for example, a virtual character frame 54 further expanded outward by 2a is created.

仮想文字枠54を作成したことにより、第９図（ｃ）の
で示した隣接文字のイメージデータは、第４図に示
す如く仮想文字枠54内に大きく侵入して来る。By creating the virtual character frame 54, the image data of the adjacent character shown by (c) in FIG. 9 largely invades the virtual character frame 54 as shown in FIG.

続いて、輪郭テーブル作成部34は、この仮想文字枠54
の内部に存在するイメージデータから、第９図（ｅ）に
示す輪郭テーブルと同様にして、第５図に示す如き輪郭
テーブルを作成し、輪郭テーブルメモリ37に格納する。
この場合、第９図（ｄ）に示す如き輪郭テーブルを作成
しても良いことは勿論である。Then, the contour table creating unit 34 uses the virtual character frame 54.
A contour table as shown in FIG. 5 is created from the image data existing inside the contour table in the same manner as the contour table shown in FIG.
In this case, of course, a contour table as shown in FIG. 9 (d) may be created.

本実施例では第５図に示す示く、横軸にＹをとる輪郭
テーブルを作成した場合に付いて説明する。In the present embodiment, description will be made regarding the case where a contour table having Y on the horizontal axis is created as shown in FIG.

ラベル付け処理部35は輪郭テーブルメモリ37に格納さ
れた第５図に示す輪郭テーブルから、第６図に示す如き
ラベルテーブルを作成して、ラベルテーブルメモリ38に
格納する。The labeling unit 35 creates a label table as shown in FIG. 6 from the contour table shown in FIG. 5 stored in the contour table memory 37 and stores it in the label table memory 38.

即ち、第３図に示す如く、ラベル値の初期値を１とし
てから、第５図の輪郭テーブルにおいて、第３図に示
す如く、Ｙ方向の座標値を最小値から順次＋１としてＸ
座標値を抽出する。本例では、Ｙ方向の座標値９におい
てＸ座標値の20と21が抽出される。この時、このＸ座標
値20と21を比較値とし、この座標値が抽出されたＹ方向
の座標値９において、第６図に示す如く、ラベル値１を
付与する。That is, as shown in FIG. 3, the initial value of the label value is set to 1, and then, in the contour table of FIG. 5, the coordinate values in the Y direction are sequentially incremented from the minimum value to +1 as shown in FIG.
Extract the coordinate values. In this example, X coordinate values 20 and 21 are extracted at the coordinate value 9 in the Y direction. At this time, the X coordinate values 20 and 21 are used as comparison values, and the label value 1 is given to the extracted coordinate value 9 in the Y direction as shown in FIG.

次に第３図に示す如く、Ｙ座標値±１のＸ座標値に
該比較値と重なる座標値を持つものがあるか調べる。即
ち、Ｙ方向の座標値10又は８においてＸ座標値の20〜21
に重なるものがあるか調べる。即ち、比較値20〜21と同
一数のものがあるか調べる。第５図の場合、Ｙ方向の座
標値８においては重なるものが存在しないが、Ｙ方向の
座標値10においてはＸ座標値に20と22があり、比較値20
及び21に重なる座標値であるため、このＹ方向の座標値
10におけるＸ座標値の20〜22の領域は、Ｙ方向の座標値
９におけるＸ座標値の20〜21のイメージデータと連なる
領域のイメージデータであると判定する。Next, as shown in FIG. 3, it is checked whether or not the X coordinate value of the Y coordinate value ± 1 has a coordinate value that overlaps with the comparison value. That is, when the coordinate value in the Y direction is 10 or 8, the X coordinate value is 20 to 21.
Check if there is any overlap. That is, it is checked whether there are the same numbers as the comparison values 20 to 21. In the case of FIG. 5, there is no overlap in the coordinate value 8 in the Y direction, but in the coordinate value 10 in the Y direction, there are 20 and 22 in the X coordinate values, and the comparison value 20
Since this is the coordinate value that overlaps with and 21, this coordinate value in the Y direction
The area of X coordinate value 20 to 22 in 10 is determined to be the image data of the area continuous with the image data of X coordinate value 20 to 21 in the Y direction coordinate value 9.

即ち、Ｙ方向の座標値９におけるＸ座標値の20及び21
の黒画素と、Ｙ方向の座標値10におけるＸ座標値が20及
び21の黒画素は、Ｙ方向の座標値９と10において隣接し
ているため、同一のイメージデータであると判定する。That is, 20 and 21 of the X coordinate value at the coordinate value 9 in the Y direction
And the black pixels whose X coordinate values are 20 and 21 in the Y-direction coordinate value 10 are adjacent to each other in the Y-direction coordinate values 9 and 10, and are therefore determined to be the same image data.

続いて、ラベル付け処理部35は、第３図において、
Ｙ座標値±１のＸ座標値に該比較値と重なる座標値を持
つものがあったため、この重なる座標値をもつものが、
既に抽出済の領域を除いて二つ以上あるか調べる。Subsequently, the labeling processing unit 35, in FIG.
Some of the X coordinate values of the Y coordinate value ± 1 had coordinate values that overlapped with the comparison value.
Examine whether there are two or more excluding the already extracted area.

即ち、第３図において抽出されたイメージデータ領
域に対して、隣接するイメージデータの領域が、既に抽
出済の領域を除いて二つ以上の分岐した領域に分かれて
存在するか否かを調べる。That is, with respect to the image data area extracted in FIG. 3, it is checked whether or not the area of the adjacent image data is divided into two or more branched areas except the already extracted area.

分岐した領域が存在しなければ、第３図に移行し、
分岐した領域が存在する場合、ラベル付け処理部35は第
３図で抽出されたＹ座標値と比較値とを内部のスタッ
クに格納し、第３図に示す如く、第３図において検
出された隣接するイメージデータは、Ｙ座標上で−１側
であるか調べる。If there is no branched region, move to Fig. 3,
If there is a branched region, the labeling unit 35 stores the Y coordinate value extracted in FIG. 3 and the comparison value in the internal stack, and as shown in FIG. 3, it is detected in FIG. It is checked whether the adjacent image data is on the -1 side on the Y coordinate.

−１側であれば、第３図に示す如く、Ｙ座標値から
１を減算して、第３図に示す如く、Ｙ座標値上のＸ座
標値に比較値と重なる座標値を持つものがあるか調べ
る。即ち、20〜21と重なるものがあるか調べ、重なるも
のがあれば、ラベル値１を付与するが、＋１側であれ
ば、第３図に示す如く、Ｙ座標値に１を加算して、第
３図に示す如く、Ｙ座標値上のＸ座標値に比較値と重
なる座標値を持つものを抽出する。即ち、第５図の場合
においてはＹ座標値10でＹ座標値９の比較値20〜21と重
なるものがあるため、Ｙ座標値10のＸ座標値20〜22を比
較値とすると共に、第６図に示す如く、Ｙ方向の座標値
10において、ラベル値１を付与する。On the -1 side, as shown in FIG. 3, 1 is subtracted from the Y coordinate value, and as shown in FIG. 3, the X coordinate value on the Y coordinate value has a coordinate value that overlaps the comparison value. Check if there is. That is, it is checked whether there is any overlap with 20 to 21, and if there is any overlap, the label value 1 is given, but if it is the +1 side, 1 is added to the Y coordinate value as shown in FIG. As shown in FIG. 3, an X coordinate value on the Y coordinate value having a coordinate value overlapping with the comparison value is extracted. That is, in the case of FIG. 5, the Y coordinate value 10 may overlap the comparison values 20 to 21 of the Y coordinate value 9. Therefore, the X coordinate values 20 to 22 of the Y coordinate value 10 are set as the comparison values and As shown in Fig. 6, coordinate values in the Y direction
At 10, a label value of 1 is given.

続いて、ラベル付け処理部35は第３図のルーチンに
戻り、Ｙ座標値±１のＸ座標値に該比較値と重なる座標
値を持つものがあるか調べる。即ち、Ｙ方向の座標値11
又は９におけるＸ座標値において、比較値の20〜22と重
なるものがあるか調べる。Subsequently, the labeling processor 35 returns to the routine of FIG. 3 and checks whether or not the X coordinate value of the Y coordinate value ± 1 has a coordinate value that overlaps the comparison value. That is, the coordinate value in the Y direction 11
Or, in the X coordinate value in 9, it is checked whether there is any overlap with the comparison value 20 to 22.

そして、重なるものがあれば、既に抽出済の領域を除
いて二つ以上あるか調べる。第５図の場合、Ｙ座標値９
において、Ｘ座標値20〜21は既に抽出済の領域であるた
め、これを除き、Ｙ座標値11においてはＸ座標値21〜22
が存在するため、前記同様に処理して第３図におい
て、この領域にラベル値１を付与する。Then, if there is an overlap, it is checked whether there are two or more excluding the already extracted area. In the case of FIG. 5, Y coordinate value 9
In the above, since the X coordinate values 20 to 21 are already extracted areas, except for this, in the Y coordinate value 11, the X coordinate values 21 to 22
Therefore, the label value 1 is given to this area in FIG.

この動作を繰り返し、第５図のに示す領域に対し、
第６図に示す如く、総てラベル値１を付与すると、第３
図において、隣接するイメージデータが検出されなく
なる。This operation is repeated, and for the area shown in FIG.
As shown in FIG. 6, if all label values are given, the third
In the figure, adjacent image data is no longer detected.

ここで、ラベル付け処理部35は第３図に示す如く、
スタックにデータがあるか調べ、スタックにデータがあ
れば、Ｙ座標値と比較値を復元し、第３図のルーチン
に戻るが、スタックにデータが無ければ、総ての輪郭デ
ータにラベルを付けたか調べる。即ち、第４図の，
，に夫々ラベル付けが済んだか調べる。Here, the labeling unit 35, as shown in FIG.
Check if there is data in the stack. If there is data in the stack, restore the Y coordinate value and the comparison value, and return to the routine of Fig. 3. If there is no data in the stack, label all contour data. Check whether That is, in FIG.
Check if the labels have been labeled for each.

この場合、第４図に未だラベル付けの済まない輪郭デ
ータが残っているため、ラベル付け処理部35はラベルの
付いていない輪郭データの中の最もＹ座標値の小さいも
のにラベル値を付けるため、ラベル値に＋１してから、
即ち、ラベル値を２としてから第３図のルーチンに戻
る。In this case, since the unlabeled contour data still remains in FIG. 4, the labeling processing unit 35 assigns the label value to the unlabeled contour data having the smallest Y coordinate value. +1 to the label value, then
That is, the label value is set to 2, and the process returns to the routine of FIG.

第５図において、Ｙ座標値９までは既にラベルが付い
ており、Ｙ座標値10において、ラベルの付かないＸ座標
値の11と13が抽出される。従って、ラベル付け処理部35
はＸ座標値の11〜13を比較値として、第６図に示す如
く、ラベル値２を付与し、第３図の処理を実行する。In FIG. 5, labels up to the Y coordinate value 9 are already attached, and at the Y coordinate value 10, X coordinate values 11 and 13 which are not labeled are extracted. Therefore, the labeling processor 35
Assigns a label value of 2 as shown in FIG. 6 using the X coordinate values 11 to 13 as comparison values, and executes the processing of FIG.

即ち、Ｙ座標値11において、Ｘ座標値11〜16はＹ座標
値10の比較値11〜13に重なる座標値であるため、ラベル
付け処理部35はラベル値２を付与する。そして、第３図
〜のルーチンで検出されるように、Ｙ座標値10のＸ
座標値14と18が比較値11〜16に重なる座標値であるた
め、分岐した領域であると判定し、Ｙ座標値10のＸ座標
値11と13は既に抽出済であるため除き、Ｘ座標値14と18
に第６図に示す如くラベル値２を付与する。That is, in the Y coordinate value 11, since the X coordinate values 11 to 16 are coordinate values that overlap the comparison values 11 to 13 of the Y coordinate value 10, the labeling processing unit 35 gives the label value 2. Then, as detected by the routines of FIG.
Since the coordinate values 14 and 18 are coordinate values that overlap the comparison values 11 to 16, it is determined that the area is a branched area, and the X coordinate values 11 and 13 of the Y coordinate value 10 are already extracted and are excluded. Values 14 and 18
A label value of 2 is given to the label as shown in FIG.

又、Ｙ座標値11のＸ座標値17と19は、Ｙ座標値10の比
較値14〜18に重なる座標値であるため、同様にラベル値
２を付与する。Further, since the X coordinate values 17 and 19 of the Y coordinate value 11 are coordinate values that overlap the comparison values 14 to 18 of the Y coordinate value 10, the label value 2 is similarly given.

Ｙ座標値12において、Ｘ座標値12〜14はＹ座標値11の
比較値11〜16に重なる座標値であり、Ｙ座標値13におい
て、Ｘ座標値13〜15はＹ座標値12の比較値12〜14に重な
る座標値であり、Ｙ座標値14において、Ｘ座標値14〜17
はＹ座標値13の比較値13〜15に重なる座標値である。In the Y coordinate value 12, the X coordinate values 12 to 14 are coordinate values that overlap the comparison values 11 to 16 of the Y coordinate value 11, and in the Y coordinate value 13, the X coordinate values 13 to 15 are the comparison values of the Y coordinate value 12. The coordinate value overlaps 12 to 14, and the Y coordinate value is 14 and the X coordinate value is 14 to 17.
Are coordinate values that overlap the comparison values 13 to 15 of the Y coordinate value 13.

更に、Ｙ座標値15において、Ｘ座標値16〜18はＹ座標
値14の比較値14〜17に重なる座標値であり、Ｙ座標値16
において、Ｘ座標値17〜18はＹ座標値15の比較値16〜18
に重なる座標値であり、Ｙ座標値17において、Ｘ座標値
16〜18はＹ座標値16の比較値17〜18に重なる座標値であ
る。Furthermore, in the Y coordinate value 15, the X coordinate values 16 to 18 are coordinate values that overlap the comparison values 14 to 17 of the Y coordinate value 14, and the Y coordinate value 16
In, the X coordinate value 17-18 is the comparison value 16-18 of the Y coordinate value 15.
Is a coordinate value that overlaps
16 to 18 are coordinate values that overlap the comparison values 17 to 18 of the Y coordinate value 16.

Ｙ座標値18において、Ｘ座標値13〜17は、第３図〜
に示すルーチンで検出されるように、Ｙ座標値17の比
較値12〜14と重なる座標値であり、且つ、Ｙ座標値17の
Ｘ座標値16〜18とも重なる座標値であり、分岐した領域
があると判定される。In the Y coordinate value 18, the X coordinate values 13 to 17 are shown in FIG.
As detected by the routine shown in (1), the coordinate value overlaps with the comparison values 12 to 14 of the Y coordinate value 17, and also overlaps with the X coordinate values 16 to 18 of the Y coordinate value 17, and the branched region It is determined that there is.

Ｙ座標値17において、Ｘ座標値16〜18は既に抽出済の
領域であるため除かれ、Ｘ座標値12〜14はＹ座標値18の
比較値13〜17に重なり、Ｙ座標値17の比較値12〜14はＹ
座標値16のＸ座標値12〜13と重なる座標値であることか
ら、ラベル付け処理部35はこの隣接する領域に、第６図
に示す如くラベル値２を付与する。In the Y coordinate value 17, the X coordinate values 16 to 18 are excluded because they are already extracted regions, and the X coordinate values 12 to 14 overlap the comparison values 13 to 17 of the Y coordinate value 18, and the Y coordinate value 17 is compared. Values 12-14 are Y
Since the coordinate value is a coordinate value that overlaps the X coordinate values 12 to 13 of the coordinate value 16, the labeling processing unit 35 assigns the label value 2 to this adjacent area as shown in FIG.

同様にして、ラベル付け処理部35は、第５図のＹ座標
値14において、ラベルの付かないＸ座標値の８と12を抽
出し、Ｘ座標値の８〜12を比較値として、第３図の処
理を実行するが、Ｙ座標値13及びＹ座標値15に比較値８
〜12と重なる座標値が無いため、第６図に示す如く、Ｙ
座標値14において、ラベルの付かないＸ座標値の８と12
にラベル値３を付与する。Similarly, the labeling unit 35 extracts the unlabeled X coordinate values 8 and 12 in the Y coordinate value 14 of FIG. 5, and sets the X coordinate values 8 to 12 as the comparison value to obtain the third value. The process shown in the figure is executed, but the comparison value 8 is added to the Y coordinate value 13 and the Y coordinate value 15.
As there are no coordinate values that overlap with ~ 12, as shown in Fig. 6, Y
At coordinate value 14, the unlabeled X coordinate values 8 and 12
A label value of 3 is assigned to.

ラベル付け処理部35は総ての輪郭データにラベルを付
与すると、ラベルテーブルメモリ38に第６図に示す如き
ラベルテーブルを格納した後、このラベルテーブルを参
照し、第３図に示す如く、第４図に示す仮想文字枠54
において、ラベル値が同一の輪郭データの最大と最小の
座標値を求め、これをラベル領域テーブルとしてラベル
領域テーブルメモリ39に格納する。After labeling all contour data, the labeling unit 35 stores the label table as shown in FIG. 6 in the label table memory 38, and then refers to this label table, as shown in FIG. Virtual character frame 54 shown in FIG.
At, the maximum and minimum coordinate values of the contour data having the same label value are obtained, and these are stored in the label area table memory 39 as a label area table.

即ち、第４図に示す輪郭データのラベル値は１であ
るが、この輪郭データのＹ方向座標値の最大値、即
ち、黒画素から白画素に変化する点の座標値は第４図と
第７図に示す如く18であり、Ｘ方向座標値の最小値、即
ち、白画素から黒画素に変化する点の座標値は20であ
る。又、Ｙ方向座標値の最小値、即ち、白画素から黒画
素に変化する点の座標値は９であり、Ｘ方向座標値の最
大値、即ち、黒画素から白画素に変化する点の座標値は
28である。That is, the label value of the contour data shown in FIG. 4 is 1, but the maximum value of the Y-direction coordinate value of this contour data, that is, the coordinate value of the point changing from the black pixel to the white pixel is the same as that shown in FIG. As shown in FIG. 7, the value is 18 and the minimum value of the X-direction coordinate value, that is, the coordinate value of the point at which the white pixel changes to the black pixel is 20. Further, the minimum value of the Y-direction coordinate value, that is, the coordinate value of the point that changes from the white pixel to the black pixel is 9, and the maximum value of the X-direction coordinate value, that is, the coordinate of the point that changes from the black pixel to the white pixel. value is
28.

次ぎに、ラベル値２の輪郭データのＹ方向座標値の
最大値、即ち、黒画素から白画素に変化する点の座標値
は19であり、Ｘ方向座標値の最小値、即ち、白画素から
黒画素に変化する点の座標値は11である。又、Ｙ方向座
標値の最小値、即ち、白画素から黒画素に変化する点の
座標値は10であり、Ｘ方向座標値の最大値、即ち、黒画
素から白画素に変化する点の座標値は19である。Next, the maximum value of the Y-direction coordinate value of the contour data of label value 2, that is, the coordinate value of the point changing from the black pixel to the white pixel is 19, and the minimum value of the X-direction coordinate value, that is, from the white pixel. The coordinate value of the point that changes to a black pixel is 11. Further, the minimum value of the Y-direction coordinate value, that is, the coordinate value of the point that changes from the white pixel to the black pixel is 10, and the maximum value of the X-direction coordinate value, that is, the coordinate of the point that changes from the black pixel to the white pixel. The value is 19.

更に、ラベル３の輪郭データのＹ方向座標値の最大
値、即ち、黒画素から白画素に変化する点の座標値は15
であり、Ｘ方向座標値の最小値、即ち、白画素から黒画
素に変化する点の座標値は８である。又、Ｙ方向座標値
の最小値、即ち、白画素から黒画素に変化する点の座標
値は14であり、Ｘ方向座標値の最大値、即ち、黒画素か
ら白画素に変化する点の座標値は12である。Further, the maximum value of the Y-direction coordinate value of the contour data of label 3, that is, the coordinate value of the point where the black pixel changes to the white pixel is 15
The minimum value of the X-direction coordinate value, that is, the coordinate value of the point where the white pixel changes to the black pixel is 8. Further, the minimum value of the Y-direction coordinate value, that is, the coordinate value of the point that changes from the white pixel to the black pixel is 14, and the maximum value of the X-direction coordinate value, that is, the coordinate of the point that changes from the black pixel to the white pixel. The value is 12.

切出し処理部36はラベル領域テーブルメモリ39から、
第７図に示す如きラベル領域テーブルを読出して参照
し、第３図に示す如く、一画素の領域が連なるイメー
ジデータ、即ち、幅が一画素の細長い領域を持つイメー
ジデータがあるか調べる。The cutout processing unit 36 uses the label area table memory 39 to
The label area table as shown in FIG. 7 is read out and referred to, and as shown in FIG. 3, it is checked whether there is image data in which one pixel area is continuous, that is, image data having an elongated area having a width of one pixel.

第７図に示すラベル領域テーブルからラベル値３の輪
郭データの領域を作成すると、第４図のに示す大き
さの領域が作成され、幅が一画素の細長い領域を持つイ
メージデータであることが分かる。When an area of contour data having a label value of 3 is created from the label area table shown in FIG. 7, an area having the size shown in FIG. 4 is created, and the image data has an elongated area having a width of 1 pixel. I understand.

ここで、幅が一画素の細長い領域を持つイメージデー
タが存在した場合、切出し処理部36は第３図に示す如
く、第８図に示す如きラベル選択テーブルを作成し、ラ
ベル値３の領域はノイズに基づく不要イメージデータで
あることを示す符号としてC0Hを付与して登録する。Here, when there is image data having an elongated area having a width of one pixel, the clipping processing unit 36 creates a label selection table as shown in FIG. 8 as shown in FIG. C0H is added and registered as a code indicating that the image data is unnecessary image data based on noise.

続いて、切出し処理部36は第３図に示す如く、Ｎ×
Ｎ画素以下の領域、例えば３×３画素以下の領域を持つ
イメージデータがあるか調べる。Subsequently, the cutout processing unit 36, as shown in FIG.
It is checked whether there is image data having a region of N pixels or less, for example, a region of 3 × 3 pixels or less.

第７図に示すラベル領域テーブルからラベル値２の輪
郭データの領域を作成すると、第４図のに示す大き
さの領域が作成され、ラベル値１の輪郭データの領域
を作成すると、第４図のに示す大きさの領域が作成さ
れる。When the contour data area of label value 2 is created from the label area table shown in FIG. 7, the area of the size shown in FIG. 4 is created, and when the contour data area of label value 1 is created, FIG. An area of the size shown in is created.

との領域は３×３画素以上の大きさであるため、
切出し処理部36は第３図の処理は行わず、第３図の
処理に移行する。即ち、隣接文字等からの侵入イメージ
データがあるか調べる。Since the area of and is 3 × 3 pixels or more,
The cut-out processing unit 36 does not perform the processing shown in FIG. 3 and shifts to the processing shown in FIG. That is, it is checked whether there is intrusion image data from adjacent characters or the like.

これは、第４図に示す仮想文字枠53と比較して、仮想
文字枠53の外側に出る領域の方が大きい輪郭データがあ
るか調べる。This is compared with the virtual character frame 53 shown in FIG. 4, and it is checked whether or not there is larger contour data in the area outside the virtual character frame 53.

前記の如く、ラベル値２の輪郭データの領域を作
成すると、領域は仮想文字枠53の外側に出る領域は無
い。As described above, when the contour data area having the label value 2 is created, there is no area outside the virtual character frame 53.

しかし、ラベル値１の輪郭データの領域を作成す
ると、領域は仮想文字枠53の外側に出る領域が大き
い。これは、領域のＸ座標値20と28に対し、仮想文字
枠53のＸ座標値は22であり、仮想文字枠54のＸ座標値は
28であるため、その比（22−20）／（28−22）から判定することが出来る。However, when the contour data area having the label value 1 is created, the area is large outside the virtual character frame 53. This is because the X coordinate value of the virtual character frame 53 is 22 and the X coordinate value of the virtual character frame 54 is
Since it is 28, it can be judged from the ratio (22-20) / (28-22).

従って、切出し処理部36は第８図に示す如く、ラベル
値２の輪郭データには、例えば符号０を付与し、ラベル
値１の輪郭データには、侵入に基づく不要イメージデー
タとして、例えば符号80Hを付与してラベル選別テーブ
ルに登録し、このラベル選別テーブルをラベル選別テー
ブルメモリ30に格納する。Therefore, as shown in FIG. 8, the cut-out processing unit 36 assigns, for example, the reference numeral 0 to the contour data of the label value 2, and to the contour data of the label value 1 as unnecessary image data based on the intrusion, for example, the reference numeral 80H. Is added and registered in the label selection table, and this label selection table is stored in the label selection table memory 30.

切出し処理部36は、次にラベル選別テーブルメモリ30
に格納した第８図に示す如きラベル選別テーブルを参照
し、C0Hと80Hの符号を付与されたラベル値のイメージデ
ータは不要であると認識し、０の符号が付与されたラベ
ル値のイメージデータのみを用い、第３図に示す如
く、不要イメージデータを除いた横軸がＹ方向の輪郭テ
ーブルを作成する。即ち、第５図の輪郭テーブルのと
を除いた輪郭テーブルを作成する。The cutout processing unit 36 then determines the label selection table memory 30.
By referring to the label selection table as shown in FIG. 8 stored in FIG. 8, it is recognized that the image data of the label value assigned the code of C0H and 80H is unnecessary, and the image data of the label value assigned the code of 0 is recognized. As shown in FIG. 3, a contour table in which the horizontal axis is in the Y direction is created by using only the above. That is, the contour table except for the contour table of FIG. 5 is created.

そして、この横軸がＹ方向の輪郭テーブルから、横軸
がＸ方向の輪郭テーブルを作成する。そして、切出し済
文字輪郭情報として、新たに作成した二つの輪郭テーブ
ルを端子Ｂを経て送出する。Then, a contour table in which the horizontal axis is in the X direction is created from the contour table in which the horizontal axis is in the Y direction. Then, as the cut-out character contour information, the two newly created contour tables are transmitted via the terminal B.

〔The invention's effect〕

以上説明した如く、本発明は文字を認識するため作成
される文字枠内の不要イメージデータを効率良く除去す
ることが可能なため、文字認識率を高めると共に、処理
効率を向上させることが出来る。As described above, according to the present invention, unnecessary image data in a character frame created for recognizing a character can be efficiently removed, so that the character recognition rate and the processing efficiency can be improved.

[Brief description of drawings]

第１図は本発明の原理を説明する図、第２図は本発明の一実施例を説明するブロック図、第３図は第２図の動作を説明するフローチャート、第４図は仮想文字枠を説明する図、第５図は輪郭テーブルの一例を示す図、第６図はラベルテーブルの一例を示す図、第７図はラベル領域テーブルの一例を示す図、第８図はラベル選別テーブルの一例を示す図、第９図は従来の技術を説明する図、第10図はノイズ除去を説明する図である。図において、 31は輪郭テーブル作成手段、32はラベル付け処理手段、
33は切出し処理手段、34は輪郭テーブル作成部、35はラ
ベル付け処理部、36は切出し処理部、37は輪郭テーブル
メモリ、38はラベルテーブルメモリ、39はラベル領域テ
ーブルメモリ、40はラベル選別テーブルメモリ、50は文
字イメージデータ、51はベクトル、52は外接四辺形、5
3,54は仮想文字枠である。1 is a diagram for explaining the principle of the present invention, FIG. 2 is a block diagram for explaining an embodiment of the present invention, FIG. 3 is a flow chart for explaining the operation of FIG. 2, and FIG. 4 is a virtual character frame. FIG. 5 is a diagram showing an example of a contour table, FIG. 6 is a diagram showing an example of a label table, FIG. 7 is a diagram showing an example of a label area table, and FIG. 8 is a label selection table. FIG. 9 is a diagram illustrating an example, FIG. 9 is a diagram illustrating a conventional technique, and FIG. 10 is a diagram illustrating noise removal. In the figure, 31 is a contour table creating means, 32 is a labeling processing means,
33 is a cutout processing means, 34 is a contour table creation unit, 35 is a labeling processing unit, 36 is a cutout processing unit, 37 is a contour table memory, 38 is a label table memory, 39 is a label area table memory, and 40 is a label selection table. Memory, 50 is character image data, 51 is vector, 52 is circumscribed quadrilateral, 5
3, 54 are virtual character frames.

Claims

[Claims]

1. In order to recognize a character contour image, a first character frame in which a predetermined size is enlarged from a circumscribed quadrangle circumscribing a vector obtained from the character image, and a predetermined character further than the first character frame are provided. And a second character frame in which the size of the second character frame is enlarged, and a contour table as character contour information is generated from the image data in the second character frame. From the image data existing in the character frame, the contour table creating means (31) for creating the contour table, and the image data in which regions are continuous from the contour table created by the contour table creating means (31) are selected, and each is independent. After assigning different value labels to the image data having the specified area, the label area table is created by obtaining the maximum and minimum coordinate values of the area of the image data for each label value from the contour table. By referring to the labeling processing means (32) and the label area table created by the labeling processing means (32), image data in which one pixel area is continuous and image data having an area of N × N pixels or less And the image data having a large area existing outside the area occupied by the first character frame, and selected from the contour table based on the label value given to the selected image data. A unnecessary data removing method at the time of generating a character contour, which is provided with a cutting processing means (33) for erasing information of image data.