JPS6327752B2

JPS6327752B2 -

Info

Publication number: JPS6327752B2
Application number: JP55187050A
Authority: JP
Inventors: Akira Inoe; Masumi Yoshida
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1980-12-27
Filing date: 1980-12-27
Publication date: 1988-06-06
Also published as: JPS57111677A

Description

【発明の詳細な説明】本発明は文字図形分離方式に関し、特に文字と
図形が混在して記載されている画像情報からでも
その文字情報のみを分離して認識できるようにし
た文字図形分離方式に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a character/figure separation method, and more particularly to a character/figure separation method that allows only character information to be separated and recognized even from image information containing a mixture of characters and figures. .

例えばプリント回路の設計図面や論理回路等通
常の図面では、通常は文字と図形が混在して画か
れているのが普通である。したがつてこのような
図面を認識する場合、図形と文字の混在がその自
動認識処理を困難にしている。これは図形に用い
る処理方式と文字に用いる処理方式が全く異質の
ものであることによる。したがつて図面を認識し
て、例えば手書き図面をデータ処理装置により製
図するような場合、図形ならば直線とか、矩形と
か正方形とかあるいは円とか、ある程度のパター
ンが決つており、それにもとづき製図することが
できるが、文字の場合の識別は図形の識別と全く
異質なために、図面を認識する場合、先ず文字と
図形を分離する必要が生ずる。 For example, in ordinary drawings such as design drawings of printed circuits and logic circuits, characters and figures are usually drawn in a mixture. Therefore, when recognizing such drawings, the mixture of figures and characters makes automatic recognition difficult. This is because the processing method used for graphics and the processing method used for characters are completely different. Therefore, when recognizing a drawing and, for example, drawing a handwritten drawing using a data processing device, there is a certain certain pattern of shapes, such as straight lines, rectangles, squares, or circles, and it is necessary to draw the drawing based on that pattern. However, since the identification of characters is completely different from the identification of graphics, when recognizing a drawing, it is first necessary to separate the characters and graphics.

本発明はこのような必要性に応じた文字と図形
とを分離することを可能にした文字図形分離方式
の提供を目的とするものであり、このために本発
明の文字図形分離方式では、文字と図形が混在さ
れた画像情報から文字と図形を分離する文字図形
分離方式において、前記画像情報を入力する入力
手段と、該入力手段から入力された入力情報を保
持する情報保持手段と、該情報保持手段に保持さ
れた情報を四辺形のマスクを用いて特徴抽出を行
なう特徴抽出手段を有し、該情報保持手段に保持
された該入力情報を、該マスクを走査して、該マ
スクの左辺に前記文字を構成する部分が存在せ
ず、且つ右辺に該部分が存在すれば文字情報エリ
アの開始位置と判断し、次に該マスクを走査して
該マスクの上辺及び下辺に該部分が存在せず、且
つ該右辺に該部分が存在すれば該文字情報エリア
内であると判断し、更に該マスクを走査して該マ
スクの左辺の該部分が存在し、且つ右辺に該部分
が存在しなければ該文字領域の端であると判断し
て該文字情報エリアを識別することにより、文字
と図形を分離することを特徴とする。 The object of the present invention is to provide a character/figure separation method that makes it possible to separate characters and figures in accordance with such needs. In a character/figure separation method for separating characters and figures from image information containing a mixture of images and figures, an input means for inputting the image information, an information holding means for holding input information input from the input means, and the information It has a feature extraction means for extracting features from the information held in the holding means using a quadrilateral mask, and scans the input information held in the information holding means through the mask to extract features from the left side of the mask. If the part constituting the character does not exist in , and the part exists on the right side, it is determined to be the start position of the character information area, and then the mask is scanned and the part exists on the top and bottom sides of the mask. If not, and the part exists on the right side, it is determined that it is within the character information area, and the mask is further scanned to find that the part on the left side of the mask exists, and the part exists on the right side. If not, the character information area is determined to be at the edge of the character area and the character information area is identified, thereby separating the characters and graphics.

以下本発明の一実施例を詳述するに先立ち、第
１図〜第３図にもとづき本発明の概略を説明す
る。 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Before describing one embodiment of the present invention in detail, an outline of the present invention will be explained based on FIGS. 1 to 3.

まず、第１図に示すように横方向の幅がYl、
縦方向の高さがTlで各辺の幅が単位長の中空の
マスクＭを使用する。そして第２図イに示すよう
にこのマスクＭのうち領域Ａがすべて「０」であ
つて領域Ｂのうちの一部にでも「１」があるか否
かをバツフアに記憶されている画像情報について
走査する。なおここで「１」とは画像情報が存在
することを示し、「０」はそれが存在しないこと
を示している。この第２図イに示すモード０の状
態で画像情報をｘ方向に走査する。そして第４図
に示す如く、文字領域の最初の部分に入るとき、
モード０が成立する。すなわち、第２図イにおけ
る領域Ａがオール「０」、領域Ｂに少なくとも１
つの「１」が存在することになる。このモード０
が成立したとき、マスクは第２図ハに示すよう
に、領域ＥとＦがそれぞれオール「０」か否かを
チエツクするモード３になり、領域ＥとＦがすべ
て「０」であるこのモード３が成立したとき、マ
スクはその位置で、第４図に示す領域Ｇに「１」
が存在するか否かをチエツクするモード２にな
る。このようにして、モード３とモード２が成立
すれば文字領域が連続しているものと判断して、
この状態でｘ方向の走査が行なわれる。モード０
ではＡの部分がすべて０でなければならないのに
対し、モード２ではモード０のＡの部分を無視
（０でも１でも良い）し、Ｂに相当する部分だけ
を検証する。この理由は、モード０は文字列の始
まりをみつけるためのものであり、モード２は文
字列を構成する個別文字間のつながりをみつける
ためのものである。そしてモード３のチエツクの
ときに、例えば領域Ｅに「１」が存在すれば、そ
れは文字情報ではなく、線とかあるいはパターン
等の図形情報とみなし、再びモード０でｘ方向の
走査が行なわれる。そしてモード２のチエツクに
おいて「１」が存在しなければ、文字領域の終り
か否かを判断するため、第２図ロに示すロのモー
ド１、つまり領域Ｄがオール「０」で領域Ｃに
「１」が存在するか否かをチエツクする。そして
このモード１が成立すれば、第４図に示すように
これを文字の終りと判断する。 First, as shown in Figure 1, the horizontal width is Yl,
A hollow mask M with a vertical height of Tl and a width of each side of unit length is used. Then, as shown in FIG. 2A, the image information stored in the buffer indicates whether all the areas A in this mask M are ``0'' and there are even some ``1''s in area B. Scan about. Note that here, "1" indicates that image information exists, and "0" indicates that it does not exist. Image information is scanned in the x direction in the mode 0 state shown in FIG. 2A. As shown in Figure 4, when entering the first part of the character area,
Mode 0 is established. That is, in FIG. 2A, area A is all 0, and area B is at least 1.
There will be ``1''. This mode 0
When this is true, the mask enters mode 3, which checks whether areas E and F are all 0, as shown in Fig. 2C, and this mode in which areas E and F are all 0 is activated. 3 is established, the mask prints "1" in the area G shown in FIG. 4 at that position.
Mode 2 is entered to check whether or not exists. In this way, if mode 3 and mode 2 are established, it is determined that the character area is continuous, and
In this state, scanning in the x direction is performed. mode 0
In contrast, in mode 2, the A part in mode 0 is ignored (it can be 0 or 1), and only the part corresponding to B is verified. The reason for this is that mode 0 is for finding the beginning of a character string, and mode 2 is for finding connections between individual characters that make up the string. When checking in mode 3, for example, if "1" exists in area E, it is assumed that it is not character information but graphic information such as a line or pattern, and scanning in the x direction is performed again in mode 0. If "1" does not exist in the mode 2 check, in order to judge whether or not it is the end of the character area, mode 1 shown in FIG. Check whether "1" exists. If mode 1 is established, this is determined to be the end of the character, as shown in FIG.

ところでこの場合、文字の大きさが画面により
異なることが多いため、マスクＭの大きさを固定
することはできない。それでこのマスクの大きさ
すなわちその幅Ylと高さTlを決定するため、第
５図に示す如き処理を最初に行なう。 By the way, in this case, the size of the mask M cannot be fixed because the size of the characters often differs depending on the screen. Therefore, in order to determine the size of this mask, that is, its width Yl and height Tl, the process shown in FIG. 5 is first performed.

(a) まず、第５図イに示すように全画面情報を走
査してその「１」，「０」を求める。そしてこの
全画面情報よりその画像情報の変化点すなわち
「０」→「１」および「１」→「０」に変化す
る点を求める。例えば第５図ロに示すように、
いまラインｌ上を走査するとき、x₁では「０」
→「１」に変化し、x₂では「１」→「０」に変
化し、x₃では「０」→「１」に変化し、x₄では
「１」→「０」に変化する。このように変化点
を求めたとき、その変化点間のインターバルの
和_T-1 〓^x=1 ｌと変化点の個数Tyをカウントする。そ
してその変化点間のインターバルの和を変化点
数Tyで除して変化点間の平均距離を求め
る。(a) First, as shown in FIG. 5A, the entire screen information is scanned to find its "1" and "0". Then, from this full-screen information, the changing points of the image information, that is, the points where the image information changes from "0" to "1" and from "1" to "0" are determined. For example, as shown in Figure 5B,
Now when scanning on line l, x ₁ is "0"
→ changes to “1”, x ₂ changes from “1” to “0”, x ₃ changes from “0” to “1”, and x ₄ changes from “1” to “0”. When the change points are determined in this way, the sum of the intervals between the change points _T-1 〓 ^x=1 l and the number of change points Ty are counted. Then, the average distance between the change points is determined by dividing the sum of the intervals between the change points by the number of change points Ty.

(b) そして全画面を走査して、変化点間の距離が
上記平均距離より小さくなる変化点の座
標を、その近傍に文字を含む侯補点として記憶
する。これにより第５図ハに示す如き侯補点情
報が得られる。 (b) Then, the entire screen is scanned, and the coordinates of the change points where the distance between the change points is smaller than the above-mentioned average distance are stored as the interpolation points that include characters in the vicinity. As a result, the information on the marquee complement points as shown in FIG. 5C is obtained.

(c) 次に連続する侯補点のインターバルの最大値
をM_Lとする。例えば第８図イの原画において、
変化点の１例が同図ロの黒点（「０」→「１」）、
白点（「１」→「０」）として部分的に示され
る。そのインターバルは第８図ハのl₁，l₂……
lnとして示される。このうちこのインターバル
の平均距離以下のインターバルは、第８
図ニに示すl₁，l₂，l₄……で示される。そして
インターバルの最大値MLは、第８図ホの如く
示される。(c) Let M _L be the maximum value of the interval between the next consecutive interpolation points. For example, in the original picture in Figure 8A,
An example of a change point is the black dot in Figure B (“0” → “1”).
Partially shown as a white dot (“1” → “0”). The intervals are l ₁ , l ₂ . . . in Figure 8 C.
Denoted as ln. Among these, the interval less than the average distance of this interval is the 8th
They are indicated by l ₁ , l ₂ , l ₄ ... shown in Figure D. The maximum value ML of the interval is shown as in FIG. 8(e).

(d) 同様の走査を縦方向にも行ない、連続する侯
補点の始点と終点のインターバルの平均を
Tlmとする。そしてこのTlmの5/4倍を上記マ
スクＭの高さTlとし、またその5/8を該マスク
Ｍの幅Ylとする。(d) Perform a similar scan in the vertical direction, and calculate the average interval between the start and end points of consecutive interpolation points.
Tlm. Then, 5/4 times this Tlm is the height Tl of the mask M, and 5/8 is the width Yl of the mask M.

Tl＝５／４Tlm Yl＝５／８Tlm この数値は発明者等の実験により得られたもの
である。このようにして定められた大きさのマス
クＭを使用して、上記の如き文字の識別を行なう
ものである。第８図の例では、Tlmは同図ヘに
示す如きものとなり、Tl，Ylは同図トに示す如
きものとなる。 Tl=5/4Tlm Yl=5/8Tlm These values were obtained through experiments by the inventors. The above-mentioned characters are identified using the mask M having the size thus determined. In the example of FIG. 8, Tlm is as shown in F of the figure, and Tl and Yl are as shown in G of the figure.

次に本発明の一実施例構成を第６図および第７
図にもとづき説明する。 Next, the configuration of one embodiment of the present invention is shown in FIGS. 6 and 7.
This will be explained based on the diagram.

第６図は本発明の一実施例構成を示し、第７図
は本発明の動作状態を説明するフローチヤートで
ある。 FIG. 6 shows the configuration of an embodiment of the present invention, and FIG. 7 is a flowchart for explaining the operating state of the present invention.

図中、１は入力部、２は画像メモリ、３はバツ
フア、４はパラメータ演算部、５はアドレス発生
部、６は領域ROM、７は制御部、８はアンド回
路、９は特徴抽出部、１０はゲート、１１はアド
レス・テーブル、１２は画像クリア回路、１３は
出力メモリである。 In the figure, 1 is an input section, 2 is an image memory, 3 is a buffer, 4 is a parameter calculation section, 5 is an address generation section, 6 is an area ROM, 7 is a control section, 8 is an AND circuit, 9 is a feature extraction section, 10 is a gate, 11 is an address table, 12 is an image clear circuit, and 13 is an output memory.

入力部１は図面を読取りこれを電気信号に変換
するものであつて、この入力部１から入力された
画像情報は画像メモリ２に保持される。バツフア
３は画像情報よりその変化点を求めたり、マスク
走査を行うための作業領域用のバツフア・メモリ
である。 The input section 1 reads a drawing and converts it into an electrical signal, and image information input from the input section 1 is held in an image memory 2. The buffer 3 is a buffer memory for use as a work area for determining points of change from image information and for performing mask scanning.

パラメータ演算部４は、画像情報から変化点を
検出し、これにより上記M_L，Tl，Yl等のパラメ
ータ値を演算し、これらを保持するものである。 The parameter calculation unit 4 detects changing points from the image information, calculates parameter values such as M _L , Tl, Yl, etc., and holds them.

アドレス発生部５は、バツフア３にデータをセ
ツトしたりあるいはこのセツトされたデータを読
出すためのアドレス信号を発生するものである。 The address generator 5 generates an address signal for setting data in the buffer 3 or reading the set data.

領域ROM６はマスクＭの領域を定めるデータ
が出力されるものであり、そのマスクＭを決定す
るためのパラメータは上記パラメータ演算部４か
ら伝達される。 The area ROM 6 is for outputting data that defines the area of the mask M, and parameters for determining the mask M are transmitted from the parameter calculation unit 4.

制御部７は、画面情報から文字を分離するまで
のデータ処理に必要な制御信号を発生するための
制御部である。 The control unit 7 is a control unit for generating control signals necessary for data processing up to separating characters from screen information.

特徴抽出部９は第４図に示すモードにもとづき
文字情報か否かを識別するものである。 The feature extractor 9 identifies whether or not the information is character information based on the mode shown in FIG.

アドレス・テーブル１１は、文字の存在する始
端座標xsと文字の存在が終了する終端座標xEが
記入されるレジスタである。 The address table 11 is a register in which the start coordinate xs where a character exists and the end coordinate xE where the character ends are entered.

画像クリア回路１２は、画像メモリ２にセツト
された画像情報のうち、文字領域のみをクリアす
るクリア回路である。 The image clear circuit 12 is a clear circuit that clears only the character area of the image information set in the image memory 2.

出力メモリ１３は画像メモリ２にセツトされた
画像情報から文字領域がクリアされたものが出力
用に保持されるメモリである。 The output memory 13 is a memory in which the image information set in the image memory 2 with the character area cleared is held for output.

次に第６図の動作について説明する。 Next, the operation shown in FIG. 6 will be explained.

(1) まず図面が入力部１にセツトされると、入力
部１はその図面を電気信号に変換し、これを画
像メモリ２に送出する。そして画像メモリ２に
は図面の画像情報が保持される。それから制御
部７は画像メモリ２からバツフア３に対し順次
画像情報を読出すように制御を行なう。そして
このバツフア３にセツトされた画像情報を第５
図イに示すように走査し、これによりパラメー
タ演算部４は変化点を検出する。そしてこのよ
うにして検出した変化点にもとづき、パラメー
タ演算部４は上記(a)〜(d)に詳述した如き演算を
行なつてML，TlおよびYlを求め、これらの値
をその保持用バツフアで保持するとともに演算
結果を領域ROM６に伝達する。かくして領域
ROM６では、これらの値のうち、TlおよびYl
により所定の大きさのマスクを作成する。(1) First, when a drawing is set in the input section 1, the input section 1 converts the drawing into an electrical signal and sends it to the image memory 2. The image memory 2 holds image information of the drawing. Then, the control section 7 performs control to sequentially read image information from the image memory 2 to the buffer 3. Then, the image information set in this buffer 3 is transferred to the 5th buffer.
Scanning is performed as shown in FIG. Then, based on the change points detected in this way, the parameter calculation unit 4 performs the calculations detailed in (a) to (d) above to obtain ML, Tl, and Yl, and uses these values for storage. It is held in a buffer and the calculation result is transmitted to the area ROM6. thus the area
In ROM6, among these values, Tl and Yl
Create a mask of a predetermined size.

(2) このようにして得られたマスクを使用して画
像情報をアクセスするが、まず、アドレス発生
部５から画像情報の左上端の座標位置にマスク
Ｍを置きこのマスクＭを順次ｘ軸方向に走査
し、上記モード０（第７図のフローチヤートで
はモード０をC₁と表示する）すなわちC₁が成
立するか否かをチエツクする。もしC₁が成立
しなければそのときのｘ座標が右端WDHにな
るまで順次この走査を続ける。そしてこのマス
クが右端まで走査してもC₁が成立しなければ、
今度はマスクをｙ軸方向に１つ進め、同様の走
査を行なう。このような走査は、アドレス発生
部５から発生されるアドレスに応じ行なわれ、
マスク領域は領域ROM６から送出される、マ
スクＭの領域のみ「１」になる信号をアンド回
路８に送出することにより、マスク制御が行な
われ、各モードが成立するか否かの識別は特徴
抽出部９で行なわれるものである。(2) The image information is accessed using the mask obtained in this way. First, the mask M is placed at the coordinate position of the upper left corner of the image information from the address generation unit 5, and this mask M is sequentially moved in the x-axis direction. It is checked whether or not the above-mentioned mode 0 (mode 0 is indicated as C ₁ in the flowchart of FIG. 7), that is, C ₁ is established. If C ₁ does not hold, this scanning is continued in sequence until the x-coordinate at that time reaches the right end WDH. And even if this mask is scanned to the right end, if C ₁ does not hold, then
This time, the mask is advanced one step in the y-axis direction and a similar scan is performed. Such scanning is performed according to the address generated from the address generation section 5,
For the mask area, mask control is performed by sending a signal that is sent from the area ROM 6 and becomes "1" only in the area of the mask M to the AND circuit 8, and whether or not each mode is established is determined by feature extraction. This will be done in Section 9.

(3) そしてある位置で上記C₁が成立すれば、第
４図に示すように、そのときのマスクの左上隅
の位置を文字領域始点xsと定め、これを特徴
抽出部９のレジスタに保持する。そしてｘ軸方
向の座標を１つ進めてモード３（C₃）が成立す
るか否かを特徴抽出部９で判別する。勿論この
とき第４図に示す領域Ｇに「１」が存在するこ
とをチエツクするモード２のチエツクも行な
う。しかしながら、上記モード３が不成立の場
合には、その領域は文字領域でないものと判断
し、先に特徴抽出部９内で保持した文字領域始
点xsを消滅させる。そして再びC₁が成立する
か否かをチエツクする。(3) If the above C ₁ is established at a certain position, the upper left corner position of the mask at that time is determined as the character area starting point xs, and this is stored in the register of the feature extraction unit 9. do. Then, the feature extraction unit 9 determines whether mode 3 (C ₃ ) is established by advancing the coordinate in the x-axis direction by one. Of course, at this time, a mode 2 check is also performed to check that "1" exists in area G shown in FIG. However, if mode 3 is not established, it is determined that the area is not a character area, and the character area starting point xs previously held in the feature extraction unit 9 is deleted. Then, check whether _C1 holds true again.

(4) もしも上記(3)においてC₃が成立し、更に次
いでモード２のチエツクが成立すれば、その時
点ではモード３（C₃）のチエツクが不成立とい
うことになる。すなわち第２図ロに示す領域Ｄ
がオール「０」ではなく、「１」が存在したこ
とを示す。そしてこのときのｘ座標から上記
xsを引いたｘ−xsが上記MLよりも小さけれ
ば、ｘ座標が＋１されてマスクが進行し、C₃
が成立するか否かがチエツクされる。そしてマ
スクの進行にともない、C₃は成立したものの、
次いでモード２のチエツクのとき領域Ｇに
「１」が存在しなかつたときには、これはすな
わちC₂が成立したことになる。そして第４図
に示すように、このときの座標ｘの値を文字領
域終点xEと定めこれをアドレス・テーブル１
１に記入する。このようにして文字領域始点
xsと文字領域終点xEが揃つて得られたとき、
ゲート１０を開き、これらの文字領域始点xs
と文字領域終点xEがアドレス・テーブル１１
に送出される。そしてこれにより画像クリア回
路１２が動作し、画像メモリ２に保持された画
像情報より上記文字領域始点xsと文字領域終
点xEの間でかつマスクＭの高さYl（或いはマス
クＭの内側の高さYl′）の領域をクリアする。(4) If C ₃ is established in the above (3) and then the check of mode 2 is established, then the check of mode 3 (C ₃ ) is not established at that point. In other words, area D shown in Figure 2B
indicates that there were not all 0's but 1's. And from the x coordinate at this time, the above
If x−xs, which is obtained by subtracting xs, is smaller than the above ML, the x coordinate is incremented by 1, the mask progresses, and C ₃
It is checked whether or not it holds true. As the mask progressed, _C3 was established, but
Next, when checking in mode 2, if "1" does not exist in region G, this means that _C2 is established. Then, as shown in Figure 4, the value of the coordinate x at this time is determined as the character area end point xE, and this is set as
Fill in 1. In this way, the starting point of the character area
When xs and character area end point xE are obtained,
Open gate 10 and set these character area starting points xs
and character area end point xE is address table 11
sent to. As a result, the image clearing circuit 12 operates, and from the image information held in the image memory 2, the height Yl of the mask M (or the height inside the mask M) is determined between the character area start point xs and the character area end point xE. Clear the area of Yl′).

(5) このような動作を全画像情報について行な
い、その文字領域がすべてクリアされたのち
に、画像メモリ２に保持されたデータが出力メ
モリ１３に伝達される。勿論このとき出力メモ
リ１３に伝達された画像情報は文字情報がクリ
アされている。このようにして出力メモリ１３
から文字の分離された画像情報を得ることがで
きる。(5) After this operation is performed on all the image information and all the character areas are cleared, the data held in the image memory 2 is transmitted to the output memory 13. Of course, the character information of the image information transmitted to the output memory 13 at this time has been cleared. In this way, the output memory 13
Image information with separated characters can be obtained from .

以上説明の如く本発明によれば、マスクを使用
して文字領域のみ分離することができるので、画
像情報から文字情報を区別することが可能になり
図形情報のみを取出すことが非常に容易になる。 As explained above, according to the present invention, only the character area can be separated using a mask, so it becomes possible to distinguish character information from image information, and it becomes very easy to extract only graphic information. .

なお上記説明では文字領域を画像クリア回路で
クリアした例について説明したが、勿論文字領域
始点および文字領域終点間を上記のようにクリア
する代りにこの領域のみを取出すこともできる。
そして取出した情報によりこの文字領域に記載さ
れた文字を識別するような操作を行なうこともで
きる。 In the above explanation, an example has been explained in which the character area is cleared by the image clearing circuit, but of course, instead of clearing the area between the character area start point and the character area end point as described above, it is also possible to extract only this area.
It is also possible to perform operations such as identifying the characters written in this character area using the retrieved information.

[Brief explanation of the drawing]

第１図は本発明において使用するマスク、第２
図は該マスクを使用した識別モードの説明図、第
３図はマスクの走査状態説明図、第４図は文字の
存在と識別モードおよび文字領域始点および文字
領域終点の説明図、第５図はマスクの大きさの決
定等に必要なパラメータの作成の説明図、第６図
は本発明の一実施例構成図、第７図はその動作状
態を説明するフローチヤート、第８図は本発明の
動作説明図である。図中、１は入力部、２は画像メモリ、３はバツ
フア、４はパラメータ演算部、５はアドレス発生
部、６は領域ROM、７は制御部、８はアンド回
路、９は特徴抽出部、１０はゲート、１１はアド
レス・テーブル、１２は画像クリア回路、１３は
出力メモリをそれぞれ示す。 Figure 1 shows the mask used in the present invention, and Figure 2 shows the mask used in the present invention.
The figure is an explanatory diagram of the identification mode using the mask, Fig. 3 is an explanatory diagram of the scanning state of the mask, Fig. 4 is an explanatory diagram of the presence of characters, the identification mode, the character area start point and the character area end point, and Fig. 5 is an explanatory diagram of the character area start point and character area end point. An explanatory diagram of the creation of parameters necessary for determining the size of a mask, etc., Fig. 6 is a configuration diagram of an embodiment of the present invention, Fig. 7 is a flowchart explaining its operating state, and Fig. 8 is an illustration of the method of the present invention. It is an operation explanatory diagram. In the figure, 1 is an input section, 2 is an image memory, 3 is a buffer, 4 is a parameter calculation section, 5 is an address generation section, 6 is an area ROM, 7 is a control section, 8 is an AND circuit, 9 is a feature extraction section, 10 is a gate, 11 is an address table, 12 is an image clear circuit, and 13 is an output memory.

Claims

[Scope of Claims] 1. A character/figure separation method for separating characters and figures from image information containing a mixture of characters and figures, comprising: an input means for inputting the image information; and holding input information input from the input means. and a feature extracting means for extracting features from the information held in the information holding means using a quadrilateral mask. When scanning, if the part constituting the character does not exist on the left side of the mask and the part exists on the right side, it is determined that the character information area is the starting position, and then the mask is scanned to find the part of the mask. If the portion does not exist on the upper and lower sides and the portion exists on the right side, it is determined that the character information area is within the character information area, and further scans the mask to find that the portion on the left side of the mask exists; If the part does not exist on the right side, it is determined that it is the end of the character area, and the character information area is identified, thereby separating characters and graphics.