JPH0715702B2

JPH0715702B2 - Character pattern cutting device

Info

Publication number: JPH0715702B2
Application number: JP60285021A
Authority: JP
Inventors: 文夫依田; 陽二前田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1985-12-18
Filing date: 1985-12-18
Publication date: 1995-02-22
Anticipated expiration: 2010-02-22
Also published as: JPS62144288A

Description

[Industrial field of application]

The present invention relates to a character pattern cutout device that separates and cuts out individual characters that intrude into one another's space.

[Conventional technology]

FIG. 2 is a block diagram of a conventional device of this type, disclosed for example in Japanese Patent Publication No. 57-51144. In the figure, 1 is a sheet of paper; 2 is scanning means that optically scans a character string written on the sheet 1 and photoelectrically converts it; 3 is character string pattern storage means that stores the pattern of the photoelectrically converted character string (hereinafter the "character string pattern"); 4 is white area detecting means that scans the character string pattern horizontally, line by line from the top, and detects and stores the left-end and right-end coordinates of each area in which white pixels are continuous (hereinafter a "white area"); 5 is continuity checking means that, based on the white area information detected by the white area detecting means 4, examines the continuity of vertically adjacent white areas and selects and stores the columns of white areas that are continuous with one another across all scans from the top to the bottom; and 6 is character separating means that, based on the information on the columns of white areas selected by the continuity checking means 5, cuts out character patterns one character at a time from the character string pattern stored in the character string pattern storage means 3 and outputs them.

FIGS. 3(A) to 3(C) are diagrams for explaining the operation of the conventional device shown in FIG. 2. In the figures, 7 is an example character string pattern "23", 8 is a white pixel, 9 is a black pixel, 10 is an example of white areas, 11 is an example of a selected column of white areas, and 12 and 13 are examples of areas to be cut out.

Next, the operation of the conventional device shown in FIG. 2 is described with reference to FIGS. 3(A) to 3(C).

First, the character string on the sheet 1 is photoelectrically converted by the scanning means 2 and stored in the character string pattern storage means 3. Next, the character string pattern "23" 7 held in the character string pattern storage means 3 is passed to the white area detecting means 4.

The white area detecting means 4 scans the character string pattern "23" 7 in the left-right direction and detects the left-end and right-end coordinates of each white area in which white pixels are continuous. That is, in the first scan of the character string pattern "23" 7, it detects the white areas whose left-end and right-end coordinates are x1 and x3, and x6 and x8, respectively. Similarly, it detects in order the white areas in the third scan, ..., and the tenth scan, obtains the white areas 10 in each scan as shown in the example of FIG. 3(B), and sends them to the continuity checking means 5.
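The white area detection described above amounts to finding runs of white pixels on each scan line. The following is a minimal sketch, not the patented circuit: it assumes the character string pattern is a list of rows in which 0 is a white pixel and 1 is a black pixel, and the function names `white_runs` and `detect_white_areas` are illustrative only.

```python
def white_runs(row):
    """Return (left, right) inclusive column indices of each run of white pixels (0)."""
    runs = []
    start = None  # left end of the run currently being traced, if any
    for x, pixel in enumerate(row):
        if pixel == 0 and start is None:
            start = x                    # a white run begins here
        elif pixel != 0 and start is not None:
            runs.append((start, x - 1))  # a black pixel closes the run
            start = None
    if start is not None:
        runs.append((start, len(row) - 1))  # run reaches the right edge
    return runs


def detect_white_areas(image):
    """Scan the image from the top row down and collect the white runs of every row."""
    return [white_runs(row) for row in image]
```

Each entry of the result corresponds to one scan and holds the left-end and right-end coordinates that the later stages work on.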

Next, based on the white area information detected by the white area detecting means 4, the continuity checking means 5 selects the white areas that are continuous with one another across all scans from the top scan to the bottom scan, and transfers them to the character separating means 6.

That is, the continuity checking means 5 is configured to judge two white areas continuous when the intervals given by their left-end and right-end coordinates overlap. Based on the information on the white areas 10 shown in FIG. 3(B), the continuity checking means 5 first checks, in order, the continuity of the white areas between the first and second scans, between the second and third scans, and so on, detecting the white areas that are continuous between adjacent scans. It then selects the column 11 of white areas shown in the example of FIG. 3(C), whose white areas are continuous with one another from the first scan through the tenth scan.
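The continuity check can be sketched as an interval-overlap test followed by two passes over the scans. This is one possible reading, under stated assumptions: white areas are given per scan as inclusive `(left, right)` pairs, the input has at least one scan, and the names `overlaps` and `select_continuous_columns` are hypothetical. The forward and backward passes are an implementation choice of this sketch, not something the text specifies.

```python
def overlaps(a, b):
    """True when two inclusive intervals (left, right) share at least one column."""
    return max(a[0], b[0]) <= min(a[1], b[1])


def select_continuous_columns(white_areas):
    """Keep only white areas lying on a chain that runs from the top scan to the bottom scan."""
    # Forward pass: keep areas reachable from the top scan.
    reachable = [list(white_areas[0])]
    for runs in white_areas[1:]:
        prev = reachable[-1]
        reachable.append([r for r in runs if any(overlaps(r, p) for p in prev)])
    # Backward pass: of those, keep areas that also reach the bottom scan.
    selected = [reachable[-1]]
    for runs in reversed(reachable[:-1]):
        below = selected[0]
        selected.insert(0, [r for r in runs if any(overlaps(r, q) for q in below)])
    return selected
```

A white area that dead-ends partway down, such as a gap inside a character, is dropped because it fails one of the two passes.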

Finally, the character separating means 6 reads out from the character string pattern storage means 3 the patterns corresponding to the areas 12 and 13, which are separated by the column 11 of white areas selected by the continuity checking means 5 and shown in the example of FIG. 3(C). In this way it separates and cuts out the character "2" and the character "3" and outputs them to an external device.

[Problems to be solved by the invention]

As described above, the conventional character cutout device examines the vertical continuity of the white areas extracted from the character string pattern and cuts out, as individual character areas, the areas separated by white areas that are continuous with one another across all scans from the top to the bottom. Consequently, when characters are cut out from a character string containing a character made up of several black connected components (areas in which black pixels are joined by 8-connectivity), hereinafter a "separated character", each connected component is separated and cut out individually.
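To make the notion of a black connected component concrete, the sketch below counts 8-connected groups of black pixels with a simple flood fill. It is only an illustration: the representation (rows of 0 for white and 1 for black) and the name `count_black_components` are assumptions, and the patent does not say how, or whether, the device labels components explicitly.

```python
def count_black_components(image):
    """Count groups of black pixels (1) joined horizontally, vertically, or diagonally."""
    height, width = len(image), len(image[0])
    seen = [[False] * width for _ in range(height)]
    count = 0
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1 or seen[y][x]:
                continue
            count += 1                 # found an unvisited black pixel: new component
            seen[y][x] = True
            stack = [(y, x)]
            while stack:
                cy, cx = stack.pop()
                # Visit all 8 neighbours (the centre pixel is already marked seen).
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < height and 0 <= nx < width
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            stack.append((ny, nx))
    return count
```

A character whose bitmap yields a count greater than one is a separated character in the sense used here.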

For example, FIGS. 4(A) and 4(B) show an example of the result of processing, with the conventional device, a character string containing the separated character "シ". In the figures, 14 is an example character string pattern "ナシタ", 15 is an example of the columns of white areas, continuous across all scans, that the continuity checking means 5 selects from the character string pattern "ナシタ", and 16 to 20 are examples of the areas cut out from the character string pattern "ナシタ" 14.

The conventional device detects, from the character string pattern "ナシタ" 14 shown in FIG. 4(A), the vertically continuous white areas 15 shown in FIG. 4(B), and cuts out the five areas separated by these white areas 15, namely "ナ" 16, "ヽ" 17, "ヽ" 18, "ノ" 19, and "タ" 20, each as a character area.

Thus the conventional device can correctly cut out individual characters from character strings without separated characters, such as numerals, but cannot correctly cut out characters from ordinary Japanese character strings made up of kana, kanji, and the like, which contain separated characters.

The present invention was made to solve the above problem, and its object is to obtain a device that can correctly cut out individual characters even from a character string in which separated characters such as "シ" and "言" intrude into and overlap their neighbors.

[Means for solving problems]

To this end, the character cutout device according to the present invention is provided with separator area detecting means that takes the white areas obtained along the first scanning line as the separator areas of the first scan and, for the second and subsequent scans, when the nth white area is adjacent to a plurality of (n-1)th separator areas, determines the areas common to the nth white area and the (n-1)th separator areas as the nth separator areas. Character patterns are cut out based on these separator areas.

[Action]

The separator area detecting means according to the present invention takes the white areas obtained along the first scanning line as the separator areas of the first scan. For the second and subsequent scans, when the nth white area is adjacent to a plurality of (n-1)th separator areas, it determines the areas common to the nth white area and the (n-1)th separator areas as the nth separator areas. Character patterns are then cut out based on these separator areas.

[Example]

The invention is described in detail below with reference to the drawings.

FIG. 1 shows an embodiment of the present invention. In the figure, 1 to 6 are the same as in the conventional device shown in FIG. 2. Reference numeral 21 denotes separator area detecting means that detects and stores separator areas based on the information on the white areas detected by the white area detecting means 4.

FIG. 5 shows an example of white areas detected from the character string pattern "ナシタ" 14 shown in FIG. 4(A); in the figure, 22 is an example of the detected white areas.

FIG. 6 shows an example of separator areas detected from the character string pattern "ナシタ" 14 shown in FIG. 4(A); in the figure, 23 is an example of the detected separator areas.

FIG. 7 is a diagram for explaining the algorithm by which the separator area detecting means detects separator areas. In the figure, 24 to 26 are examples of separator areas in the (n-1)th scan, 27 and 28 are examples of white areas obtained in the nth scan, and 29 to 31 are examples of separator areas in the nth scan.

Next, taking as an example the case of cutting out characters from the character string pattern "ナシタ" 14 shown in FIG. 4(A), the operation of the embodiment shown in FIG. 1 up to the separator area detecting means is described with reference to FIGS. 5 to 7.

First, the character string written on the sheet 1 is photoelectrically converted by the scanning means 2, and the resulting character string pattern "ナシタ" 14 is passed to the character string pattern storage means 3.

Next, the character string pattern "ナシタ" 14 is passed to the white area detecting means 4. The white area detecting means 4 scans the character string pattern "ナシタ" in the left-right direction, detects the white areas 22 shown in the example of FIG. 5, and transfers the left-end and right-end coordinates of these white areas 22 to the separator area detecting means 21.

The separator area detecting means 21 determines the separator areas in each scan in order, based on the information on the white areas 22 detected by the white area detecting means 4. First, it stores the areas given by the left-end and right-end coordinates of the white areas obtained in the first scan as the separator areas of the first scan. Next, it examines the continuity between all the separator areas of the first scan obtained in this way and each white area obtained in the second scan. When a white area of the second scan is found to be continuous with a plurality of separator areas of the first scan, that white area is divided in correspondence with those separator areas, and each resulting area is determined as a separator area of the second scan. When a white area of the second scan is found that is not continuous with a plurality of separator areas of the first scan, that white area is determined as a single separator area. In the same way, the separator areas of the third scan through the tenth scan are determined in order, and the separator areas 23 shown in the example of FIG. 6 are detected.

Next, the method of extracting separator areas is described in more detail using an example. FIG. 7 shows an example of detecting the separator areas of the nth scan from the separator areas of the (n-1)th scan and the white areas obtained in the nth scan. In FIG. 7, the white area 27 obtained in the nth scan is continuous with only one separator area, 24, of the (n-1)th scan. The white area 27 is therefore determined as the separator area 29 of the nth scan, and its left-end and right-end coordinates are stored. The white area 28 of the nth scan, on the other hand, is continuous with two separator areas of the (n-1)th scan, 25 and 26. This occurs when a black connected component that is part of a separated character lies between the separator area 25 and the separator area 26. The separator area detecting means 21 therefore finds the areas that the white area 28 of the nth scan has in common with the two separator areas 25 and 26 of the (n-1)th scan, divides the white area 28 accordingly, and determines the results as the separator areas 30 and 31 of the nth scan. The left-end and right-end coordinates of each of these separator areas are then stored.
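The separator area rule can be sketched as follows. This is a minimal illustration under assumptions, not the device's actual implementation: areas are inclusive `(left, right)` pairs, "continuous" is taken to mean overlapping intervals as in the continuity check, a white area touching no previous separator area is kept as a single separator area (following the wording of the embodiment), and all function names are hypothetical.

```python
def overlaps(a, b):
    """True when two inclusive intervals (left, right) share at least one column."""
    return max(a[0], b[0]) <= min(a[1], b[1])


def intersect(a, b):
    """Common part of two overlapping inclusive intervals."""
    return (max(a[0], b[0]), min(a[1], b[1]))


def next_separators(prev_separators, white_runs):
    """Derive the separator areas of scan n from those of scan n-1 and the white areas of scan n."""
    result = []
    for w in white_runs:
        touching = [s for s in prev_separators if overlaps(w, s)]
        if len(touching) >= 2:
            # Adjacent to several separator areas: split into the common parts.
            result.extend(intersect(w, s) for s in touching)
        else:
            # Adjacent to at most one separator area: keep the white area as is.
            result.append(w)
    return result


def detect_separator_areas(white_areas):
    """Apply the rule scan by scan, starting from the white areas of the first scan."""
    separators = [list(white_areas[0])]
    for runs in white_areas[1:]:
        separators.append(next_separators(separators[-1], runs))
    return separators
```

With made-up coordinates standing in for FIG. 7, previous separator areas `(0, 3)`, `(6, 9)`, `(13, 16)` and white areas `(1, 4)`, `(7, 15)` give the separator areas `(1, 4)`, `(7, 9)`, `(13, 15)`: the second white area is split where it bridges two previous separator areas, so the gap between them is preserved.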

By taking the common area as the separator area only when a white area is adjacent to a plurality of separator areas, the separated parts are joined only for separated characters, and characters that intrude into and overlap one another are not wrongly joined. For example, the white area at x15, x16 in the fourth scan of FIG. 5 is adjacent to only one white area of the preceding scan, so it becomes a separator area as it is, as shown in FIG. 6; "シ" and "タ" are not joined and are cut out correctly. Likewise, for the character string of FIG. 3, which contains no separated characters, the white area from x7 to x12 in the ninth scan is adjacent to only one white area of the preceding scan, so it becomes a separator area as it is; "2" and "3" are not joined and are cut out correctly.

As described above, the information on the separator areas in each scan obtained by the separator area detecting means 21 of the embodiment shown in FIG. 1 is next transferred to the continuity checking means 5. As in the conventional device, the continuity checking means 5 checks the continuity of the separator areas between vertically adjacent scans and selects the separator areas that are continuous with one another across all scans from the top to the bottom. For example, FIG. 8 shows an example of the columns of separator areas that the continuity checking means 5 selects from the separator areas 23 shown in the example of FIG. 6. In the figure, 32 and 33 are the selected columns of separator areas, and 34 to 36 are the areas separated by the columns 32 and 33.

Finally, the character separating means 6 reads out from the character string pattern storage means 3 the patterns "ナ", "シ", and "タ" corresponding to the areas 34 to 36, which are separated by the columns 32 and 33 of separator areas of FIG. 8 selected by the continuity checking means 5, and outputs them to an external device.
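The final cut-out step can be sketched as masking, on each scan, the pixels that lie between neighboring selected separator columns. The sketch assumes a representation the text does not specify: `columns[y]` lists the selected separator intervals of scan `y` from left to right, every scan has the same number of them, and each character is returned as a full-width bitmap with everything else blanked. The name `cut_characters` is hypothetical.

```python
def cut_characters(image, columns):
    """Split a binary image (0 white, 1 black) into pieces bounded by separator columns."""
    width = len(image[0])
    count = len(columns[0]) + 1          # one piece on each side of every column
    pieces = [[] for _ in range(count)]
    for row, seps in zip(image, columns):
        # Virtual separators just outside the image bound the outermost pieces.
        edges = [(-1, -1)] + list(seps) + [(width, width)]
        for i in range(count):
            left = edges[i][1] + 1       # first pixel after the separator on the left
            right = edges[i + 1][0] - 1  # last pixel before the separator on the right
            piece_row = [0] * width
            piece_row[left:right + 1] = row[left:right + 1]
            pieces[i].append(piece_row)
    # Drop pieces with no black pixels, such as empty margins.
    return [p for p in pieces if any(any(r) for r in p)]
```

Because the boundaries are taken per scan, the cut follows a separator column that slants or shifts between scans, which is what allows characters that intrude into each other's bounding boxes to be separated.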

Although the above embodiment describes cutting out characters from a katakana character string, the invention is not limited to this and may also be used to cut out characters from a character string in which alphabetic characters, hiragana, kanji, and the like are mixed.

Likewise, although the above embodiment describes cutting out characters from a horizontally written character string, the invention is not limited to this and may also be used to cut out characters from a vertically written character string.

[Effect of the invention]

As described above, the character cutout device according to the present invention is provided with separator area detecting means that takes the white areas obtained along the first scanning line as the separator areas of the first scan and, for the second and subsequent scans, when the nth white area is adjacent to a plurality of (n-1)th separator areas, determines the areas common to the nth white area and the (n-1)th separator areas as the nth separator areas, and the device cuts out character patterns based on these separator areas. This yields a device that can correctly cut out individual characters from a character string that contains separated characters such as "シ" and whose characters intrude into and overlap one another.

[Brief description of drawings]

FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention. FIG. 2 is a block diagram showing the configuration of a conventional device. FIGS. 3(A), 3(B), 3(C), 4(A), and 4(B) are diagrams for explaining an example of the operation of the conventional device. FIGS. 5, 6, and 8 are diagrams for explaining an example of the operation of the device of the present invention. FIG. 7 is a diagram for explaining the algorithm for detecting separator areas.

In the figures, 1 is a sheet of paper, 2 is scanning means, 3 is character string pattern storage means, 4 is white area detecting means, 5 is continuity checking means, 6 is character separating means, and 21 is separator area detecting means. The same or corresponding parts are denoted by the same reference numerals throughout the figures.

Claims

[Claims]

1. A character pattern cutout device that optically scans a character string written on a sheet of paper or the like, divides the pixels along each scanning line into white areas and black areas, and cuts out a character pattern for each character, the device comprising separator area detecting means that: takes the white areas obtained along the first scanning line as the separator areas of the first scan; for the second to nth (n being an integer) scanning lines, examines the continuity of each white area obtained on the nth scanning line with the (n-1)th separator areas; when one nth white area is adjacent to a plurality of (n-1)th separator areas, determines as separator areas the areas common to that white area and the adjacent (n-1)th separator areas; and when one nth white area is adjacent to only one (n-1)th separator area, determines that white area as a separator area; wherein character patterns are cut out based on these separator areas.