JP4830331B2

JP4830331B2 - Character image cutting device and program

Info

Publication number: JP4830331B2
Application number: JP2005092520A
Authority: JP
Inventors: 俊哉小山
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2005-03-28
Filing date: 2005-03-28
Publication date: 2011-12-07
Anticipated expiration: 2025-03-28
Also published as: JP2006277092A

Description

本発明は、文字認識処理に用いられる文字画像の切り出し技術に関する。 The present invention relates to a character image clipping technique used for character recognition processing.

文字を示す画像データから抽出した特徴点と、予めデータベースに登録してある文字の特徴点とを比較することにより、画像データにより示される文字を認識し、認識した文字を示すテキストデータを生成する技術がある。 By comparing the feature points extracted from the image data indicating characters with the feature points of characters registered in advance in the database, the characters indicated by the image data are recognized, and text data indicating the recognized characters is generated. There is technology.

上記のような文字認識処理において、画像データが複数の文字を示す場合、その画像データから各々の文字を示す画像データを切り出すことが必要となる。すなわち、画像データに含まれる連続したオン画素群のいずれが１つの文字を構成するものであるかを特定する必要がある。そのような文字画像の切り出し技術を開示したものとして、例えば特許文献１がある。
特開平１０−２０７９８５号公報 In the character recognition process as described above, when image data indicates a plurality of characters, it is necessary to cut out image data indicating each character from the image data. That is, it is necessary to specify which of the continuous on-pixel groups included in the image data constitutes one character. For example, Patent Literature 1 discloses such a character image segmentation technique.
Japanese Patent Laid-Open No. 10-207985

特許文献１に開示の技術によれば、各々のオン画素群を１つの文字の構成要素と仮定して文字認識処理を行った場合の結果および隣接する２以上のオン画素群を１つの文字の構成要素と仮定して文字認識処理を行った場合の結果を、単語辞書に登録されている単語と照合することにより、いずれの結果が正しいものであるかを特定する。 According to the technique disclosed in Patent Document 1, the result of character recognition processing assuming that each on-pixel group is a constituent element of one character and two or more adjacent on-pixel groups of one character By comparing the result of character recognition processing assuming that it is a component with a word registered in the word dictionary, it is specified which result is correct.

上記の従来技術による場合、文字認識処理が複数回行われる上に、文字認識処理の結果を単語と照合する処理が必要となるため、１文字を示す画像データの切り出しに時間がかかる、という問題がある。また、予め単語辞書の整備が必要であり、かつ単語辞書の記憶領域の確保も必要である。 In the case of the above prior art, since the character recognition process is performed a plurality of times and the process of matching the result of the character recognition process with a word is required, it takes time to cut out image data indicating one character. There is. In addition, it is necessary to prepare a word dictionary in advance and to secure a storage area for the word dictionary.

上述の事情に鑑み、本発明は簡便かつ高速に画像データから各々の文字部分を切り出す手段を提供することを目的とする。 In view of the circumstances described above, an object of the present invention is to provide means for cutting out each character portion from image data simply and at high speed.

上述の課題を解決するため、本発明は、平面上に配置された画像を構成する複数の画素の各々の属性値を示す画素データの集まりからなる画像データを取得する画像データ取得手段と、前記画像データ取得手段により取得された画像データに含まれる画素データにより示される画素のうち、所定の閾値を超える属性値を示すオン画素の中から、前記平面上の予め定められた升目により表される領域内において互いに連続して配置されているオン画素の集まりを領域内画像として認識する領域内画像認識手段と、前記領域内画像認識手段により認識された領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在するか否かを判定する判定手段と、前記判定手段により前記オン画素の集まりが存在する場合には当該オン画素の集まりを当該領域内画像に統合した画像を拡張領域内画像として認識し、当該領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在しない場合には当該領域内画像を拡張領域内画像として認識する拡張領域内画像認識手段と、前記拡張領域内画像認識手段により認識された拡張領域内画像を示すデータを、１文字を表す画像データとして出力する出力手段とを備えることを特徴とする文字画像切出装置を提供する。 In order to solve the above-described problem, the present invention provides an image data acquisition unit that acquires image data including a collection of pixel data indicating attribute values of each of a plurality of pixels constituting an image arranged on a plane; Of the pixels indicated by the pixel data included in the image data acquired by the image data acquisition means, the pixel is represented by a predetermined grid on the plane from among the on pixels indicating attribute values exceeding a predetermined threshold value. a region in the image recognition means for recognizing a set of oN pixels as an area in the image being arranged in succession to one another in the region, before Symbol territory outside adjacent to the recognized area in the image by the area in the image recognition means a determination unit configured to determine whether there is a collection of on-pixels arranged mutually in succession, the on if a collection of pixels is present by the determining means There recognizes an image obtained by integrating a group of the ON pixels in the area in the image as an extended area in an image, a collection of on-pixels that are arranged consecutively to each other in front Symbol territory outside and adjacent to the region in an image If not, the extended area image recognition means for recognizing the in-area image as the extended area image, and the data indicating the extended area image recognized by the extended area image recognition means are image data representing one character. And a character image cutting device.

また、本発明は、平面上に配置された画像を構成する複数の画素の各々の属性値を示す画素データの集まりからなる画像データを取得する画像データ取得手段と、前記画像データ取得手段により取得された画像データに含まれる画素データにより示される画素のうち、所定の閾値を超える属性値を示すオン画素の中から、前記平面上の予め定められた升目により表される領域内において互いに連続して配置されているオン画素の集まりを領域内画像として認識する領域内画像認識手段と、前記領域内画像認識手段により認識された領域内画像に対し膨張処理を行い、膨張領域内画像を生成する膨張画像生成手段と、前記膨張画像生成手段により生成された膨張領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在するか否かを判定する判定手段と、前記判定手段により前記オン画素の集まりが存在する場合には当該オン画素の集まりを当該膨張領域内画像に統合した画像を拡張膨張領域内画像として認識し、当該膨張領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在しない場合には当該膨張領域内画像を拡張膨張領域内画像として認識する拡張膨張領域内画像認識手段と、前記拡張膨張領域内画像認識手段により認識された拡張膨張領域内画像を示すデータを、１文字を表す画像データとして出力する出力手段とを備えることを特徴とする文字画像切出装置を提供する。 In addition, the present invention provides an image data acquisition unit that acquires image data including a collection of pixel data indicating attribute values of a plurality of pixels constituting an image arranged on a plane, and the image data acquisition unit acquires the image data. Among the pixels indicated by the pixel data included in the image data that has been processed, the pixels that are consecutive from each other in the area represented by the predetermined grid on the plane are selected from the on pixels that indicate attribute values that exceed a predetermined threshold. An intra-region image recognition unit that recognizes a collection of on-pixels arranged as an intra-region image, and performs an expansion process on the intra-region image recognized by the intra-region image recognition unit to generate an in-expansion region image an expansion image generation means, the dilated image generating means by adjacent the generated expanded area images on the pixels that are arranged consecutively to each other in front Symbol territory outside A determination unit configured to determine whether Mari is present, the determination unit by the ON pixels of the on-pixels gather the expansion region in the integrated image extended expansion area in an image in the image of when the collection is present recognized as recognized extend the expansion region in an image when the collection of oN pixels that are arranged consecutively to each other does not exist in the previous SL territory outside and adjacent to the expansion region in an image as an extended expansion zone in the image Characters comprising: in-expansion area image recognition means; and output means for outputting data indicating the in-expansion area image recognized by the in-expansion area image recognition means as image data representing one character An image cutting device is provided.

好ましい態様において、前記文字画像切出装置は前記拡張膨張領域内画像認識手段により認識された拡張膨張領域内画像に対し収縮処理を行い、収縮拡張膨張領域内画像を生成する収縮画像生成手段をさらに備え、前記出力手段は、拡張膨張領域内画像を示すデータの代わりに、前記収縮画像生成手段により生成された収縮拡張膨張領域内画像を示すデータを出力するように構成されてもよい。 In a preferred aspect, the character image cutout device further includes a contraction image generation unit that performs contraction processing on the image in the expansion expansion region recognized by the image recognition unit in expansion expansion region and generates an image in the expansion expansion region. The output means may be configured to output data indicating a contraction / expansion / expansion area image generated by the contraction image generation means, instead of data indicating the expansion / expansion area image.

また、本発明は、平面上に配置された画像を構成する複数の画素の各々の属性値を示す画素データの集まりからなる画像データを取得する画像データ取得手段と、前記画像データ取得手段により取得された画像データに含まれる画素データにより示される画素のうち、所定の閾値を超える属性値を示すオン画素の中から、前記平面上の予め定められた升目により表される領域内において互いに連続して配置されているオン画素の集まりを領域内画像として認識する領域内画像認識手段と、前記領域内画像認識手段により認識された領域内画像に対し膨張処理を行い、膨張領域内画像を生成する膨張画像生成手段と、前記膨張画像生成手段により生成された膨張領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在するか否かを判定する判定手段と、前記判定手段により前記オン画素の集まりが存在する場合には当該オン画素の集まりを当該膨張領域内画像に統合した画像を拡張膨張領域内画像として認識し、当該膨張領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在しない場合には当該膨張領域内画像を拡張膨張領域内画像として認識する拡張膨張領域内画像認識手段と、前記画像データ取得手段により取得された画像データにより示される画像と前記拡張膨張領域内画像認識手段により認識された拡張膨張領域内画像との両方においてオン画素である画素の集まりを重複画像として認識する重複画像認識手段と、前記重複画像認識手段により認識された重複画像を示すデータを、１文字を表す画像データとして出力する出力手段とを備えることを特徴とする文字画像切出装置を提供する。 In addition, the present invention provides an image data acquisition unit that acquires image data including a collection of pixel data indicating attribute values of a plurality of pixels constituting an image arranged on a plane, and the image data acquisition unit acquires the image data. Among the pixels indicated by the pixel data included in the image data that has been processed, the pixels that are consecutive from each other in the area represented by the predetermined grid on the plane are selected from the on pixels that indicate attribute values that exceed a predetermined threshold. An intra-region image recognition unit that recognizes a collection of on-pixels arranged as an intra-region image, and performs an expansion process on the intra-region image recognized by the intra-region image recognition unit to generate an in-expansion region image an expansion image generation means, the dilated image generating means by adjacent the generated expanded area images on the pixels that are arranged consecutively to each other in front Symbol territory outside A determination unit configured to determine whether Mari is present, the determination unit by the ON pixels of the on-pixels gather the expansion region in the integrated image extended expansion area in an image in the image of when the collection is present recognized as recognized extend the expansion region in an image when the collection of oN pixels that are arranged consecutively to each other does not exist in the previous SL territory outside and adjacent to the expansion region in an image as an extended expansion zone in the image Pixels that are ON pixels in both the in-expansion region image recognition unit, the image indicated by the image data acquired by the image data acquisition unit, and the expansion in-expansion region image recognition unit A duplicate image recognition means for recognizing a set of images as a duplicate image, and data representing the duplicate image recognized by the duplicate image recognition means as one character. Providing a character image extraction device characterized by an output means for outputting as image data.

また、本発明は、平面上に配置された画像を構成する複数の画素の各々の属性値を示す画素データの集まりからなる画像データを取得する画像データ取得手段と、前記画像データ取得手段により取得された画像データに含まれる画素データにより示される画素のうち、所定の閾値を超える属性値を示すオン画素の中から、前記平面上の予め定められた升目により表される領域内において互いに連続して配置されているオン画素の集まりを領域内画像として認識する領域内画像認識手段と、前記領域内画像認識手段により認識された領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在するか否かを判定する判定手段と、前記判定手段により前記オン画素の集まりが存在する場合には当該オン画素の集まりを当該領域内画像に統合した画像を拡張領域内画像として認識し、当該領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在しない場合には当該領域内画像を拡張領域内画像として認識する拡張領域内画像認識手段と、前記拡張領域内画像認識手段により認識された拡張領域内画像に外接する前記平面上の所定の形状の領域を拡張領域として認識する拡張領域認識手段と、前記拡張領域認識手段により認識された拡張領域内に含まれるオン画素の集まりにより示される画像を示すデータを、１文字を表す画像データとして出力する出力手段とを備えることを特徴とする文字画像切出装置を提供する。 In addition, the present invention provides an image data acquisition unit that acquires image data including a collection of pixel data indicating attribute values of a plurality of pixels constituting an image arranged on a plane, and the image data acquisition unit acquires the image data. Among the pixels indicated by the pixel data included in the image data that has been processed, the pixels that are consecutive from each other in the area represented by the predetermined grid on the plane are selected from the on pixels that indicate attribute values that exceed a predetermined threshold. a region in the image recognition means for recognizing a group of on-pixels arranged as an area in an image Te, arranged in series with each other in adjacent pre Symbol territory outside the recognition area in the image by the area in the image recognition means a determination unit configured to determine whether a collection of on-pixels are there, collecting of the oN pixels in the case where the collection of the oN pixels is present by the determining means Ri and recognizes images integrated in the area in the image as an extended area within the image, the in the case where collection of ON pixels that are arranged consecutively to each other in front Symbol territory outside and adjacent to the region in an image is not present An extension area image recognition means for recognizing an in-area image as an extension area image, and an area of a predetermined shape on the plane that circumscribes the extension area image recognized by the extension area image recognition means as an extension area An extended area recognizing means for recognizing; and an output means for outputting data indicating an image indicated by a collection of on-pixels included in the extended area recognized by the extended area recognizing means as image data representing one character. A character image cutting device is provided.

好ましい態様において、前記文字画像切出装置の前記領域内画像認識手段は、前記拡張領域認識手段により拡張領域の認識が行われた後、前記拡張領域認識手段により認識された拡張領域を前記領域として前記領域内画像の認識処理を再実行し、前記拡張領域内画像認識手段は、前記領域内画像認識手段による認識処理の再実行により認識された領域内画像に対し前記拡張領域内画像の認識処理を再実行し、前記拡張領域認識手段は、前記拡張領域内画像認識手段による認識処理の再実行により認識された拡張領域内画像に対し前記拡張領域の認識処理を再実行し、前記出力手段は、前記拡張領域認識手段による認識処理の再実行により認識された拡張領域内に含まれるオン画素の集まりにより示される画像を示すデータを、前記１文字を表す画像データとして出力するように構成されてもよい。 In a preferred embodiment, the area in the image recognition means of the character image extraction apparatus, the extension after the recognition of the extended region is performed by the region recognizing means, before Symbol territory recognized extended regions by the extended region recognizing means And re-execution of the recognition process of the in-region image as a region, and the extended region image recognition means performs the processing of the extension in-region image with respect to the in-region image recognized by re-execution of the recognition process by the in-region image recognition unit Re-execution of recognition processing, the extension region recognition means re-executes recognition processing of the extension region on the image in the extension region recognized by re-execution of recognition processing by the image recognition means in the extension region, and the output Means for representing data representing an image indicated by a collection of on-pixels included in an extended area recognized by re-execution of recognition processing by the extended area recognizing means; It may be configured to output as image data.

また、本発明は、平面上に配置された画像を構成する複数の画素の各々の属性値を示す画素データの集まりからなる画像データを取得する画像データ取得手段と、前記画像データ取得手段により取得された画像データに含まれる画素データにより示される画素のうち、所定の閾値を超える属性値を示すオン画素の中から、前記平面上の予め定められた升目により表される領域内において互いに連続して配置されているオン画素の集まりを領域内画像として認識する領域内画像認識手段と、前記領域内画像認識手段により認識された１以上の領域内画像のうち少なくとも１の領域内画像に関し、当該領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在するか否かを判定する判定手段と、前記判定手段により前記オン画素の集まりが存在する場合、当該領域内画像を前記領域内画像認識手段により認識された領域内画像の集まりから除外する領域内画像除外手段と、前記領域内画像認識手段により認識された領域内画像のうち前記領域内画像除外手段により除外されなかった領域内画像の集まりを、１文字を表す画像データとして出力する出力手段とを備えることを特徴とする文字画像切出装置を提供する。 In addition, the present invention provides an image data acquisition unit that acquires image data including a collection of pixel data indicating attribute values of a plurality of pixels constituting an image arranged on a plane, and the image data acquisition unit acquires the image data. Among the pixels indicated by the pixel data included in the image data that has been processed, the pixels that are consecutive from each other in the area represented by the predetermined grid on the plane are selected from the on pixels that indicate attribute values that exceed a predetermined threshold. An intra-region image recognition means for recognizing a collection of on-pixels arranged as an intra-region image, and at least one intra-region image among at least one intra-region image recognized by the intra-region image recognition means, a determination unit configured to determine whether there is a collection of on-pixels that are arranged consecutively to each other in front Symbol territory outside adjacent to the region in an image, said determining means If there are more group of the on-pixel, and the area in the image excluding means excludes from collection areas within the image that has been recognized by the region within the image the area in the image recognition unit are recognized by the region image recognition means A character image cutout device comprising: output means for outputting a collection of in-area images that are not excluded by the in-area image excluding means among the in-area images, as image data representing one character. To do.

好ましい態様において、前記文字画像切出装置の前記領域内画像除外手段は、１の領域内画像に関し、当該領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在する場合、当該オン画素の集まりの占める面積よりも当該１の領域内画像の占める面積が狭いときには、当該１の領域内画像を前記領域内画像認識手段により認識された領域内画像の集まりから除外し、当該オン画素の集まりの占める面積よりも当該１の領域内画像の占める面積が広いときには、当該１の領域内画像を前記領域内画像の集まりから除外しないように構成されてもよい。 In a preferred aspect, the in-region image excluding unit of the character image cutting device includes a group of on pixels that are adjacent to the in-region image and are continuously arranged outside the region with respect to the one in-region image. If present, if the area occupied by the image in one region is smaller than the area occupied by the set of on-pixels, the image in the one region is extracted from the set of images in the region recognized by the image recognition means in the region. If the area occupied by the image in one region is larger than the area occupied by the collection of on-pixels, the image in the one area may not be excluded from the collection of images in the region .

また、他の好ましい態様において、前記文字画像切出装置は前記領域内画像除外手段により除外されなかった領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在する場合には当該オン画素の集まりを当該領域内画像に統合した画像を拡張領域内画像として認識し、当該領域内画像に隣接し前記領域外において互いに連続して配置されているオン画素の集まりが存在しない場合には当該領域内画像を拡張領域内画像として認識する拡張領域内画像認識手段をさらに備え、前記出力手段は、前記拡張領域内画像認識手段により認識された拡張領域内画像を示すデータを、前記１文字を表す画像データとして出力するように構成されてもよい。 Further, in another preferred embodiment, the character image extraction apparatus collection of ON pixels that are arranged consecutively to each other in front Symbol territory outside adjacent to that have not been region image excluding by the region image excluding means oN pixels in which the a group of on-pixels to recognize an image integrated in the area in the image as an extended area in the image, is arranged in series with each other before Symbol territory outside and adjacent to the region in the image, if present If there is no set of images, the image processing apparatus further includes an extended area image recognition means for recognizing the image in the area as an extended area image, and the output means recognizes the extended area image recognized by the extended area image recognition means. May be output as image data representing the one character.

また、本発明は、上記いずれかに記載の前記文字画像切出装置が行う処理と同様の処理をコンピュータに実行させることを特徴とするプログラムを提供する。 In addition , the present invention provides a program that causes a computer to execute the same process as the process performed by any one of the character image cutting apparatuses described above.

本発明によれば、升目からはみ出して書かれた文字を示す画像から、正しく１文字を示す画像が切り出されるため、高い精度の文字認識が可能となる。 According to the present invention, since an image that correctly shows one character is cut out from an image that shows characters that are written out of the cell, highly accurate character recognition is possible.

［実施形態］
以下、本発明の好適な実施形態を説明する。図１は以下に説明する実施形態にかかる文字認識システム１の構成を示したブロック図である。文字認識システム１は、複数の文字を示す画像データから各々の文字を示す画像データを切り出して送信する文字画像切出装置１０と、文字画像切出装置１０に対し複数の文字を示す画像データを送信するタブレットＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）１１と、文字画像切出装置１０から送信される各々の文字を示す画像データに対し文字認識処理を行い認識した文字を示すテキストデータを生成する文字認識装置１２を備えている。 [Embodiment]
Hereinafter, preferred embodiments of the present invention will be described. FIG. 1 is a block diagram showing a configuration of a character recognition system 1 according to an embodiment described below. The character recognition system 1 includes a character image cutting device 10 that cuts and transmits image data indicating each character from image data indicating a plurality of characters, and image data indicating a plurality of characters to the character image cutting device 10. A tablet PC (Personal Computer) 11 for transmission and a character recognition device 12 for generating text data indicating a recognized character by performing character recognition processing on image data indicating each character transmitted from the character image cutting device 10 I have.

タブレットＰＣ１１は、液晶ディスプレイに積層されたペンタブレット型入力デバイスを備え、ユーザが液晶ディスプレイに表示される升目に対しペン型のスタイラスを用いて筆記動作を行うと、ペンタブレット型入力デバイスがその筆記動作における筆圧の加えられた位置および筆圧の大きさを測定し、それらの測定値に基づきユーザにより書かれた文字を示す画像データを生成する。 The tablet PC 11 includes a pen tablet-type input device stacked on a liquid crystal display. When a user performs a writing operation using a pen-type stylus on a grid displayed on the liquid crystal display, the pen tablet-type input device performs the writing operation. The position where the pen pressure is applied in the operation and the magnitude of the pen pressure are measured, and image data indicating characters written by the user is generated based on the measured values.

［１．第１実施形態］
図２は、ユーザがタブレットＰＣ１１に表示される４文字分の升目に対し「榊原正義」なる文字列を筆記した場合に、従来技術により第２の升目から切り出される画像データ（図２（ａ））と、本発明の第１実施形態にかかる文字画像切出装置１０−１により第２の升目から切り出される画像データ（図２（ｂ））とを用いて、文字認識装置１２が文字認識処理を行った際に得られるテキストデータを比較して示した図である。 [1. First Embodiment]
FIG. 2 shows image data cut out from the second square by the conventional technique when the user writes a character string “Masayoshi Sugawara” for the four-character square displayed on the tablet PC 11 (FIG. 2A). ) And image data cut out from the second grid by the character image cutout device 10-1 according to the first embodiment of the present invention (FIG. 2B), the character recognition device 12 performs character recognition processing. It is the figure which compared and showed the text data obtained when performing.

図２に示されるように、升目から「原」の文字の一部がはみ出して書かれている場合、従来技術によれば升目からはみ出した画像部分を含まない画像データが文字認識装置１２に対し出力されるため、その画像データを用いた文字認識が正しく行われない場合が多い。これに対し、文字画像切出装置１０−１による場合、升目の外に配置されている画像であっても、升目の中に配置されている画像と連続している部分を含む画像データが文字認識装置１２に対し出力されるため、その画像データを用いた文字認識が正しく行わる可能性が従来技術による場合と比較して高い。 As shown in FIG. 2, when a part of the “original” character is written out from the cell, according to the prior art, image data that does not include the image portion that protrudes from the cell is transmitted to the character recognition device 12. Therefore, character recognition using the image data is often not performed correctly. On the other hand, in the case of the character image cutting device 10-1, even if the image is arranged outside the cell, image data including a portion continuous with the image arranged in the cell is a character. Since it is output to the recognition device 12, the possibility that character recognition using the image data is correctly performed is higher than in the case of the prior art.

図３は文字画像切出装置１０−１の構成を示したブロック図である。文字画像切出装置１０−１は、文字画像切出装置１０−１の構成部を制御する制御部１０１と、制御部１０１による各種処理を指示するプログラムおよび各種データを記憶するとともに制御部１０１のワークエリアとして用いられる記憶部１０２を備えている。 FIG. 3 is a block diagram showing the configuration of the character image cutting device 10-1. The character image cutout device 10-1 stores a control unit 101 that controls the components of the character image cutout device 10-1, a program that instructs various processes by the control unit 101, and various types of data. A storage unit 102 used as a work area is provided.

制御部１０１は、タブレットＰＣ１１から文字列を示す画像データと、タブレットＰＣ１１において表示される升目の各々の領域を示す領域データとを受信する画像データ入力部１０１１を備えている。画像データ入力部１０１１はタブレットＰＣ１１から受信した画像データおよび領域データをそれぞれ記憶部１０２に画像データ１０２１および領域データ１０２２として記憶する。画像データ１０２１は、平面上に等間隔で配置された画素に対応する画素データの集合であり、各画素データには例えば彩度データ、明度データおよび色相データが含まれている。 The control unit 101 includes an image data input unit 1011 that receives image data indicating a character string from the tablet PC 11 and region data indicating each region of the grid displayed on the tablet PC 11. The image data input unit 1011 stores the image data and area data received from the tablet PC 11 in the storage unit 102 as image data 1021 and area data 1022, respectively. The image data 1021 is a set of pixel data corresponding to pixels arranged at equal intervals on a plane, and each pixel data includes, for example, saturation data, lightness data, and hue data.

制御部１０１は画像データ１０２１により示される画像データを、白黒の２値画像データに変換する２値化部１０１２を備えている。２値化部１０１２は、画像データ１０２１に含まれる画素データのうち明度データが所定の閾値を超えるものを値「１」をとるオン画素データ、それ以外を値「０」をとるオフ画素データに変換することにより、２値画像データを生成する。２値化部１０１２は生成した２値画像データを２値画像データ１０２３として記憶部１０２に記憶する。 The control unit 101 includes a binarization unit 1012 that converts image data indicated by the image data 1021 into monochrome binary image data. The binarization unit 1012 converts the pixel data included in the image data 1021 into light-on pixel data having a value “1” when the lightness data exceeds a predetermined threshold value, and off-pixel data having a value “0” for the other pixel data. By converting, binary image data is generated. The binarization unit 1012 stores the generated binary image data in the storage unit 102 as binary image data 1023.

制御部１０１は、２値画像データ１０２３により示される画素画像データのうち、領域データ１０２２により示される各々の升目の領域に含まれる画像データの部分を取り出し、それらに含まれる連続する画像部分を各々分離して認識する領域内画像認識部１０１３を備えている。以下、第２の升目の領域に含まれる画像データに関し、文字画像切出装置１０−１において行われる処理を説明する。領域内画像認識部１０１３は、第２の升目の領域（以下、「所定領域」と呼ぶ）に含まれるオン画素データを、互いに連続するオン画素データ群にグループ化する。以下、領域内画像認識部１０１３により１つのグループと認識されたオン画素データ群により示される画像の各々を「領域内画像」と呼ぶ。領域内画像認識部１０１３は、認識した領域内画像の各々に対し識別ラベルを付加する。図２に示されるＬ１〜Ｌ４は、領域内画像認識部１０１３により付加された識別ラベルを示している。 The control unit 101 extracts the portion of the image data included in each square area indicated by the area data 1022 from the pixel image data indicated by the binary image data 1023, and each successive image portion included in each of the image data is indicated. An in-region image recognition unit 1013 that recognizes separately is provided. Hereinafter, the process performed in the character image cutting device 10-1 regarding the image data contained in the area of the second cell will be described. The in-region image recognition unit 1013 groups on-pixel data included in a second mesh region (hereinafter referred to as “predetermined region”) into a group of on-pixel data that are continuous with each other. Hereinafter, each of the images indicated by the on-pixel data group recognized as one group by the in-region image recognition unit 1013 is referred to as an “in-region image”. The in-region image recognition unit 1013 adds an identification label to each recognized in-region image. L1 to L4 shown in FIG. 2 indicate identification labels added by the in-region image recognition unit 1013.

制御部１０１は、領域内画像を所定領域外の連続部分にまで拡張する拡張領域内画像認識部１０１４を備えている。領域内画像認識部１０１３は識別ラベルを付加した領域内画像を示すデータ（以下、「領域内画像データ」と呼ぶ）を拡張領域内画像認識部１０１４に引き渡す。拡張領域内画像認識部１０１４は、領域内画像認識部１０１３から受け取った領域内画像データおよび２値画像データ１０２３を用いて、領域内画像の各々に関し、所定領域外に連続するオン画素群が存在するか否かを判定し、連続したオン画素群があるものに関しては、当該オン画素群を領域内画像に統合することにより領域内画像を拡張する。具体的には、図２に示されるように、識別ラベルＬ１の領域内画像が拡張領域内画像認識部１０１４により拡張される。以下、拡張領域内画像認識部１０１４による拡張処理の施された領域内画像を「拡張領域内画像」と呼ぶ。 The control unit 101 includes an in-region image recognition unit 1014 that extends the in-region image to a continuous part outside the predetermined region. The in-region image recognition unit 1013 delivers data indicating the in-region image to which the identification label is added (hereinafter referred to as “in-region image data”) to the extended in-region image recognition unit 1014. The extended in-region image recognition unit 1014 uses the in-region image data and the binary image data 1023 received from the in-region image recognition unit 1013, and for each of the in-region images, there is a continuous on pixel group outside the predetermined region. In the case where there are continuous on-pixel groups, the in-region image is expanded by integrating the on-pixel group into the in-region image. Specifically, as shown in FIG. 2, the in-region image of the identification label L <b> 1 is expanded by the extended region image recognition unit 1014. Hereinafter, the in-region image that has been subjected to the extension process by the in-extension region image recognition unit 1014 is referred to as an “extended region image”.

制御部１０１は、１文字を示す画像データを文字認識装置１２に送信する画像データ出力部１０１５を備えている。拡張領域内画像認識部１０１４は拡張領域内画像を示す拡張領域内画像データを画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った拡張領域内画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 The control unit 101 includes an image data output unit 1015 that transmits image data representing one character to the character recognition device 12. The extended area image recognition unit 1014 delivers the extended area image data indicating the extended area image to the image data output unit 1015. The image data output unit 1015 transmits the received image data in the extended area to the character recognition device 12 as image data indicating one character.

以上のように文字画像切出装置１０−１により切り出され文字認識装置１２に送信される画像データは、升目からはみ出して書かれた文字部分を含む画像を示すため、文字認識処理において高い精度の結果をもたらすものとなる。 As described above, the image data cut out by the character image cutout device 10-1 and transmitted to the character recognition device 12 shows an image including a character portion that is written out of the cell, so that the character recognition process has high accuracy. Result.

［２．第２実施形態］
図４は、ユーザがタブレットＰＣ１１に表示される４文字分の升目に対し「榊原正義」なる文字列を筆記した場合に、従来技術により第１の升目から切り出される画像データ（図４（ａ））と、本発明の第２実施形態にかかる文字画像切出装置１０−２により第１の升目から切り出される画像データ（図４（ｂ））とを用いて、文字認識装置１２が文字認識処理を行った際に得られるテキストデータを比較して示した図である。 [2. Second Embodiment]
FIG. 4 shows image data cut out from the first square by the conventional technique when the user writes a character string “Masayoshi Sugawara” for the four-character square displayed on the tablet PC 11 (FIG. 4A). ) And image data cut out from the first cell by the character image cutout device 10-2 according to the second embodiment of the present invention (FIG. 4B), the character recognition device 12 performs character recognition processing. It is the figure which compared and showed the text data obtained when performing.

図４に示されるように、「榊」の文字の一部が升目から完全に外れている場合、従来技術によれば升目から外れている画像部分を含まない画像データが文字認識装置１２に対し出力されるため、その画像データを用いた文字認識は正しく行われない。これに対し、文字画像切出装置１０−２による場合、升目の外に配置されている画像であっても、升目の中に配置されている画像と近接している部分を含む画像データが文字認識装置１２に対し出力されるため、その画像データを用いた文字認識が正しく行わる可能性が高い。 As shown in FIG. 4, when a part of the character “榊” is completely out of the cell, according to the prior art, image data that does not include an image part that is out of the cell is transferred to the character recognition device 12. Therefore, character recognition using the image data is not performed correctly. On the other hand, in the case of the character image cutting device 10-2, even if the image is arranged outside the cell, image data including a portion close to the image arranged in the cell is character. Since it is output to the recognition device 12, there is a high possibility that character recognition using the image data will be performed correctly.

図５は文字画像切出装置１０−２の構成を示したブロック図である。文字画像切出装置１０−２の構成および動作は多くの点で文字画像切出装置１０−１のそれらと共通しているため、以下、文字画像切出装置１０−２が文字画像切出装置１０−１と異なる点のみを説明する。また、以下の説明において参照する図において、文字画像切出装置１０−１の構成部に対応する構成部には文字画像切出装置１０−１において用いられたものと同じ符号が付されている。 FIG. 5 is a block diagram showing the configuration of the character image cutting device 10-2. Since the configuration and operation of the character image cutting device 10-2 are the same as those of the character image cutting device 10-1 in many respects, the character image cutting device 10-2 is hereinafter referred to as the character image cutting device. Only differences from 10-1 will be described. Moreover, in the figure referred in the following description, the same code | symbol as what was used in the character image cutting device 10-1 is attached | subjected to the component corresponding to the component of the character image cutting device 10-1. .

文字画像切出装置１０−２の制御部１０１は領域内画像に対し膨張処理を行う膨張画像生成部２０１１を備えている。膨張画像生成部２０１１は、領域内画像認識部１０１３から領域内画像データを受け取り、受け取った領域内画像データにより示される領域内画像の各々の外輪を示すオン画素データに隣接するオフ画素データをオン画素データに書き換える処理を所定回数だけ繰り返すことにより、領域内画像の膨張を行う。以下、膨張画像生成部２０１１による膨張処理の施された領域内画像を「膨張領域内画像」と呼ぶ。 The control unit 101 of the character image cutting device 10-2 includes an expanded image generation unit 2011 that performs an expansion process on the in-region image. The dilated image generation unit 2011 receives the intra-region image data from the intra-region image recognition unit 1013 and turns on the off-pixel data adjacent to the on-pixel data indicating each outer ring of the intra-region image indicated by the received intra-region image data. By repeating the process of rewriting the pixel data a predetermined number of times, the in-region image is expanded. Hereinafter, the in-region image subjected to the expansion process by the expanded image generation unit 2011 is referred to as an “inflated region image”.

文字画像切出装置１０−２の制御部１０１は、文字画像切出装置１０−１の制御部１０１が備える拡張領域内画像認識部１０１４の代わりに拡張膨張領域内画像認識部２０１２を備えている。膨張画像生成部２０１１は膨張領域内画像を示す膨張領域内画像データを拡張膨張領域内画像認識部２０１２に引き渡す。 The control unit 101 of the character image cutting device 10-2 includes an expanded in-region image recognition unit 2012 instead of the in-expansion region image recognition unit 1014 provided in the control unit 101 of the character image cutting device 10-1. . The expansion image generation unit 2011 delivers the expansion area image data indicating the expansion area image to the expansion expansion area image recognition unit 2012.

拡張膨張領域内画像認識部２０１２は膨張画像生成部２０１１から受け取った膨張領域内画像データおよび２値画像データ１０２３を用いて、膨張領域内画像の各々に関し、所定領域外に連続するオン画素群が存在するか否かを判定し、連続したオン画素群があるものに関しては、当該オン画素群を膨張領域内画像に統合することにより膨張領域内画像を拡張する。具体的には、図４に示されるように、識別ラベルＬ２の膨張領域内画像が拡張膨張領域内画像認識部２０１２により拡張される。以下、拡張膨張領域内画像認識部２０１２による拡張処理の施された領域内画像を「拡張膨張領域内画像」と呼ぶ。 The expanded in-region image recognizing unit 2012 uses the in-expanded region image data and the binary image data 1023 received from the expanded image generating unit 2011, and for each of the in-expanded region images, an on-pixel group continuous outside the predetermined region is obtained. It is determined whether or not it exists, and for those having a continuous on-pixel group, the on-pixel group is expanded by integrating the on-pixel group into the in-expansion area image. Specifically, as shown in FIG. 4, the in-expanded region image recognition unit 2012 expands the in-expanded region image of the identification label L2. Hereinafter, the in-region image subjected to the expansion processing by the inflated region in-region image recognition unit 2012 is referred to as “an inflated region image”.

拡張膨張領域内画像認識部２０１２は拡張膨張領域内画像を示す拡張膨張領域内画像データを画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った拡張膨張領域内画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 The inflated region image recognition unit 2012 passes the inflated region image data indicating the inflated region image to the image data output unit 1015. The image data output unit 1015 transmits the received image data in the expanded expansion area to the character recognition device 12 as image data indicating one character.

以上のように文字画像切出装置１０−２により切り出され文字認識装置１２に送信される画像データは、升目から外れて書かれた文字部分を含む画像を示すため、文字認識処理において高い精度の結果をもたらすものとなる。 As described above, the image data cut out by the character image cutout device 10-2 and transmitted to the character recognition device 12 shows an image including a character portion that is written out of the cell, so that the character recognition process has high accuracy. Result.

ところで、拡張膨張領域内画像認識部２０１２により生成される拡張膨張領域内画像は、升目内の画像に関しては膨張処理が施され、升目外の画像に関しては膨張処理が施されていない。一般的に、オリジナルの画像を用いて文字認識が行われる場合の方が、膨張処理が施されている画像を用いて文字認識が行われる場合と比較して、高い精度の結果が得られる。そこで、文字画像切出装置１０−２に対し以下の変形を加えてもよい。 By the way, the expansion in-expansion area image generated by the expansion in-expansion area image recognition unit 2012 is subjected to the expansion process for the image in the grid and is not subjected to the expansion process for the image outside the grid. In general, a result with higher accuracy is obtained when character recognition is performed using an original image than when character recognition is performed using an image subjected to dilation processing. Therefore, the following modifications may be added to the character image cutting device 10-2.

図６は、文字画像切出装置１０−２の制御部１０１に、拡張膨張領域内画像に対し収縮処理を行う収縮画像生成部２０１３を備えさせた場合の構成を示したブロック図である。収縮画像生成部２０１３は拡張膨張領域内画像認識部２０１２と画像データ出力部１０１５の間に介挿され、拡張膨張領域内画像認識部２０１２から拡張膨張領域内画像データを受け取り、受け取った拡張膨張領域内画像データにより示される拡張膨張領域内画像の各々の外輪を示すオン画素データをオフ画素データに書き換える処理を所定回数だけ繰り返すことにより、拡張膨張領域内画像の収縮を行う。以下、収縮画像生成部２０１３による収縮処理の施された拡張膨張領域内画像を「収縮拡張膨張領域内画像」と呼ぶ。 FIG. 6 is a block diagram illustrating a configuration when the control unit 101 of the character image cutting device 10-2 includes a contracted image generation unit 2013 that performs contraction processing on an image in the expanded expansion region. The contracted image generation unit 2013 is interposed between the expansion expansion region internal image recognition unit 2012 and the image data output unit 1015, receives the expansion expansion region internal image data from the expansion expansion region internal image recognition unit 2012, and receives the received expansion expansion region The process of rewriting the on-pixel data indicating the outer ring of each of the images in the expansion / expansion area indicated by the internal image data with the off-pixel data is repeated a predetermined number of times to contract the image in the expansion / expansion area. Hereinafter, the image in the expansion / expansion area subjected to the contraction process by the contraction image generation unit 2013 is referred to as “image in the contraction / expansion / expansion area”.

収縮拡張膨張領域内画像は、升目内の画像に関してはオリジナルの画像と同様の太さの画像を、また升目外の画像に関してはオリジナルの画像と比較して細い画像を含んでいる。一般的に文字認識処理においては、画像の特徴点を抽出するために、太さを持った画像を細線化する処理が行われるため、収縮拡張膨張領域内画像において升目外の画像がオリジナルの画像に比べ細い点は文字認識の精度にあまり悪影響を与えない。一方、収縮拡張膨張領域内画像に含まれる升目内の画像が拡張膨張領域内画像と比較してオリジナルに近似する太さである結果、より高い精度の文字認識をもたらす画像となる。 The image within the contraction / expansion / expansion region includes an image having the same thickness as the original image with respect to the image within the mesh, and a thin image with respect to the image outside the mesh compared with the original image. In general, in character recognition processing, in order to extract feature points of an image, processing for thinning an image having a thickness is performed. Compared with, the thin point does not have a bad influence on the accuracy of character recognition. On the other hand, the image in the grid included in the image in the contraction / expansion / expansion area has a thickness that approximates the original as compared with the image in the expansion / expansion area, resulting in an image that provides higher-accuracy character recognition.

収縮画像生成部２０１３は、収縮拡張膨張領域内画像を示す収縮拡張膨張領域内画像データを画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った収縮拡張膨張領域内画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 The contracted image generation unit 2013 delivers the image data in the contraction / expansion / expansion area indicating the image in the contraction / expansion / expansion area to the image data output unit 1015. The image data output unit 1015 transmits the received image data in the contraction / expansion region to the character recognition device 12 as image data indicating one character.

また、図７は、文字画像切出装置１０−２の制御部１０１に、拡張膨張領域内画像と２値画像データ１０２３により示される画像との重複部分の画像を取り出す重複画像認識部２０１４を備えさせた場合の構成を示すブロック図である。重複画像認識部２０１４は拡張膨張領域内画像認識部２０１２と画像データ出力部１０１５の間に介挿され、拡張膨張領域内画像認識部２０１２から拡張膨張領域内画像データを受け取り、受け取った拡張膨張領域内画像データに含まれる画素データの各々に関し、当該画素データに対応する２値画像データ１０２３に含まれる画素データとの間の論理積を算出する。重複画像認識部２０１４により算出された論理積の結果を値とする画素データの集まりは、拡張膨張領域内画像と２値画像データ１０２３により示される画像との重複画像を示すデータである。 Further, in FIG. 7, the control unit 101 of the character image cutting device 10-2 includes an overlapping image recognition unit 2014 that extracts an image of an overlapping portion between the image in the expanded expansion area and the image indicated by the binary image data 1023. It is a block diagram which shows the structure at the time of making it. The overlapping image recognition unit 2014 is interposed between the expansion expansion region image recognition unit 2012 and the image data output unit 1015, receives the expansion expansion region image data from the expansion expansion region image recognition unit 2012, and receives the received expansion expansion region For each piece of pixel data included in the internal image data, a logical product between the pixel data included in the binary image data 1023 corresponding to the pixel data is calculated. A collection of pixel data whose value is the result of the logical product calculated by the overlapping image recognition unit 2014 is data indicating an overlapping image of the image in the expanded expansion area and the image indicated by the binary image data 1023.

上記のように重複画像認識部２０１４により認識される重複画像は、オリジナルの画像の中から、拡張膨張領域内画像により占められる範囲に含まれる部分を取り出したものである。従って、膨張処理が施された拡張膨張領域内画像と比較して、より高い精度の文字認識をもたらす画像となる。 As described above, the overlapping image recognized by the overlapping image recognition unit 2014 is obtained by extracting a portion included in the range occupied by the image in the expanded expansion area from the original image. Therefore, compared with the image in the expanded expansion area that has been subjected to the expansion process, the image can provide character recognition with higher accuracy.

重複画像認識部２０１４は、重複画像を示すデータを画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った重複画像を示すデータを、１文字を示す画像データとして文字認識装置１２に対し送信する。 The duplicate image recognition unit 2014 passes data indicating the duplicate image to the image data output unit 1015. The image data output unit 1015 transmits the received data indicating the duplicate image to the character recognition device 12 as image data indicating one character.

［３．第３実施形態］
図８は、第２実施形態の場合と同様に、ユーザがタブレットＰＣ１１に表示される４文字分の升目に対し「榊原正義」なる文字列を筆記した場合に、本発明の第３実施形態にかかる文字画像切出装置１０−３により第１の升目から画像データが切り出される様子を示した図である。 [3. Third Embodiment]
FIG. 8 shows the third embodiment of the present invention when the user writes a character string “Masayoshi Sugawara” for the four-character cell displayed on the tablet PC 11 as in the case of the second embodiment. It is the figure which showed a mode that image data was cut out from the 1st cell by this character image cutting device 10-3.

図９は文字画像切出装置１０−３の構成を示したブロック図である。文字画像切出装置１０−３の構成および動作は多くの点で文字画像切出装置１０−２のそれらと共通しているため、以下、文字画像切出装置１０−３が文字画像切出装置１０−２と異なる点のみを説明する。また、以下の説明において参照する図において、文字画像切出装置１０−１もしくは文字画像切出装置１０−２の構成部に対応する構成部には文字画像切出装置１０−１もしくは文字画像切出装置１０−２において用いられたものと同じ符号が付されている。 FIG. 9 is a block diagram showing the configuration of the character image cutting device 10-3. Since the configuration and operation of the character image cutting device 10-3 are similar to those of the character image cutting device 10-2 in many respects, the character image cutting device 10-3 is hereinafter referred to as the character image cutting device. Only differences from 10-2 will be described. In the drawings referred to in the following description, the component corresponding to the component of the character image cutting device 10-1 or the character image cutting device 10-2 includes the character image cutting device 10-1 or the character image cutting device. The same code | symbol as what was used in the taking-out apparatus 10-2 is attached | subjected.

文字画像切出装置１０−３の制御部１０１は領域内画像認識部１０１３の代わりに、２値画像データ１０２３に含まれる連続する画像部分を各々分離して認識する画像認識部３０１１を備えている。画像認識部３０１１は、２値画像データ１０２３に含まれるオン画素データを、互いに連続するオン画素データ群にグループ化し、各々のオン画素データを部分画像として認識する。画像認識部３０１１は、そのように認識した部分画像の各々に対し識別ラベルを付加する。画像認識部３０１１は識別ラベルを付加した部分画像を示すデータ（以下、「部分画像データ」と呼ぶ）を膨張画像生成部２０１１に引き渡す。 The control unit 101 of the character image cutout device 10-3 includes an image recognition unit 3011 that separately recognizes successive image portions included in the binary image data 1023, instead of the in-region image recognition unit 1013. . The image recognition unit 3011 groups on-pixel data included in the binary image data 1023 into groups of on-pixel data that are continuous with each other, and recognizes each on-pixel data as a partial image. The image recognition unit 3011 adds an identification label to each of the partial images recognized as such. The image recognition unit 3011 delivers data indicating the partial image to which the identification label is added (hereinafter referred to as “partial image data”) to the dilated image generation unit 2011.

膨張画像生成部２０１１は受け取った部分画像データにより示される部分画像の各々に対し膨張処理を行い、膨張画像を生成する。文字画像切出装置１０−３の制御部１０１は、膨張処理により互いに連続することとなった部分画像を一つの部分画像とする連結膨張画像認識部３０１２を備えている。膨張画像生成部２０１１は膨張画像を示す膨張画像データを連結膨張画像認識部３０１２に引き渡す。連結膨張画像認識部３０１２は膨張画像データにより示される膨張画像を互いに連続する膨張画像群にグループ化し、グループ化した膨張画像群を連結して１つの画像として再認識する。以下、そのように連結された膨張画像群を「連結膨張画像」と呼ぶ。 The expanded image generation unit 2011 performs expansion processing on each of the partial images indicated by the received partial image data, and generates an expanded image. The control unit 101 of the character image cutout device 10-3 includes a connected expanded image recognition unit 3012 that uses partial images that are continuous with each other by expansion processing as one partial image. The dilated image generation unit 2011 delivers dilated image data indicating the dilated image to the connected dilated image recognition unit 3012. The connected expanded image recognition unit 3012 groups the expanded images indicated by the expanded image data into continuous expanded image groups, connects the grouped expanded image groups, and re-recognizes them as one image. Hereinafter, the group of expanded images connected in this manner is referred to as a “connected expanded image”.

文字画像切出装置１０−３の制御部１０１は、所定領域に一部もしくは全部が含まれる連結膨張画像を認識する領域内連結膨張画像認識部３０１３を備えている。領域内連結膨張画像認識部３０１３は連結膨張画像認識部３０１２から連結膨張画像を示す連結膨張画像データを受け取り、連結膨張画像データにより示される連結膨張画像のうち、領域データ１０２２により示される第１の升目の少なくとも一部を占めているものを特定する。以下、領域内連結膨張画像認識部３０１３により特定された連結膨張画像を「領域内連結膨張画像」と呼ぶ。 The control unit 101 of the character image cutout device 10-3 includes an intra-region connected dilated image recognition unit 3013 that recognizes a connected dilated image including a part or all of a predetermined region. The intra-region connected expanded image recognition unit 3013 receives the connected expanded image data indicating the connected expanded image from the connected expanded image recognition unit 3012, and among the connected expanded images indicated by the connected expanded image data, the first region indicated by the region data 1022. Identify what occupies at least part of the cell. Hereinafter, the connected expanded image identified by the intra-region connected expanded image recognition unit 3013 is referred to as an “in-region connected expanded image”.

領域内連結膨張画像認識部３０１３は、領域内連結膨張画像を示す領域内連結膨張画像データを画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った領域内連結膨張画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 The intra-region linked dilated image recognition unit 3013 delivers intra-region linked dilated image data indicating the intra-region linked dilated image to the image data output unit 1015. The image data output unit 1015 transmits the received intra-region expanded image data to the character recognition device 12 as image data indicating one character.

以上のように文字画像切出装置１０−３により切り出され文字認識装置１２に送信される画像データは、文字画像切出装置１０−２により切り出される画像データと同様に、升目から外れて書かれた文字部分を含む画像を示すため、文字認識処理において高い精度の結果をもたらすものとなる。 As described above, the image data cut out by the character image cutout device 10-3 and transmitted to the character recognition device 12 is written out of the grid as the image data cut out by the character image cutout device 10-2. Since an image including a character portion is shown, a highly accurate result is obtained in the character recognition process.

ところで、領域内連結膨張画像認識部３０１３により認識される領域内連結膨張画像は、升目の内外のいずれに関してもオリジナルの画像に膨張処理が施された画像である。そこで、より高い文字認識の精度をもたらす画像を生成するために、文字画像切出装置１０−３に対し以下の変形を加えてもよい。 By the way, the intra-region linked dilated image recognized by the intra-region linked dilated image recognition unit 3013 is an image obtained by performing dilation processing on the original image in both the inside and outside of the mesh. Therefore, in order to generate an image that provides higher character recognition accuracy, the following modification may be added to the character image cutting device 10-3.

図１０は、文字画像切出装置１０−３の制御部１０１に収縮画像生成部２０１３を備えさせた場合の構成を示すブロック図である。収縮画像生成部２０１３は領域内連結膨張画像認識部３０１３と画像データ出力部１０１５の間に介挿され、領域内連結膨張画像認識部３０１３から受け取った領域内連結膨張画像データにより示される領域内連結膨張画像に対し収縮処理を行い、収縮領域内連結膨張画像を生成する。収縮領域内連結膨張画像は、領域内連結膨張画像と比較して、オリジナルの画像と近似する太さであり、より高い精度の文字認識をもたらす。 FIG. 10 is a block diagram illustrating a configuration in the case where the control unit 101 of the character image cutting device 10-3 includes a contracted image generation unit 2013. The contracted image generation unit 2013 is inserted between the intra-region connected expanded image recognition unit 3013 and the image data output unit 1015, and is connected within the region indicated by the intra-region connected expanded image recognition unit 3013. A contraction process is performed on the expanded image to generate a connected expanded image within the contracted region. The contracted intra-region connected dilated image has a thickness that approximates that of the original image compared to the intra-region concatenated dilated image, resulting in higher-accuracy character recognition.

収縮画像生成部２０１３は収縮領域内連結膨張画像を示す収縮領域内連結膨張画像データを画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った収縮領域内連結膨張画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 The contracted image generation unit 2013 delivers the in-contracted region connected expanded image data indicating the in-contracted region connected expanded image to the image data output unit 1015. The image data output unit 1015 transmits the received in-shrinkage region connected expansion image data to the character recognition device 12 as image data indicating one character.

また、図１１は、文字画像切出装置１０−３の制御部１０１に重複画像認識部２０１４を備えさせた場合の構成を示すブロック図である。重複画像認識部２０１４は領域内連結膨張画像認識部３０１３と画像データ出力部１０１５の間に介挿され、領域内連結膨張画像認識部３０１３から受け取った領域内連結膨張画像データにより示される領域内連結膨張画像と２値画像データ１０２３により示される画像との重複部分の画像を取り出す。 FIG. 11 is a block diagram illustrating a configuration when the control unit 101 of the character image cutting device 10-3 includes the duplicate image recognition unit 2014. The overlapping image recognition unit 2014 is interposed between the intra-region connected expanded image recognition unit 3013 and the image data output unit 1015, and is connected within the region indicated by the intra-region connected expanded image recognition unit 3013. An image of an overlapping portion between the dilated image and the image indicated by the binary image data 1023 is extracted.

上記のように重複画像認識部２０１４により認識される重複画像は、オリジナルの画像の中から、領域内連結膨張画像により占められる範囲に含まれる部分を取り出したものである。従って、膨張処理が施された領域内連結膨張画像と比較して、より高い精度の文字認識をもたらす画像となる。 As described above, the overlapping image recognized by the overlapping image recognition unit 2014 is obtained by extracting a portion included in the range occupied by the intra-region connected expansion image from the original image. Therefore, it is an image that provides character recognition with higher accuracy compared to the intra-region connected expanded image subjected to the expansion processing.

［４．第４実施形態］
図１２は、第２実施形態および第３実施形態の場合と同様に、ユーザがタブレットＰＣ１１に表示される４文字分の升目に対し「榊原正義」なる文字列を筆記した場合に、本発明の第４実施形態にかかる文字画像切出装置１０−４により第１の升目から画像データが切り出される様子を示した図である。 [4. Fourth Embodiment]
FIG. 12 shows the case where the user writes the character string “Masayoshi Sugawara” for the four-character cell displayed on the tablet PC 11 as in the case of the second and third embodiments. It is the figure which showed a mode that image data was cut out from the 1st cell by the character image cutting device 10-4 concerning 4th Embodiment.

図１３は文字画像切出装置１０−４の構成を示したブロック図である。文字画像切出装置１０−４の構成および動作は多くの点で文字画像切出装置１０−３のそれらと共通しているため、以下、文字画像切出装置１０−４が文字画像切出装置１０−３と異なる点のみを説明する。また、以下の説明において参照する図において、文字画像切出装置１０−１ないし文字画像切出装置１０−３の構成部に対応する構成部には文字画像切出装置１０−１ないし文字画像切出装置１０−３において用いられたものと同じ符号が付されている。 FIG. 13 is a block diagram showing the configuration of the character image cutting device 10-4. Since the configuration and operation of the character image cutting device 10-4 are similar to those of the character image cutting device 10-3 in many respects, the character image cutting device 10-4 is hereinafter referred to as the character image cutting device. Only differences from 10-3 will be described. Further, in the drawings to be referred to in the following description, the component corresponding to the component of the character image cutting device 10-1 to the character image cutting device 10-3 has a character image cutting device 10-1 to a character image cutting device. The same code | symbol as what was used in the taking-out apparatus 10-3 is attached | subjected.

文字画像切出装置１０−４の制御部１０１は、文字画像切出装置１０−３が備える全ての構成部に加え、２値画像データ１０２３に示される画像データのうち、領域データ１０２２により示される升目に含まれる画像データの部分を取り出し、それらに含まれる連続する画像部分を各々分離し、領域内画像として認識する領域内画像認識部１０１３を備えている。領域内画像認識部１０１３は認識した領域内画像およびそれらに付した識別ラベルを示す領域内画像データを領域内連結膨張画像認識部３０１３に引き渡す。 The control unit 101 of the character image cutting device 10-4 is indicated by the area data 1022 among the image data indicated by the binary image data 1023 in addition to all the components included in the character image cutting device 10-3. An in-region image recognition unit 1013 is provided which takes out image data portions included in the cells and separates the consecutive image portions included therein and recognizes them as in-region images. The intra-region image recognition unit 1013 delivers the intra-region image data indicating the recognized intra-region images and the identification labels attached to them to the intra-region connected expanded image recognition unit 3013.

文字画像切出装置１０−４の領域内連結膨張画像認識部３０１３は、連結膨張画像認識部３０１２から連結膨張画像を示す連結膨張画像データを受け取り、また領域内画像認識部１０１３から領域内画像データを受け取る。領域内連結膨張画像認識部３０１３は領域内画像データにより示される領域内画像の各々に関し、当該領域内画像に含まれる任意のオン画素を起点として選択する。領域内連結膨張画像認識部３０１３は、連結膨張画像データにより示される連結膨張画像に含まれる、先に選択した起点のオン画素に対応する位置のオン画素を特定し、特定したオン画素に連続するオン画素群を領域内連結膨張画像として認識する。 The intra-region connected expanded image recognition unit 3013 of the character image cutting device 10-4 receives the connected expanded image data indicating the connected expanded image from the connected expanded image recognition unit 3012, and the intra-region image data from the intra-region image recognition unit 1013. Receive. For each of the in-region images indicated by the in-region image data, the in-region connected expanded image recognition unit 3013 selects any on-pixel included in the in-region image as a starting point. The intra-region connected dilated image recognition unit 3013 identifies an on-pixel at a position corresponding to the previously selected on-pixel included in the linked dilated image indicated by the connected dilated image data, and continues to the specified on-pixel. The on-pixel group is recognized as an intra-region connected expanded image.

例えば、図１２に示される例によれば、連結膨張画像により示される「榊」の画像は、その文字を構成する左側のパーツ（Ｌ１１）は識別ラベルＬ１もしくはＬ２が付された領域内画像に含まれるオン画素を起点とした取り出し処理により領域内連結膨張画像として認識される。また、連結膨張画像により示される「榊」の画像の右側のパーツ（Ｌ１２）は識別ラベルＬ３が付された領域内画像に含まれるオン画素を起点とした取り出し処理により領域内連結膨張画像として認識される。 For example, according to the example shown in FIG. 12, the image of “榊” shown by the connected expanded image is an image in the region where the left part (L11) constituting the character is attached with the identification label L1 or L2. It is recognized as an intra-region connected expanded image by the extraction process starting from the included on-pixel. In addition, the right part (L12) of the “榊” image indicated by the connected dilated image is recognized as an intra-region dilated image by the extraction process starting from the on-pixel included in the intra-region image with the identification label L3. Is done.

領域内連結膨張画像認識部３０１３は上記のように領域内連結膨張画像を認識すると、認識した領域内連結膨張画像を示す領域内連結膨張画像データを画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った領域内連結膨張画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 When the intra-region linked dilated image recognition unit 3013 recognizes the intra-region linked dilated image as described above, the intra-region linked dilated image data is transferred to the image data output unit 1015 indicating the recognized intra-region linked dilated image. The image data output unit 1015 transmits the received intra-region expanded image data to the character recognition device 12 as image data indicating one character.

以上のように文字画像切出装置１０−４により切り出され文字認識装置１２に送信される画像データは、文字画像切出装置１０−３により切り出される画像データと同じ画像データとなり、升目から外れて書かれた文字部分を含む画像を示すため、文字認識処理において高い精度の結果をもたらすものとなる。 As described above, the image data cut out by the character image cutout device 10-4 and transmitted to the character recognition device 12 becomes the same image data as the image data cut out by the character image cutout device 10-3 and deviates from the grid. Since an image including a written character portion is shown, a highly accurate result is obtained in the character recognition process.

ところで、文字画像切出装置１０−４の領域内連結膨張画像認識部３０１３により認識される領域内連結膨張画像は、文字画像切出装置１０−３の文字画像切出装置１０−４により認識される領域内連結膨張画像と同じものであるため、文字画像切出装置１０−４に対し、文字画像切出装置１０−３に関し上述したものと同様の変形を加えてもよい。 By the way, the intra-region connected expanded image recognized by the intra-region connected expanded image recognition unit 3013 of the character image cutting device 10-4 is recognized by the character image cutting device 10-4 of the character image cutting device 10-3. Therefore, the character image cutting device 10-4 may be modified in the same manner as described above with respect to the character image cutting device 10-3.

すなわち、文字画像切出装置１０−４の領域内連結膨張画像認識部３０１３と画像データ出力部１０１５の間に収縮画像生成部２０１３を介挿し、領域内連結膨張画像に収縮処理を施した収縮領域内連結膨張画像を生成し、収縮領域内連結膨張画像を示す収縮領域内連結膨張画像データを画像データ出力部１０１５から出力するようにしてもよい。また、文字画像切出装置１０−４の領域内連結膨張画像認識部３０１３と画像データ出力部１０１５の間に重複画像認識部２０１４を介挿し、領域内連結膨張画像と２値画像データ１０２３により示される画像との重複画像を認識し、重複画像を示すデータを画像データ出力部１０１５から出力するようにしてもよい。 That is, a contracted area obtained by interpolating the contracted image generating unit 2013 between the intra-area connected expanded image recognition unit 3013 and the image data output unit 1015 of the character image cutting device 10-4 and performing contraction processing on the intra-area connected expanded image. It is also possible to generate an inner connected expanded image and output from the image data output unit 1015 the intra-contracted region expanded image data indicating the contracted region connected expanded image. In addition, an overlapping image recognition unit 2014 is inserted between the intra-region connected expanded image recognition unit 3013 and the image data output unit 1015 of the character image cutting device 10-4, and is indicated by the intra-region connected expanded image and the binary image data 1023. It is also possible to recognize an overlapping image with the image to be output and output data indicating the overlapping image from the image data output unit 1015.

［５．第５実施形態］
図１４は、ユーザがタブレットＰＣ１１に表示される４文字分の升目に対し「榊原正義」なる文字列を筆記した場合に、従来技術により第４の升目から切り出される画像データ（図１４（ａ））と、本発明の第５実施形態にかかる文字画像切出装置１０−５により第４の升目から切り出される画像データ（図１４（ｂ））とを用いて、文字認識装置１２が文字認識処理を行った際に得られるテキストデータを比較して示した図である。 [5. Fifth Embodiment]
FIG. 14 shows image data cut out from the fourth square by the conventional technique when the user writes a character string “Masayoshi Sugawara” for the four squares displayed on the tablet PC 11 (FIG. 14A). ) And image data cut out from the fourth cell by the character image cutout device 10-5 according to the fifth embodiment of the present invention (FIG. 14B), the character recognition device 12 performs character recognition processing. It is the figure which compared and showed the text data obtained when performing.

図１５は文字画像切出装置１０−５の構成を示したブロック図である。文字画像切出装置１０−５の構成および動作は多くの点で文字画像切出装置１０−１のそれらと共通しているため、以下、文字画像切出装置１０−５が文字画像切出装置１０−１と異なる点のみを説明する。また、以下の説明において参照する図において、文字画像切出装置１０−１ないし文字画像切出装置１０−４の構成部に対応する構成部には文字画像切出装置１０−１ないし文字画像切出装置１０−４において用いられたものと同じ符号が付されている。 FIG. 15 is a block diagram showing the configuration of the character image cutting device 10-5. Since the configuration and operation of the character image cutting device 10-5 are similar to those of the character image cutting device 10-1 in many respects, the character image cutting device 10-5 is hereinafter referred to as the character image cutting device. Only differences from 10-1 will be described. Further, in the drawings to be referred to in the following description, the component corresponding to the component of the character image cutting device 10-1 to the character image cutting device 10-4 has a character image cutting device 10-1 to a character image cutting device. The same reference numerals as those used in the dispensing device 10-4 are attached.

文字画像切出装置１０−５の制御部１０１は、処理対象の領域を升目から拡張領域内画像に外接する矩形の領域に拡張する拡張領域認識部５０１１を備えている。文字画像切出装置１０−５の制御部１０１が備える拡張領域内画像認識部１０１４は、領域内画像認識部１０１３から升目の中に配置された領域内画像を示す領域内画像データを受け取り、受け取った領域内画像データおよび２値画像データ１０２３を用いて拡張領域内画像データを生成すると、生成した拡張領域内画像データを拡張領域認識部５０１１に引き渡す。 The control unit 101 of the character image cutout device 10-5 includes an extended area recognition unit 5011 that extends a processing target area from a grid to a rectangular area that circumscribes the extended area image. The extended region image recognition unit 1014 included in the control unit 101 of the character image cutting device 10-5 receives and receives the region image data indicating the region image arranged in the cell from the region image recognition unit 1013. When the extension region image data is generated using the region image data and the binary image data 1023, the generated extension region image data is delivered to the extension region recognition unit 5011.

拡張領域認識部５０１１は、拡張領域内画像認識部１０１４から受け取った拡張領域内画像データにより示される拡張領域内画像の集まりに外接する矩形領域を特定する。以下、そのように特定された領域を「拡張領域」と呼ぶ。図１４において、領域Ａ０は領域データ１０２２により示される第４の升目の領域を示し、領域Ａ１は拡張領域を示す。拡張領域認識部５０１１は、拡張領域を示す拡張領域データを領域内画像認識部１０１３に引き渡す。 The extension area recognition unit 5011 identifies a rectangular area that circumscribes the collection of images in the extension area indicated by the image data in the extension area received from the image recognition section 1014 in the extension area. Hereinafter, the area thus identified is referred to as an “extended area”. In FIG. 14, a region A0 indicates a fourth cell region indicated by the region data 1022, and a region A1 indicates an extended region. The extended area recognition unit 5011 delivers the extended area data indicating the extended area to the in-area image recognition unit 1013.

領域内画像認識部１０１３は、拡張領域認識部５０１１から拡張領域データを受け取ると、２値画像データ１０２３に含まれるオン画素データのうち、領域データ１０２２により示される升目の領域に代えて、領域内画像の認識処理を再度実行する。その結果、図１４において、既に認識されていた識別ラベルＬ１〜Ｌ３の付された領域内画像に加え、新たに識別ラベルＬ４の付された領域内画像が領域内画像認識部１０１３により認識される。領域内画像認識部１０１３はそのように認識した領域内画像を示す領域内画像データを拡張領域内画像認識部１０１４に引き渡す。 When the in-region image recognition unit 1013 receives the extension region data from the extension region recognition unit 5011, the in-region image recognition unit 1013 replaces the region of the mesh indicated by the region data 1022 among the on-pixel data included in the binary image data 1023. The image recognition process is executed again. As a result, in FIG. 14, the in-region image with the new identification label L4 is recognized by the in-region image recognition unit 1013 in addition to the already recognized in-region images with the identification labels L1 to L3. . The intra-region image recognition unit 1013 delivers the intra-region image data indicating the recognized intra-region image to the extended region image recognition unit 1014.

拡張領域内画像認識部１０１４は領域内画像認識部１０１３から拡張領域に関し生成した領域内画像データを受け取ると、受け取った領域内画像データにより示される領域内画像の拡張処理を再実行する。その結果、図１４において、識別ラベルＬ４の付された領域内画像が拡張される。拡張領域内画像認識部１０１４はそのように認識した拡張領域内画像を示す拡張領域内画像データを画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った拡張領域内画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 When the intra-region image recognition unit 1014 receives the intra-region image data generated for the extension region from the intra-region image recognition unit 1013, the intra-region image recognition unit 1014 re-executes the expansion processing of the intra-region image indicated by the received intra-region image data. As a result, in FIG. 14, the in-region image with the identification label L4 is expanded. The extension area image recognition unit 1014 delivers the extension area image data indicating the recognized extension area image to the image data output unit 1015. The image data output unit 1015 transmits the received image data in the extended area to the character recognition device 12 as image data indicating one character.

上記の説明においては、拡張領域認識部５０１１による拡張領域の認識処理、領域内画像認識部１０１３による領域内画像の認識処理および拡張領域内画像認識部１０１４による拡張領域内画像の認識処理が１回だけ繰り返されるものとしたが、繰り返し回数を所定の複数回としてもよい。また、拡張領域内画像認識部１０１４による領域内画像の拡張もしくは拡張領域認識部５０１１による領域の拡張が行われなくなるまで、それらの処理を繰り返すようにしてもよい。 In the above description, the extension region recognition unit 5011 performs the extension region recognition processing, the region image recognition unit 1013 recognizes the region image, and the extension region image recognition unit 1014 recognizes the extension region image once. However, the number of repetitions may be a predetermined number of times. Further, these processes may be repeated until the extension of the in-region image by the extension-in-region image recognition unit 1014 or the extension of the region by the extension region recognition unit 5011 is not performed.

以上のように文字画像切出装置１０−５により切り出され文字認識装置１２に送信される画像データは、例えば文字画像切出装置１０−１における拡張処理によっては切り出されないような、升目外に分離して書かれた文字部分をも含む画像を示すため、文字認識処理において高い精度の結果をもたらすものとなる。 As described above, the image data cut out by the character image cutting device 10-5 and transmitted to the character recognition device 12 is out of the grid, for example, not cut out by the expansion processing in the character image cutting device 10-1. Since an image including a character portion that is separately written is shown, a highly accurate result is obtained in the character recognition processing.

ところで、文字画像切出装置１０−５における領域拡張による認識処理の繰り返し実行を、文字画像切出装置１０−２ないし文字画像切出装置１０−４の拡張処理等と組み合わせてもよい。 By the way, the repeated execution of recognition processing by area expansion in the character image cutting device 10-5 may be combined with the expansion processing of the character image cutting device 10-2 to the character image cutting device 10-4.

［６．第６実施形態］
図１６は、ユーザがタブレットＰＣ１１に表示される４文字分の升目に対し「榊原正義」なる文字列を筆記した場合に、本発明の第６実施形態にかかる文字画像切出装置１０−６により第１の升目から画像データが切り出される様子を示した図である。この例においては、第１の升目には、第２の升目に書かれるべき文字の一部がはみ出して書かれているため、第１の升目に含まれる画像を用いて文字認識処理を行った場合、高い文字認識の精度は期待できない。これに対し、文字画像切出装置１０−６は近隣の升目からはみ出してきた画像部分を除去することにより、高い精度の文字認識をもたらす画像の切り出しを行う。 [6. Sixth Embodiment]
FIG. 16 shows a case where the character image cutting device 10-6 according to the sixth embodiment of the present invention is used when the user writes a character string “Masayoshi Sugawara” with respect to the four-character grid displayed on the tablet PC 11. It is the figure which showed a mode that image data was cut out from the 1st cell. In this example, a part of the characters to be written in the second cell is written in the first cell, so that the character recognition process is performed using the image included in the first cell. In this case, high accuracy of character recognition cannot be expected. On the other hand, the character image cutout device 10-6 cuts out an image that brings about highly accurate character recognition by removing an image portion that protrudes from a neighboring cell.

図１７は、文字画像切出装置１０−６の構成を示したブロック図である。文字画像切出装置１０−６の構成および動作は多くの点で文字画像切出装置１０−１のそれらと共通しているため、以下、文字画像切出装置１０−６が文字画像切出装置１０−１と異なる点のみを説明する。また、以下の説明において参照する図において、文字画像切出装置１０−１ないし文字画像切出装置１０−５の構成部に対応する構成部には文字画像切出装置１０−１ないし文字画像切出装置１０−５において用いられたものと同じ符号が付されている。 FIG. 17 is a block diagram showing the configuration of the character image cutting device 10-6. Since the configuration and operation of the character image cutting device 10-6 are similar to those of the character image cutting device 10-1 in many respects, the character image cutting device 10-6 is hereinafter referred to as the character image cutting device. Only differences from 10-1 will be described. Further, in the drawings to be referred to in the following description, the component corresponding to the component of the character image cutting device 10-1 to the character image cutting device 10-5 has the character image cutting device 10-1 to the character image cutting device. The same code | symbol as what was used in the taking-out apparatus 10-5 is attached | subjected.

文字画像切出装置１０−６の制御部１０１は、文字画像切出装置１０−１の制御部１０１が備える拡張領域内画像認識部１０１４に代えて、領域内画像のうち升目外に連続するオン画素群を有するものの中から、近隣の升目からはみ出してきた部分を示すものと思われる領域内画像を画像データ出力部１０１５に引き渡す領域内画像の集まりから除外する領域内画像除外部６０１１を備えている。 The control unit 101 of the character image cutout device 10-6 replaces the extended region image recognition unit 1014 included in the control unit 101 of the character image cutout device 10-1, and turns on the continuous images outside the grid in the region image. An intra-region image excluding unit 6011 for excluding from the group of intra-region images that are handed over to the image data output unit 1015 an intra-region image that is supposed to indicate a portion protruding from a neighboring cell from among those having a pixel group. Yes.

領域内画像認識部１０１３は領域内画像データを領域内画像除外部６０１１に引き渡す。領域内画像除外部６０１１は受け取った領域内画像データおよび２値画像データ１０２３を用いて、領域内画像の各々に関し、升目外に連続するオン画素群が存在するか否かを判定し、連続したオン画素群があるもの（図１６における、識別ラベルＬ１およびＬ５の付された領域内画像）に関しては、升目内の領域内画像に含まれるオン画素の数（もしくはその面積）と、領域内画像に連続する升目外のオン画素群に含まれるオン画素の数（もしくはその面積）とを比較する。領域内画像除外部６０１１は、升目内のオン画素の数が升目外のオン画素の数よりも少ない場合（図１６における、識別ラベルＬ５の付された領域内画像）、その領域内画像を画像データ出力部１０１５に出力する領域内画像の集まりから除外する。 The intra-region image recognition unit 1013 delivers the intra-region image data to the intra-region image exclusion unit 6011. The in-region image exclusion unit 6011 uses the received in-region image data and binary image data 1023 to determine whether or not there is a continuous on-pixel group for each of the in-region images. For an on-pixel group (in-region image with identification labels L1 and L5 in FIG. 16), the number of on-pixels (or its area) included in the in-region image within the cell and the in-region image Are compared with the number (or area) of on-pixels included in a group of on-pixels outside the grid. When the number of on-pixels in the cell is smaller than the number of on-pixels outside the cell (intra-region image with the identification label L5 in FIG. 16), the in-region image excluding unit 6011 displays the in-region image as an image. Excluded from the collection of in-region images to be output to the data output unit 1015.

すなわち、領域内画像除外部６０１１は、升目外に連続するオン画素群を有しない領域内画像と、升目外に連続するオン画素群を有するが升目外に連続するオン画素群の占める面積よりも広い面積を占める領域内画像とを示す領域内画像データを、画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った領域内画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 That is, the in-region image excluding unit 6011 has a larger area than an in-region image that does not have an on-pixel group continuous outside the grid and an on-pixel group that has an on-pixel group continuous outside the grid but outside the grid. The in-region image data indicating the in-region image occupying a large area is delivered to the image data output unit 1015. The image data output unit 1015 transmits the received in-region image data to the character recognition device 12 as image data indicating one character.

以上のように文字画像切出装置１０−６により切り出され文字認識装置１２に送信される画像データは、近隣の升目からはみ出した文字部分が除外された画像を示すため、文字認識処理において高い精度の結果をもたらすものとなる。 As described above, the image data cut out by the character image cutout device 10-6 and transmitted to the character recognition device 12 shows an image in which the character portion protruding from the neighboring cell is excluded, so that it is highly accurate in character recognition processing. Will bring about the result.

ところで、文字画像切出装置１０−６における領域内画像の除外処理を、文字画像切出装置１０−１ないし文字画像切出装置１０−５の拡張処理等と組み合わせてもよい。例えば、図１８は文字画像切出装置１０−６に文字画像切出装置１０−１における領域内画像の拡張処理を組み合わせた場合の構成を示したブロック図である。 By the way, the in-region image exclusion processing in the character image cutting device 10-6 may be combined with the expansion processing of the character image cutting device 10-1 to the character image cutting device 10-5. For example, FIG. 18 is a block diagram showing a configuration when the character image cutting device 10-6 is combined with the in-region image expansion processing in the character image cutting device 10-1.

図１８に示される構成の文字画像切出装置１０−６においては、領域内画像除外部６０１１と画像データ出力部１０１５の間に拡張領域内画像認識部１０１４が介挿されている。この場合、拡張領域内画像認識部１０１４は領域内画像除外部６０１１から、除外されなかった領域内画像を示す領域内画像データを受け取り、受け取った領域内画像データにより示される領域内画像に対し拡張処理を行い、拡張領域内画像を認識する。拡張領域内画像認識部１０１４は認識した拡張領域内画像を示す拡張領域内画像データを、画像データ出力部１０１５に引き渡す。画像データ出力部１０１５は受け取った拡張領域内画像データを、１文字を示す画像データとして文字認識装置１２に対し送信する。 In the character image cutout device 10-6 configured as shown in FIG. 18, an extended in-region image recognition unit 1014 is interposed between the in-region image excluding unit 6011 and the image data output unit 1015. In this case, the extended in-region image recognition unit 1014 receives the in-region image data indicating the in-region image that has not been excluded from the in-region image excluding unit 6011, and expands the in-region image indicated by the received in-region image data. Processing is performed to recognize the image in the extended area. The extension area image recognition unit 1014 delivers the extension area image data indicating the recognized extension area image to the image data output unit 1015. The image data output unit 1015 transmits the received image data in the extended area to the character recognition device 12 as image data indicating one character.

このように画像データ出力部１０１５から送信される画像データは、近隣の升目からはみ出した文字部分が除外され、かつ升目外にはみ出した文字部分を取り込んだ画像を示すため、文字認識処理においてより高い精度の結果をもたらすものとなる。 In this way, the image data transmitted from the image data output unit 1015 is higher in character recognition processing because it indicates an image in which the character portion that protrudes from the neighboring cells is excluded and the character portion that protrudes outside the cells is captured. The result will be accuracy.

また、文字画像切出装置１０−６の領域内画像除外部６０１１が升目の内外に配置されるオン画素数を基準として領域内画像を除外する代わりに、例えば領域内画像除外部６０１１は升目の内外にまたがる拡張領域内画像を含めた画像データと除外した画像データの２セットの画像データを、画像データ出力部１０１５を介して文字認識装置１２に送信するようにしてもよい。その場合、文字認識装置１２においてそれらの画像データの各々に対し文字認識処理を行い、認識結果を登録された単語を示すテキストデータ等と比較することにより、正しいと思われる認識結果を採用するようにしてもよい。 Further, instead of the in-region image excluding unit 6011 of the character image cutting device 10-6 excluding the in-region image on the basis of the number of on-pixels arranged inside and outside the cell, the in-region image excluding unit 6011, for example, Two sets of image data, that is, image data including an image in the extended area extending inside and outside and excluded image data, may be transmitted to the character recognition device 12 via the image data output unit 1015. In that case, the character recognition device 12 performs character recognition processing on each of the image data, and compares the recognition result with text data or the like indicating the registered word, thereby adopting a recognition result that seems to be correct. It may be.

ところで、上述した文字画像切出装置１０は、いずれも専用のハードウェアにより実現されてもよいし、汎用的なＰＣにアプリケーションプログラムに従った処理を実行させることにより実現されてもよい。 By the way, each of the character image cutout devices 10 described above may be realized by dedicated hardware, or may be realized by causing a general-purpose PC to execute processing according to an application program.

また、上述した実施形態においては、文字認識システム１は文字画像切出装置１０、タブレットＰＣ１１および文字認識装置１２を互いに接続することにより実現されるものとして説明したが、それらの配置は任意に変更可能である。例えば、文字認識装置１２を文字画像切出装置１０と同じ筐体内に配置したり、文字画像切出装置１０をタブレットＰＣ１１により実現するようにしたりしてもよい。また、文字画像切出装置１０とタブレットＰＣ１１および文字画像切出装置１０と文字認識装置１２をネットワークを介して相互に接続するようにしてもよい。 Moreover, in embodiment mentioned above, although the character recognition system 1 demonstrated as what was implement | achieved by mutually connecting the character image cutting device 10, the tablet PC 11, and the character recognition device 12, those arrangements are changed arbitrarily. Is possible. For example, the character recognition device 12 may be arranged in the same casing as the character image cutting device 10, or the character image cutting device 10 may be realized by the tablet PC 11. Further, the character image cutting device 10 and the tablet PC 11 and the character image cutting device 10 and the character recognition device 12 may be connected to each other via a network.

また、上述した実施形態においては、文字画像切出装置１０に対し、タブレットＰＣ１１から文字列を示す画像データが入力されるものとして説明したが、例えばタブレットＰＣ１１の代わりに、画像を光学的に読み取り画像データを生成するスキャナ装置を文字画像切出装置１０に接続し、紙面に書かれた文字列をスキャナ装置に読み取らせ、スキャナ装置により生成された画像データを文字画像切出装置１０に入力させるようにしてもよい。その場合、例えば、升目を示す図形を例えば朱色等で色付けしておき、文字画像切出装置１０において色フィルタにより升目を示す図形を抽出後、図形認識処理を行い升目の領域を特定するようにすればよい。もしくは、図形認識処理のみによって升目の領域を特定するようにしてもよい。 Further, in the above-described embodiment, it has been described that image data indicating a character string is input from the tablet PC 11 to the character image cutting device 10, but for example, instead of the tablet PC 11, an image is optically read. A scanner device that generates image data is connected to the character image cutting device 10, the character string written on the paper is read by the scanner device, and the image data generated by the scanner device is input to the character image cutting device 10. You may do it. In that case, for example, a figure showing a square is colored with, for example, vermilion, and after the figure showing the square is extracted by a color filter in the character image cutting device 10, a figure recognition process is performed to identify the area of the square. do it. Or you may make it specify the area | region of a grid only by a figure recognition process.

また、上述した実施形態においては、手書きによる文字列から１文字分の画像を切り出す場合について説明したが、升目に対しずれて印字された活字等に関しても本発明にかかる文字認識システムが利用可能であることは言うまでもない。 Further, in the above-described embodiment, the case where an image for one character is cut out from a handwritten character string has been described. However, the character recognition system according to the present invention can also be used for printed characters that are shifted from the grid. Needless to say.

本発明の実施形態にかかる文字認識システムの構成を示したブロック図である。It is the block diagram which showed the structure of the character recognition system concerning embodiment of this invention. 本発明の第１実施形態にかかる文字画像切出装置により画像データが切り出される様子を示した図である。It is the figure which showed a mode that image data was cut out by the character image cutting device concerning 1st Embodiment of this invention. 本発明の第１実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 1st Embodiment of this invention. 本発明の第２実施形態にかかる文字画像切出装置により画像データが切り出される様子を示した図である。It is the figure which showed a mode that image data was cut out by the character image cutting device concerning 2nd Embodiment of this invention. 本発明の第２実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 2nd Embodiment of this invention. 本発明の第２実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 2nd Embodiment of this invention. 本発明の第２実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 2nd Embodiment of this invention. 本発明の第３実施形態にかかる文字画像切出装置により画像データが切り出される様子を示した図である。It is the figure which showed a mode that image data was cut out by the character image cutting device concerning 3rd Embodiment of this invention. 本発明の第３実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 3rd Embodiment of this invention. 本発明の第３実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 3rd Embodiment of this invention. 本発明の第３実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 3rd Embodiment of this invention. 本発明の第４実施形態にかかる文字画像切出装置により画像データが切り出される様子を示した図である。It is the figure which showed a mode that image data was cut out by the character image cutting device concerning 4th Embodiment of this invention. 本発明の第４実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 4th Embodiment of this invention. 本発明の第５実施形態にかかる文字画像切出装置により画像データが切り出される様子を示した図である。It is the figure which showed a mode that image data was cut out by the character image cutting-out apparatus concerning 5th Embodiment of this invention. 本発明の第５実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 5th Embodiment of this invention. 本発明の第６実施形態にかかる文字画像切出装置により画像データが切り出される様子を示した図である。It is the figure which showed a mode that image data was cut out by the character image cutting-out apparatus concerning 6th Embodiment of this invention. 本発明の第６実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 6th Embodiment of this invention. 本発明の第６実施形態にかかる文字画像切出装置の構成を示したブロック図である。It is the block diagram which showed the structure of the character image cutting device concerning 6th Embodiment of this invention.

Explanation of symbols

１…文字認識システム、１０…文字画像切出装置、１１…タブレットＰＣ、１２…文字認識装置、１０１…制御部、１０２…記憶部、１０１１…画像データ入力部、１０１２…２値化部、１０１３…領域内画像認識部、１０１４…拡張領域内画像認識部、１０１５…画像データ出力部、１０２１…画像データ、１０２２…領域データ、１０２３…２値画像データ、２０１１…膨張画像生成部、２０１２…拡張膨張領域内画像認識部、２０１３…収縮画像生成部、２０１４…重複画像認識部、３０１１…画像認識部、３０１２…連結膨張画像認識部、３０１３…領域内連結膨張画像認識部、５０１１…拡張領域認識部、６０１１…領域内画像除外部 DESCRIPTION OF SYMBOLS 1 ... Character recognition system, 10 ... Character image cutting device, 11 ... Tablet PC, 12 ... Character recognition device, 101 ... Control part, 102 ... Memory | storage part, 1011 ... Image data input part, 1012 ... Binarization part, 1013 ... Image recognition unit in region, 1014 ... Image recognition unit in extended region, 1015 ... Image data output unit, 1021 ... Image data, 1022 ... Region data, 1023 ... Binary image data, 2011 ... Expanded image generation unit, 2012 ... Extension Expansion region image recognition unit, 2013 ... Shrinkage image generation unit, 2014 ... Duplicate image recognition unit, 3011 ... Image recognition unit, 3012 ... Concatenated expansion image recognition unit, 3013 ... Intra-region connection expansion image recognition unit, 5011 ... Expansion region recognition Part, 6011...

Claims

Image data acquisition means for acquiring image data consisting of a collection of pixel data indicating attribute values of each of a plurality of pixels constituting an image arranged on a plane;
Of the pixels indicated by the pixel data included in the image data acquired by the image data acquisition means, the pixel is represented by a predetermined grid on the plane from among the on pixels that indicate an attribute value exceeding a predetermined threshold. In-region image recognition means for recognizing a collection of on-pixels arranged continuously in a region as an in-region image;
Determining means for determining whether there is a collection of on-pixels adjacent to the in-area image recognized by the in-area image recognition means and continuously arranged outside the area;
If the determination means includes a set of on-pixels, an image obtained by integrating the set of on-pixels into the image in the region is recognized as an image in the extension region, and is adjacent to the image in the region and mutually outside the region. Extended region image recognition means for recognizing the image in the region as an image in the extended region when there is no group of continuously arranged on pixels;
A character image cutting device comprising: output means for outputting data indicating an image in the extended area recognized by the image recognition means in the extended area as image data representing one character.

Image data acquisition means for acquiring image data consisting of a collection of pixel data indicating attribute values of each of a plurality of pixels constituting an image arranged on a plane;
Of the pixels indicated by the pixel data included in the image data acquired by the image data acquisition means, the pixel is represented by a predetermined grid on the plane from among the on pixels that indicate an attribute value exceeding a predetermined threshold. In-region image recognition means for recognizing a collection of on-pixels arranged continuously in a region as an in-region image;
Expansion image generation means for performing expansion processing on the in-region image recognized by the in-region image recognition means, and generating an in-expansion area image;
Determining means for determining whether there is a collection of on-pixels adjacent to the in-expansion area image generated by the inflated image generation means and continuously arranged outside the area;
When the determination unit includes the collection of on-pixels, an image obtained by integrating the collection of on-pixels with the image in the expansion area is recognized as an image in the expansion area, and is adjacent to the image in the expansion area. An expansion in-expansion region image recognition means for recognizing the in-expansion region image as an expansion in-expansion region image when there is no collection of on-pixels arranged continuously outside.
A character image cutout device comprising: output means for outputting data indicating an image in the expanded expansion area recognized by the image recognition means in the expanded expansion area as image data representing one character.

Further comprising contraction image generation means for performing contraction processing on the image in the expansion expansion area recognized by the image recognition means in the expansion expansion area and generating an image in the contraction expansion expansion area,
3. The character according to claim 2, wherein the output unit outputs data indicating the image in the contraction / expansion / expansion region generated by the contraction image generation unit, instead of data indicating the image in the expansion / expansion region. Image cutting device.

Image data acquisition means for acquiring image data consisting of a collection of pixel data indicating attribute values of each of a plurality of pixels constituting an image arranged on a plane;
Of the pixels indicated by the pixel data included in the image data acquired by the image data acquisition means, the pixel is represented by a predetermined grid on the plane from among the on pixels that indicate an attribute value exceeding a predetermined threshold. In-region image recognition means for recognizing a collection of on-pixels arranged continuously in a region as an in-region image;
Expansion image generation means for performing expansion processing on the in-region image recognized by the in-region image recognition means, and generating an in-expansion area image;
Determining means for determining whether there is a collection of on-pixels adjacent to the in-expansion area image generated by the inflated image generation means and continuously arranged outside the area;
When the determination unit includes the collection of on-pixels, an image obtained by integrating the collection of on-pixels with the image in the expansion area is recognized as an image in the expansion area, and is adjacent to the image in the expansion area. An expansion in-expansion region image recognition means for recognizing the in-expansion region image as an expansion in-expansion region image when there is no collection of on-pixels arranged continuously outside.
A set of pixels that are on-pixels is recognized as a duplicate image in both the image indicated by the image data acquired by the image data acquisition unit and the image in the expansion expansion region recognized by the image recognition unit in the expansion expansion region. Duplicate image recognition means;
A character image cutting device comprising: output means for outputting data indicating a duplicate image recognized by the duplicate image recognition means as image data representing one character.

Image data acquisition means for acquiring image data consisting of a collection of pixel data indicating attribute values of each of a plurality of pixels constituting an image arranged on a plane;
Of the pixels indicated by the pixel data included in the image data acquired by the image data acquisition means, the pixel is represented by a predetermined grid on the plane from among the on pixels that indicate an attribute value exceeding a predetermined threshold. In-region image recognition means for recognizing a collection of on-pixels arranged continuously in a region as an in-region image;
Determining means for determining whether there is a collection of on-pixels adjacent to the in-area image recognized by the in-area image recognition means and continuously arranged outside the area;
If the determination means includes a set of on-pixels, an image obtained by integrating the set of on-pixels into the image in the region is recognized as an image in the extension region, and is adjacent to the image in the region and mutually outside the region. Extended region image recognition means for recognizing the image in the region as an image in the extended region when there is no group of continuously arranged on pixels;
Extended area recognition means for recognizing an area of a predetermined shape on the plane circumscribing an image in the extension area recognized by the image recognition means in the extension area as an extension area;
Output means for outputting data indicating an image indicated by a collection of on-pixels included in the extension area recognized by the extension area recognition means as image data representing one character. Out device.

The in-region image recognition unit re-executes the in-region image recognition processing with the extension region recognized by the extension region recognition unit as the region after the extension region is recognized by the extension region recognition unit. ,
The extended area image recognition means re-executes the recognition process of the extended area image for the image in the area recognized by re-execution of the recognition process by the intra-area image recognition means,
The extension area recognition means re-executes the extension area recognition process on the extension area image recognized by re-execution of the recognition process by the extension area image recognition means,
The output means outputs data indicating an image indicated by a collection of on-pixels included in an extended area recognized by re-execution of recognition processing by the extended area recognition means as image data representing the one character. The character image cutting device according to claim 5.

Image data acquisition means for acquiring image data consisting of a collection of pixel data indicating attribute values of each of a plurality of pixels constituting an image arranged on a plane;
Of the pixels indicated by the pixel data included in the image data acquired by the image data acquisition means, the pixel is represented by a predetermined grid on the plane from among the on pixels that indicate an attribute value exceeding a predetermined threshold. In-region image recognition means for recognizing a collection of on-pixels arranged continuously in a region as an in-region image;
Regarding at least one intra-region image among one or more intra-region images recognized by the intra-region image recognition means, a collection of on-pixels arranged adjacent to the intra-region image and continuously outside the region. Determination means for determining whether or not it exists;
An intra-region image excluding unit for excluding the intra-region image from the intra-region image collection recognized by the intra-region image recognition unit when the determination unit includes the set of on-pixels;
Output means for outputting, as image data representing one character, a collection of intra-area images that are not excluded by the intra-area image exclusion means among the intra-area images recognized by the intra-area image recognition means. A character image cutting device.

The area in the image excluding means relates the first region in an image, if the set of ON pixels that are arranged consecutively to each other in front Symbol territory outside and adjacent to the region in an image is present, the collection of the ON pixels When the area occupied by the image in one region is smaller than the area occupied, the image in the one region is excluded from the collection of images in the region recognized by the image recognition means in the region, and the collection of the on pixels occupies. The character image cutting device according to claim 7 , wherein when the area occupied by the image in one area is larger than the area, the image in the one area is not excluded from the collection of images in the area .

When there is a collection of on-pixels adjacent to the in-area image that has not been excluded by the in-area image excluding means and continuously arranged outside the area, the on-pixel collection is used as the in-area image. If the integrated image is recognized as an image in the extension area and there is no collection of on pixels that are adjacent to the image in the area and are continuously arranged outside the area, the image in the area is determined to be an image in the extension area. Further comprising an image recognition means in the extended area that recognizes as
The character image cutout according to claim 8 , wherein the output means outputs data indicating the image in the extended area recognized by the image recognition means in the extended area as image data representing the one character. apparatus.

Processing for obtaining image data composed of a collection of pixel data indicating attribute values of each of a plurality of pixels constituting an image arranged on a plane;
Among the pixels indicated by the pixel data included in the acquired image data, among the on pixels indicating attribute values exceeding a predetermined threshold value, the pixels are continuous with each other within the area represented by the predetermined grid on the plane. Processing for recognizing a collection of on-pixels arranged as an in-region image,
A process of determining whether there is a collection of on-pixels adjacent to the recognized in-region image and continuously arranged outside the region;
Recognizing an image obtained by integrating the collection of the ON pixels in the area in the image in the case where collection of pre SL on pixel exists as an extended area within the image, and adjacent to the area in the image to one another continuously outside the said region A process of recognizing the image in the area as an image in the extension area when there is no collection of arranged on pixels,
A program for causing a computer to execute a process of outputting data indicating a recognized image in an extended area as image data representing one character.

Processing for obtaining image data composed of a collection of pixel data indicating attribute values of each of a plurality of pixels constituting an image arranged on a plane;
Among the pixels indicated by the pixel data included in the acquired image data, among the on pixels indicating attribute values exceeding a predetermined threshold value, the pixels are continuous with each other within the area represented by the predetermined grid on the plane. Processing for recognizing a collection of on-pixels arranged as an in-region image,
For at least one in-region image among one or more recognized in-region images, it is determined whether there is a collection of on-pixels that are adjacent to the in-region image and are continuously arranged outside the region. Processing to
If a collection of pre-Symbol ON pixels are present, the process of excluding the region image from the collection of recognized areas in the image,
A program that causes a computer to execute a process of outputting a collection of intra-region images that are not excluded among recognized intra-region images as image data representing one character.