JPS6311718B2

JPS6311718B2 -

Info

Publication number: JPS6311718B2
Application number: JP58243828A
Authority: JP
Inventors: Shigeru Kakumoto; Koji Fujimoto; Kunihiro Okada
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1983-12-26
Filing date: 1983-12-26
Publication date: 1988-03-15
Also published as: JPS59132078A

Description

【発明の詳細な説明】〔発明の利用分野〕本発明は、大型の図面、帳票等に書かれた２次
元画像文字データを切出すための文字パターン切
出方法に関するものである。DETAILED DESCRIPTION OF THE INVENTION [Field of Application of the Invention] The present invention relates to a character pattern cutting method for cutting out two-dimensional image character data written on large drawings, forms, etc.

[Background of the invention]

例えば、光学的文字読取装置（以下OCRとい
う。）において、文字の書かれた帳票の走査を行
なう場合、光電変換素子を１次元的に配列したセ
ンサを使用し、そのセンサと帳票との位置を相対
的に移動させることによつて、２次元の走査を行
なつている。 For example, when an optical character reading device (hereinafter referred to as OCR) scans a document with characters written on it, a sensor with photoelectric conversion elements arranged one-dimensionally is used to detect the position of the sensor and the document. Two-dimensional scanning is performed by moving them relatively.

ところで、帳票が大型になり、１個のセンサで
帳票の巾方向の全体をカバーできない場合には、
第１図に示すように、複数個のセンサ１，２とレ
ンズ３，４とを使用し、帳票５の別々の部分をレ
ンズ３，４によりセンサ１，２上に結像させ、セ
ンサ１による走査線６とセンサ２による走査線７
との接続点を位置合せして１走査線を構成するよ
うにしたものが従来より知られている。 By the way, when the form becomes large and one sensor cannot cover the entire width of the form,
As shown in FIG. Scanning line 6 and scanning line 7 by sensor 2
Conventionally, it is known that one scanning line is constructed by aligning the connection points with the scanning line.

しかしながら、上述した従来の方法では、セン
サ１，２の走査線の接続点を精密に位置合わせし
たとしても、センサ１，２あるいはレンズ３，４
のばらつき、量子化誤差等により、センサ１，２
に得られる画像データにおいて、その接続点でず
れを生ずる。そのため、そのようなデータを認識
する場合、認識結果に悪影響を与えるという問題
があつた。 However, in the conventional method described above, even if the connecting points of the scanning lines of the sensors 1 and 2 are precisely aligned, the sensors 1 and 2 or the lenses 3 and 4
Due to variations in quantization, quantization errors, etc.
In the image data obtained during the process, a shift occurs at the connection point. Therefore, when such data is recognized, there is a problem that the recognition result is adversely affected.

また、センサ１，２で得られた画像データを認
識する場合、全体の画像データを一時メモリに記
憶しておく必要があるため、非常に大きなメモリ
容量を必要とするという問題があつた。 Furthermore, when recognizing the image data obtained by the sensors 1 and 2, it is necessary to temporarily store the entire image data in a memory, which poses a problem in that a very large memory capacity is required.

また、設計図等の大型図面を読取るための装置
においても、上述したと同様な問題があり、それ
に対する有効な解決策が見い出されていないのが
現状である。 Further, devices for reading large-sized drawings such as engineering drawings also have the same problem as described above, and no effective solution to the problem has been found at present.

[Object of the present invention]

本発明の目的は、極めて正確に認識が行なえ、
かつ、画像データを記憶するメモリの容量を低減
できる文字認識方法を提供することにある。 The purpose of the present invention is to enable extremely accurate recognition;
Another object of the present invention is to provide a character recognition method that can reduce the capacity of a memory for storing image data.

[Summary of the invention]

このような目的を達成するために、本発明で
は、読取るべき画面を複数個に分割し、互いに隣
接する分割画面の一部が重複するように構成し、
このような分割画面単位で文字を認識し、その認
識結果を照合して、全体の画面データの読取りを
行なうようにしたことに特徴がある。 In order to achieve such an object, the present invention divides the screen to be read into a plurality of parts, and configures the divided screens so that some of the adjacent divided screens overlap,
The feature is that characters are recognized in units of such divided screens, and the recognition results are compared to read the entire screen data.

[Embodiments of the invention]

以下、本発明の実施例を画面により詳細に説明
する。 Hereinafter, embodiments of the present invention will be explained in detail using screens.

第２図は、本発明により文字パターン切出し方
法の原理を示すもので、大型図面の読取りを行な
う例である。 FIG. 2 shows the principle of the character pattern cutting out method according to the present invention, and is an example of reading a large-sized drawing.

図から解るように、大画面１０は一度に処理で
きる単位である複数個の小領域１１，１２，１
３，……に分割されており、隣接する小領域、例
えば、１１と１３の間には重複領域１４が設けら
れ、１１と１２との間にも同様に重複領域１５が
設けられている。このような大画面１０のデータ
を読取る場合、小領域単位で行なわれる。例え
ば、小領域１１，１２の順番で矢印で示す行方向
に小領域単位で、光電変換を行ない、次いで、小
領域１３を含む行の光電変換を順次行なうように
し、また、このような光電変換で得られた各小領
域の画像データを順次認識処理するようにし、そ
して、各小領域の認識結果を基に全体の画面デー
タの読取りを完了する。 As can be seen from the figure, the large screen 10 has a plurality of small areas 11, 12, 1, which are units that can be processed at once.
3, . . . An overlapping area 14 is provided between adjacent small areas, for example, 11 and 13, and an overlapping area 15 is similarly provided between 11 and 12. When data on such a large screen 10 is read, it is performed in units of small areas. For example, photoelectric conversion is performed for each small region in the row direction indicated by the arrow in the order of small regions 11 and 12, and then photoelectric conversion is sequentially performed for the row including small region 13. The image data of each small area obtained in step 1 is sequentially recognized, and reading of the entire screen data is completed based on the recognition result of each small area.

第３図は、本発明による文字パターン切出し方
法を実現する大型図面読取装置の一実施例のブロ
ツク図を示すもので、２１は分割画面入力装置、
２２は分割画面格納用メモリ、２３は文字、線認
識部、２４は認識テーブル格納メモリ、２５は制
御部、２６は接続処理部である。 FIG. 3 shows a block diagram of an embodiment of a large-sized drawing reading device that implements the character pattern cutting method according to the present invention, in which numeral 21 denotes a split screen input device;
22 is a split screen storage memory, 23 is a character/line recognition section, 24 is a recognition table storage memory, 25 is a control section, and 26 is a connection processing section.

設計図などの画面には、第４図ａに示すように
一般に文字２７と線２８とが含まれており、それ
らを含む大画面を読取る場合、大画面１０中の小
領域、例えば、横および縦方向長さがSXおよび
SYである領域１１の分割画面を分割画面入力装
置２１により光電変換し、２値化処理などの前処
理を行なつた後、分割画面格納用画像メモリ２２
に格納する。 Screens such as blueprints generally include characters 27 and lines 28 as shown in FIG. Longitudinal length is SX and
After photoelectrically converting the split screen of the area 11 which is SY by the split screen input device 21 and performing preprocessing such as binarization processing, the split screen is stored in the image memory 22 for storing the split screen.
Store in.

したがつて、このメモリ２２には、第４図ｂに
示すようなデータとして格納される。但し、第４
図ｂの（Pi）は座標を示している。次に、このメ
モリ２２に格納された画像データを読取り、文
字、線認識部２３に送り、分割画面中の文字の切
出し認識および線の認識の結果、第４図ｃに示す
ような分割画面認識テーブル３０を得、それを認
識テーブル格納メモリ２４に格納する。この認識
結果では、線は端点、屈曲点等の特徴点の座標に
よつて表わされる。例えば、線ａは、座標P₁，
P₂，P₃，P₄によつて、線ａ（P₁，P₂，P₃，P₄）の
ように表わされる。また文字は、文字の種類とそ
の座標で表わされる。例えば、文字Ａは、文字Ａ
（P8）のように表わされる。さらに、図には示さ
れていないが、曲線は折れ線近似等の手法で表わ
される。このようにして小領域１１の分割画面の
認識が終了すると、次の小領域１２の分割画面の
取込み、認識処理を上述したと同様にして行な
い、大画面１０の全領域あるいは所望領域の分割
画面の認識処理を行なう。 Therefore, this memory 22 stores the data as shown in FIG. 4b. However, the fourth
(Pi) in Figure b indicates the coordinates. Next, the image data stored in the memory 22 is read and sent to the character/line recognition section 23, and as a result of character cutout recognition and line recognition in the split screen, split screen recognition as shown in FIG. 4c is performed. A table 30 is obtained and stored in the recognition table storage memory 24. In this recognition result, a line is represented by the coordinates of feature points such as end points and bending points. For example, line a has coordinates P ₁ ,
P ₂ , P ₃ , and P ₄ are expressed as a line a (P ₁ , P ₂ , P ₃ , P ₄ ). Characters are also represented by character types and their coordinates. For example, the letter A is
It is expressed as (P8). Furthermore, although not shown in the figure, the curve is represented by a method such as polygonal line approximation. When recognition of the split screen of the small area 11 is completed in this way, the next split screen of the small area 12 is captured and recognized in the same manner as described above, and the entire area of the large screen 10 or the split screen of a desired area is performs recognition processing.

このようにして、メモリ２４には、各分割画面
の認識結果が分割画面認識テーブル３０として格
納される。これらの処理が終了すると、接続処理
部２６では、これらの分割画面認識テーブル３０
の内容を読み出し、その内容を取捨選択して隣接
する分割画面間の認識結果の接続を行ない、その
結果、第４図ｄに示すような認識テーブル２９を
作成し、認識テーブル格納用メモリ２４に格納す
る。 In this way, the recognition results for each split screen are stored in the memory 24 as the split screen recognition table 30. When these processes are completed, the connection processing unit 26 uses these split screen recognition tables 30.
The contents are read out, the contents are selected and the recognition results are connected between adjacent split screens, and as a result, a recognition table 29 as shown in FIG. 4d is created and stored in the recognition table storage memory 24. Store.

上述したように、隣接した分割画面の入力に際
し、適当な重なり領域を設けているので、例え
ば、小領域１１の分割画面に一部のみが含まれ、
その切出しパターンでは認識不能である文字Ｃ
〔第４図ａ〕は、隣接する小領域１３の分割画面
に完全な型で含まれるため、その領域で完全に認
識され、認識不能とされた方が捨てられ、完全に
認識された方が選択される。これにより、認識テ
ーブル２９には正確な認識結果が得られる。ま
た、隣接する２つの小領域の分割画面のそれぞれ
に完全な型で含まれる文字、例えば文字Ｂはそれ
ぞれの認識結果に含まれるが、両認識結果の接続
に際して一方は捨てられ、他方のみが認識テーブ
ル２９に登録される。 As mentioned above, when inputting adjacent split screens, an appropriate overlapping area is provided, so for example, only a part of the split screen of the small area 11 is included,
Character C that is unrecognizable with that cutout pattern
[Figure 4a] is included in the divided screen of the adjacent small area 13 in its complete form, so it is completely recognized in that area, the one that is deemed unrecognizable is discarded, and the one that is completely recognized is discarded. selected. As a result, accurate recognition results can be obtained in the recognition table 29. In addition, characters that are included in complete form in each of the split screens of two adjacent small areas, such as the character B, are included in each recognition result, but when connecting both recognition results, one is discarded and only the other is recognized. It is registered in table 29.

また上述したように、１つの分割画面が画像メ
モリ２２に格納され、認識処理された後には次の
分割画面が画像メモリ２２に格納されるので、画
像メモリ２２としては１つの分割画面を記憶する
容量を有しておればよく、全体画面を格納する場
合に比べて、メモリ容量を著しく低減できる。 Furthermore, as described above, one split screen is stored in the image memory 22, and after recognition processing, the next split screen is stored in the image memory 22, so the image memory 22 stores one split screen. It is sufficient to have the capacity, and the memory capacity can be significantly reduced compared to the case where the entire screen is stored.

なお、第３図の構成では分割画面入力装置２
１，文字、線認識部２３および接続処理部２６の
動作は、制御部２５からの制御データ、タイミン
グ信号によつて制御される。 In addition, in the configuration shown in FIG. 3, the split screen input device 2
1. The operations of the character/line recognition section 23 and the connection processing section 26 are controlled by control data and timing signals from the control section 25.

以下、第３図の装置の各部分の動作につき、以
下に詳細に説明する。 The operation of each part of the apparatus shown in FIG. 3 will be explained in detail below.

第５図は第３図の分割画面入力装置２１の具体
的構成の一例を示すものである。 FIG. 5 shows an example of a specific configuration of the split screen input device 21 shown in FIG.

図において、３１は大画面載置台、３２はテレ
ビカメラ等からなる画像入力ヘツド、３３は可動
レール、３４、３５は固定レール、３６は画像入
力ヘツド移動用モータ、３７は可動レール移動用
モータ、３８はモータ駆動回路、３９は画像入力
ヘツド移動制御回路、４０は画像入力制御回路を
示す。また、１０は大画面、２２は画像メモリで
ある。 In the figure, 31 is a large screen mounting table, 32 is an image input head such as a television camera, 33 is a movable rail, 34 and 35 are fixed rails, 36 is a motor for moving the image input head, 37 is a motor for moving the movable rail, 38 is a motor drive circuit, 39 is an image input head movement control circuit, and 40 is an image input control circuit. Further, 10 is a large screen, and 22 is an image memory.

このような構成において、制御部２５から画像
入力ヘツド移動制御回路３９に画像入力ヘツド３
２の移動目標値をセツトし、それによつて、モー
タ駆動回路３８を通して可動レール移動用モータ
３７および画像用モータ３６を駆動し、画像入力
ヘツド３２を可動レール３３に沿つてＹ方向に移
動させるとともに、可動レール３３を固定レール
３４，３５に沿つてＸ方向に移動させ、画像入力
ヘツド３２を目標位置に移動させる。 In such a configuration, the image input head 3 is controlled from the control section 25 to the image input head movement control circuit 39.
Accordingly, the movable rail moving motor 37 and the image motor 36 are driven through the motor drive circuit 38 to move the image input head 32 along the movable rail 33 in the Y direction. , the movable rail 33 is moved in the X direction along the fixed rails 34 and 35, and the image input head 32 is moved to the target position.

次に、制御部２５から画像入力制御回路４０に
入力の起動を発すると、画像入力ヘツド３２のテ
レビカメラ等で撮像された小領域の分割画面の映
像信号が前処理され、その結果が画像メモリ２２
に記憶される。 Next, when the control unit 25 issues an input activation signal to the image input control circuit 40, the video signal of the divided screen of the small area captured by the television camera or the like of the image input head 32 is preprocessed, and the result is stored in the image memory. 22
is memorized.

そして、次々に制御部２５から移動目標値が入
力され、その度に、対応する小領域の分割画面が
画像メモリ２２に格納される。 Then, movement target values are input from the control unit 25 one after another, and each time the divided screen of the corresponding small area is stored in the image memory 22.

なお、画像入力ヘツド３２の内に、Ｘ方向に直
線状に配列されたアレーセンサを用いる場合に
は、画像入力ヘツド３２を目標位置に移動させた
後、画像入力ヘツド３２をＹ方向に小領域に対応
する分だけ移動させながら、映像信号を入力に、
それが終つた後、重複領域分だけ、画像入力ヘツ
ド３２を後退させ、次の隣接領域の分割画面の映
像信号を入力すればよい。 Note that when using an array sensor linearly arranged in the X direction in the image input head 32, after moving the image input head 32 to the target position, move the image input head 32 to a small area in the Y direction. input the video signal while moving it by the amount corresponding to
After this is completed, the image input head 32 is moved back by the amount of the overlapping area, and the video signal of the divided screen of the next adjacent area is input.

なお、分割画面入力装置２１としては、画像入
力ヘツドを固定して、図面を移動させるようにし
たものを用いることもできるし、また、全面ある
いは必要な画面部分をカバーする複数個の画像入
力ヘツドを配置し、それらヘツドからの出力を選
択的に取り出すようにしてもよい。 As the split screen input device 21, a device with a fixed image input head and a movable drawing can be used, or a device with a plurality of image input heads that cover the entire screen or a necessary part of the screen can be used. The outputs from these heads may be selectively taken out.

分字、線認識部２３における認識は従来から種
種の方法があり、特に説明を要さない。装置とし
てはマイクロプロセツサの使用が考えられる。 Conventionally, various methods have been used for recognition in the segmented character/line recognition section 23, and no particular explanation is required. A microprocessor may be used as the device.

第３図の接続処理部２６における文字の接続
は、隣接する領域の分割画面の文字の座標を比較
し、一致すれば、一方の文字を消去し、一致しな
ければ、両方を認識テーブルに登録することによ
り行なう。 To connect characters in the connection processing unit 26 in FIG. 3, the coordinates of characters in split screens in adjacent areas are compared, and if they match, one character is deleted, and if they do not match, both are registered in the recognition table. Do by doing.

このような処理をマイクロコンピユータ等のプ
ロセツサを使つて実行するようにすれば、簡単に
処理できる。また、上記した機能を果す専用装置
を接続処理部２６として構成するようにしてもよ
い。 Such processing can be easily performed by using a processor such as a microcomputer. Further, a dedicated device that performs the above-described functions may be configured as the connection processing section 26.

なお、制御部２５としては、マイクロコンピユ
ータ等のプロセツサで構成され得る。 Note that the control unit 25 may be configured with a processor such as a microcomputer.

また、メモリ２２および２４としては、マイク
ロコンピユータ等のランダムアクセスメモリなど
が使用できる。 Further, as the memories 22 and 24, random access memories such as microcomputers can be used.

さらに、第３図の装置を１つのマイクロコンピ
ユータ等で構成するようにしてもよい。 Furthermore, the apparatus shown in FIG. 3 may be configured with one microcomputer or the like.

第６図は、本発明によるパターン認識方法を実
現するOCRの例の原理を示すものである。 FIG. 6 shows the principle of an example of OCR that implements the pattern recognition method according to the invention.

第６図ａのように、大型帳票５の文字の読取り
を行なう場合、従来と同様、複数のセンサ１，２
と対応するレンズ３，４を設けるが、本発明で
は、２つの隣接するセンサ１，２では、１文字を
充分カバーできるだけの巾だけ重複して走査を行
なうようになつている。その結果、センサ１およ
び２で走査される領域Ａ１およびＢ１は、例え
ば、第６図ｂ，ｃに示すようになる。このように
すると、センサ１，２の走査領域の重複部分付近
での文字は、第６図ｂ，ｃのように、必らず、い
ずれかの領域に完全な型で含まれる。例えば、第
６図ｂでは、文字「５」は両センサ１および２の
走査領域Ａ１およびＢ１に完全な型で含まれる。
また、第６図ｃでは、文字「４」はセンサ１の走
査領域A1に、文字「５」はセンサ２の走査領域
Ｂ１にそれぞれ完全な型で含まれている。 As shown in FIG. 6a, when reading characters on a large form 5, a plurality of sensors 1, 2
However, in the present invention, two adjacent sensors 1 and 2 perform overlapping scanning by a width sufficient to cover one character. As a result, the areas A1 and B1 scanned by sensors 1 and 2 become as shown in FIGS. 6b and 6c, for example. In this way, characters near the overlapping area of the scanning areas of the sensors 1 and 2 will necessarily be included in complete form in either area, as shown in FIGS. 6b and 6c. For example, in FIG. 6b, the character "5" is included in complete form in the scanning areas A1 and B1 of both sensors 1 and 2.
Further, in FIG. 6c, the character "4" is included in the scanning area A1 of the sensor 1, and the character "5" is included in the scanning area B1 of the sensor 2 in complete form.

〔Effect of the invention〕

以上述べたように、本発明によれば、大画面の
認識が、極めて正確にできるばかりか、画面メモ
リの容量の著るしい低減を計ることができる。 As described above, according to the present invention, not only can a large screen be recognized extremely accurately, but also the capacity of the screen memory can be significantly reduced.

[Brief explanation of the drawing]

第１図は従来のOCRの説明図、第２図は本発
明によるパターン認識方法の原理の説明図、第３
図は本発明によるパターン認識方法を実現する大
型図面読取装置の一実施例のブロツク図、第４図
は第３図の動作の説明図、第５図は第３図の分割
画面入力装置の一例の具体的構成図、第６図は本
発明によるパターン認識方法を実現するOCRの
原理説明図である。１０は大画面、１１〜１３は
小領域、１４，１５は重複領域、２１は分割画面
入力装置、２２は画像メモリ、２６は接続処理部
を示す。 Fig. 1 is an explanatory diagram of conventional OCR, Fig. 2 is an explanatory diagram of the principle of the pattern recognition method according to the present invention, and Fig. 3 is an explanatory diagram of the principle of the pattern recognition method according to the present invention.
The figure is a block diagram of an embodiment of a large drawing reading device that implements the pattern recognition method according to the present invention, FIG. 4 is an explanatory diagram of the operation of FIG. 3, and FIG. 5 is an example of the split screen input device of FIG. 3. FIG. 6 is a diagram explaining the principle of OCR that realizes the pattern recognition method according to the present invention. 10 is a large screen, 11 to 13 are small areas, 14 and 15 are overlapping areas, 21 is a split screen input device, 22 is an image memory, and 26 is a connection processing section.

Claims

[Claims]

1. Divide a screen with characters into a plurality of subdivisions so as to have parts that overlap with adjacent subdivisions, photoelectrically convert the image on the partial divisions, recognize the character pattern from the photoelectric conversion signal, and As the recognition result for the character on the boundary line of one of the pair of partial compartments, the recognition result of the character at the position corresponding to the position of the character on the boundary line in the other compartment of the pair of adjacent partial compartments is adopted. A character recognition method characterized by: