JPH0789361B2

JPH0789361B2 - Form registration device

Info

Publication number: JPH0789361B2
Application number: JP60030368A
Authority: JP
Inventors: 康明中野; 浩道藤沢; 邦弘岡田; 正雄道野
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1985-02-20
Filing date: 1985-02-20
Publication date: 1995-09-27
Anticipated expiration: 2010-09-27
Also published as: JPS61190676A

Description

【発明の詳細な説明】〔発明の利用分野〕本発明は文書の処理方式に係り、特に表形式の文書上の
文字を読み取る目的で帳票を予め登録する目的に好適な
帳票登録方式に関する。Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document processing system, and more particularly to a form registration system suitable for registering a form in advance for the purpose of reading characters on a tabular document.

[Background of the Invention]

従来の文字読み取り装置（以下、OCRと略する）では、
読み取り領域をOCRが感じない色（ドロツプアウトカラ
ー）を用いて印刷する必要があり、印刷費用が高価にな
る問題があつた。さらに文字読み取り領域の位置を文書
のエツジからの距離として記述し、その領域の文字数や
文字種・チエツク方法などを指定する必要があり、煩雑
であつた。以下、文字読み取り領域の位置情報や領域内
の文字数や文字種・チエツク方法などの情報を書式情報
と呼ぶ。文字認識機能は専用OCRだけでなく、文書フア
イルやOA用ワークステーシヨンなどにも要求されている
が、上記の問題点が広い普及を阻んでいた。In the conventional character reading device (hereinafter abbreviated as OCR),
Since it is necessary to print the reading area in a color that OCR does not feel (dropout color), the printing cost becomes high. Further, it is necessary to describe the position of the character reading area as the distance from the edge of the document, and to specify the number of characters in that area, the character type, the check method, etc., which is complicated. Hereinafter, the position information of the character reading area, the number of characters in the area, the character type, the check method, and the like are referred to as format information. The character recognition function is required not only for dedicated OCR, but also for document files and OA workstations, but the above problems hindered its widespread use.

従来技術では、たとえば公開特許公報昭和58−20718
（昭和58年12月２日公開）において、入力画像からあら
かじめメモリ内に格納しておいた定形的な画像を除去す
ることによりドロツプアウトカラー以外の文書を使用可
能とする方法が開示されている。しかし、この方法で
は、定形的な画像を除去した残りの画像に対して書式情
報を指定する必要がある点は従来と同様であつた。In the related art, for example, Japanese Patent Laid-Open Publication No. Showa 58-20718
(Published December 2, 1983), a method is disclosed in which a document other than drop-out color can be used by removing a fixed image stored in advance in a memory from an input image. ing. However, this method is similar to the conventional one in that it is necessary to specify the format information for the remaining images from which the standard images have been removed.

[Object of the Invention]

本発明の目的は、入力文書の構造を自動的に理解して文
書の書式を自動的に生成することにより、対象文書の範
囲を大幅に拡大するとともに、書式情報作成の手間を簡
略化する手段を抵抗することにある。An object of the present invention is to automatically understand the structure of an input document and automatically generate a document format, thereby significantly expanding the range of a target document and simplifying the labor for creating format information. To resist.

本発明の他の目的は、定形的な入力文書の種類を自動的
に識別してあらかじめ記憶した書式情報を用いることに
より、種類の異なる定形的な文書の連続読み取りを可能
とする手段を提供することにある。Another object of the present invention is to provide means for enabling continuous reading of standard documents of different types by automatically identifying the type of a standard input document and using prestored format information. Especially.

[Outline of Invention]

かかる目的を達成するために、本発明においては識別対
象文書（帳票）の画像から枠部分ならびに枠部分の間の
関係を抽出し、さらに枠の内部の画像及びその枠の性質
に対する選択情報（以下メニユーと呼ぶ）を表示してそ
の枠の性質を入力させ、枠部分自体の情報を使用者が選
択した枠の性質とから帳票の構造を理解することによ
り、自動的に識別対象文書の書式情報の決定を行うもの
である。In order to achieve such an object, in the present invention, a frame portion and a relationship between the frame portions are extracted from the image of the identification target document (form), and further, an image inside the frame and selection information for the property of the frame (hereinafter referred to as “selection information”). (Referred to as “menu”), the character of the frame is entered, and the information of the frame itself is understood from the property of the frame selected by the user to understand the structure of the form. Is to make a decision.

すなわち、枠にはデータの記入が予定されているデータ
枠と、普通はデータの記入がなされない項目枠があり、
項目枠とデータ枠の区別及び項目枠の属性が分かれば、
帳票の構造を理解できることを利用する。項目枠には通
常項目名称が印刷（または手書き）されているので、こ
の項目名称を表示して人間がその種類・属性を与えるこ
とにより、帳票の構造を自動的に理解させるものであ
る。That is, the frame has a data frame in which data is scheduled to be entered and an item frame in which data is not normally entered,
If the distinction between the item frame and the data frame and the attribute of the item frame are known,
Use the fact that you can understand the structure of the form. Since the item name is usually printed (or handwritten) in the item frame, the item name is displayed and a person gives the type / attribute to automatically understand the structure of the form.

Example of Invention

以下、本発明を図面を用いて詳細に説明する。 Hereinafter, the present invention will be described in detail with reference to the drawings.

第１図は本発明の文書理解方式を実施する装置の構成を
示すブロツク図である。装置の各部はバス１に接続さ
れ、全体の動作は制御部２により制御される。文書３上
に情報（文書画像）は光電変換装置４により走査され、
さらにデイジタル化され、バス１を介してメモリ51に格
納される。メモリ51は後述する52,53,54とともにメモリ
５の一部をなす。デイジタル化の際、公知の高効率符号
化処理を行つてもよく、これにより文書画像を記憶する
メモリの記憶容量を節約できる。また、以下の説明では
１画素１ビツトに二値化するものとするが、１画素を多
値で表現してもよく、カラースキヤナにより光電変換し
て色情報を付与してもよい。FIG. 1 is a block diagram showing the configuration of an apparatus for implementing the document understanding system of the present invention. Each unit of the device is connected to the bus 1, and the entire operation is controlled by the control unit 2. Information (document image) on the document 3 is scanned by the photoelectric conversion device 4,
Further, it is digitized and stored in the memory 51 via the bus 1. The memory 51 forms a part of the memory 5 together with 52, 53 and 54 described later. A well-known high-efficiency encoding process may be performed at the time of digitalization, whereby the storage capacity of the memory for storing the document image can be saved. Further, in the following description, it is assumed that one pixel is binarized into one bit, but one pixel may be represented by multiple values, or color information may be provided by photoelectric conversion by a color scanner.

文書の処理には登録モードと識別モードとがある。モー
ドの選択はキーボード６、デイスプレイ７を用いて制御
部２とマンマシン対話により行う。まず、登録モードの
場合を説明する。登録モードの処理の流れ図を第２図に
示す。この処理は制御部２のソフトウエアにより実行さ
れる。第２図で、201は識別対象文書画像の入力であ
り、この文書上の画像をメモリ51の中に格納する。202
では入力した文書画像に対し位置補正処理、傾き補正処
理などの正規化を行い、その結果得られる画像がメモリ
52に格納される。この正規化画像に対し、203で線抽出
処理が行われ、抽出された線パターンはメモリ53に格納
される。第３図は線パターン抽出の処理内容を説明する
図であり、（Ａ）のような入力画像から水平方向の線抽
出により（Ｂ）のような水平線パターンを得、同様に
（Ａ）から垂直方向の線抽出により（Ｃ）のような垂直
線パターンを得、（Ｂ）と（Ｃ）との論理和により
（Ｄ）のような画像を得る。以後（Ｄ）を線パターンと
呼ぶ。204では線パターンから輪郭抽出により輪郭上の
座標点列を抽出する。この座標点列は輪郭の個数だけ得
られる。以下では輪郭の中で内輪郭だけを処理する。こ
こで内輪郭とは白地を取り囲む黒地の境界であり、白地
を右側に黒地を左側に見るようにして輪郭を一周したと
き時計回りに一周するような輪郭を言う。205ではこの
内輪郭の中から長方形をなすものを抽出する。以後長方
形をなす内輪郭を枠と呼ぶ。206では枠の４隅の座標を
用いて枠の順序を左上から右下へと並べ換える。207で
は抽出した枠が複数個あつたとき、枠の間の相互関係す
なわち枠ｍと枠ｎの間の関係コードCmnを抽出する。関
係コードCmnの定義の例を次に示す。Document processing has a registration mode and an identification mode. Mode selection is performed by man-machine interaction with the control unit 2 using the keyboard 6 and the display 7. First, the case of the registration mode will be described. A flowchart of the registration mode processing is shown in FIG. This processing is executed by the software of the control unit 2. In FIG. 2, 201 is an input of a document image to be identified, and the image on this document is stored in the memory 51. 202
Then, the input document image is normalized such as position correction processing and tilt correction processing, and the resulting image is stored in the memory.
Stored in 52. Line extraction processing is performed at 203 on this normalized image, and the extracted line pattern is stored in the memory 53. FIG. 3 is a diagram for explaining the processing contents of line pattern extraction. A horizontal line pattern as shown in FIG. 3B is obtained by horizontal line extraction from an input image as shown in FIG. A vertical line pattern such as (C) is obtained by extracting the direction lines, and an image such as (D) is obtained by the logical sum of (B) and (C). Hereinafter, (D) is called a line pattern. In 204, the coordinate point sequence on the contour is extracted from the line pattern by contour extraction. This coordinate point sequence is obtained by the number of contours. In the following, only the inner contour is processed among the contours. Here, the inner contour is a boundary of a black background that surrounds a white background, and refers to a contour that goes around clockwise when the white background is viewed on the right and the black background is viewed on the left. In 205, a rectangle is extracted from this inner contour. Hereinafter, the inner contour forming a rectangle is called a frame. In 206, the order of the frames is rearranged from the upper left to the lower right using the coordinates of the four corners of the frame. In 207, when there are a plurality of extracted frames, the mutual relationship between the frames, that is, the relation code Cmn between the frames m and n is extracted. The following is an example of the definition of the relation code Cmn.

Cmn＝ 0:以下の条件が全て不成立＝ 1:mがｎを包含＝−1:nがｍを包含＝ 2:mがｎの直上＝−2:mがｎの直下＝ 3:mがｎの直左＝−3:mがｎの直右＝ 4:mがｎの上方＝−4:mがｎの下方＝ 5:mがｎの左方＝−5:mがｎの右方 208では抽出した枠に対応する領域の画像を原画像から
取り出し、順次この画像をデイスプレイ７に送つて枠内
の画像を表示する。209ではこの枠の性質に関するメニ
ユーをデイスプレイ７に表示する。210ではこのメニユ
ーに対する使用者の応答をキーボードから読み込む。枠
の性質には、種類（項目枠かデータ枠か）の区別及び属
性（空白、人名、地名、年月日、品名、人間関係）が含
まれる。属性のメニユー表示及び応答入力は項目枠につ
いてのみ行う。また枠内が空白であることを自動的に判
定して、空白枠は自動的にデータ枠と決定してもよい。
211では枠ｍの性質Am、枠間の関係コードCmnなどを文書
の構造に関する知識と照合して、入力文書の構造を理解
する。知識は「if〜then〜」形式、すなわち条件と結論
とにより表現される。知識の例を次に示す。但し、a,b,
cは枠番号を示す。Cmn = 0: All of the following conditions are not satisfied = 1: m includes n = -1: n includes m = 2: m is directly above n = -2: m is directly below n = 3: m is n To the left =-3: m is to the right of n = 4: m is above n = -4: m is below n = 5: m is to the left of n = -5: m is to the right of n 208 Then, the image of the area corresponding to the extracted frame is taken out from the original image, and this image is sequentially sent to the display 7 to display the image in the frame. In 209, a menu regarding the nature of this frame is displayed on the display 7. At 210, the user's response to this menu is read from the keyboard. The characteristics of the frame include distinction between types (item frame or data frame) and attributes (blank, person's name, place name, date, item name, human relationship). Attribute menu display and response input are performed only for the item frame. Alternatively, it may be automatically determined that the inside of the frame is blank, and the blank frame may be automatically determined as the data frame.
At 211, the structure Am of the frame m, the relation code Cmn between the frames, and the like are compared with the knowledge about the structure of the document to understand the structure of the input document. Knowledge is expressed in "if-then-" form, that is, conditions and conclusions. An example of knowledge is shown below. However, a, b,
c indicates the frame number.

1. （条件） Aa＝‘人名’＆枠ａは項目枠（結論）枠ａの項目名称＝‘人名’ 2. （条件） Aa＝‘空白’＆ Ab＝‘人名’＆ Cab＝−３（結論）枠ａは枠ｂに従属するデータ枠枠ａの属性＝‘人名’ 3. （条件） Aa＝‘空白’＆ Ab＝‘地名：都道府県’＆ Cab＝−４ Ac＝‘地名’＆ Cac＝−３（結論）枠ａは枠b,cに属するデータ枠枠ａの属性＝‘地名：都道府県’ 4. （条件） Aa＝‘年月日’ （結論）枠ａは独立なデータ枠枠ａの属性＝‘時間：年月日’ 5. （条件） Aa＝‘空白’ （結論）枠ａは独立なデータ枠枠ａの項目名称＝‘雑’ 枠ａの属性＝‘雑’ すなわち、データ枠の属性はそれが属する項目枠によつ
て自動的に決定される。1. (Condition) Aa = 'Person name'& Box a is item frame (Conclusion) Item name of box a = 'Person name' 2. (Condition) Aa = 'Blank'& Ab = 'Person name'& Cab = -3 ( Conclusion) Frame a is a data frame that is subordinate to frame b Attribute of frame a = 'Person name' 3. (Condition) Aa = 'Blank'& Ab = 'Place name: Prefecture'& Cab = -4 Ac = 'Place name'& Cac = -3 (Conclusion) Frame a is a data frame belonging to frames b and c Attribute of frame a = 'Place name: prefecture' 4. (Condition) Aa = 'Date of year' (Conclusion) Frame a is independent data Frame Attribute of frame a = 'Time: date' 5. (Condition) Aa = 'Blank' (Conclusion) Frame a is an independent data frame Item name of frame a = 'Miscellaneous' Attribute of frame a = 'Miscellaneous' That is, the attribute of the data frame is automatically determined by the item frame to which it belongs.

上記の知識の具体的な実現法はプログラムの「if〜then
〜」文としてかかれたサブルーチンであり、知識との照
合はa,bのところにm,nを代入し、条件が成立するか否か
を調べた結果を外部に引数として与えればよい。条件が
成立したときは結論も引数として外部に与える。知識た
とえば1.の条件のa,bにあてはまる枠番号m,nが見出され
たとすれば、結論の枠番号ａのところに枠番号ｍを代入
したものを入力文書の構造データとして登録する。構造
データは上記の枠の関係コード、枠の種類（項目枠、デ
ータ枠あるいは両者）、従属関係、項目名称、属性コー
ドなどからなる。複数の知識に該当する場合には、一例
として出現順序の早い知識を優先させると決めればよ
い。最も優先度の低い知識として他の知識にあてはまら
ないときの救済措置（上記の場合は5.）を決めておけ
ば、すべての枠についての構造が決定される。すべての
枠について知識との照合が行われ構造が決定されると、
構造データの登録が終了する。212でマスクデータを決
定する。マスクデータとは、データ枠の中にありかじめ
印刷された文字（上記の4.における年月日などの単位名
称に相当）の位置を示すものである。マスクデータを決
定するには、たとえば枠内の黒画素の連結領域を抽出し
て十分大きい連結領域の外接四角形を求めればよい。21
3で構造データ及び枠の物理データ（枠の４隅の座標な
ど）、マスクデータを出力装置９に書き出す。構造デー
タ、枠の物理データ、マスクデータは従来のOCRの書式
データに相当する。書き出しに際してキーボード６、デ
イスプレイ７を用いて、入力文書に対する識別番号やフ
アイル名称を入力する。The concrete realization method of the above knowledge is "if ~ then
It is a subroutine written as a "~" statement, and the collation with the knowledge can be performed by substituting m and n for a and b and giving the result of checking whether the condition is satisfied to the outside as an argument. When the condition is satisfied, the conclusion is also given to the outside as an argument. Knowledge If, for example, the frame numbers m and n that meet the conditions a and b of 1. are found, the frame number m is substituted for the frame number a of the conclusion and registered as the structural data of the input document. The structural data includes the frame relation code, frame type (item frame, data frame, or both), subordinate relationship, item name, attribute code, and the like. If the knowledge corresponds to a plurality of pieces of knowledge, it may be determined that the knowledge having the earliest appearance order has priority. By determining the remedy (5. in the above case) when it does not apply to other knowledge as the lowest priority knowledge, the structure for all the frames is determined. Once all the boxes have been matched against the knowledge and the structure has been determined,
Registration of structural data is completed. At 212, the mask data is determined. The mask data indicates the position of a character (corresponding to a unit name such as the date, etc. in 4. above) that is printed in the data frame. To determine the mask data, for example, a connected area of black pixels in the frame may be extracted and a circumscribed rectangle of a sufficiently large connected area may be obtained. twenty one
At 3, the structure data, the physical data of the frame (coordinates of the four corners of the frame, etc.) and the mask data are written to the output device 9. Structural data, frame physical data, and mask data correspond to the conventional OCR format data. At the time of writing, the identification number and file name for the input document are input using the keyboard 6 and the display 7.

以上で登録モードの説明を終了する。この登録モード処
理はあらかじめ行つて構造データ等の書式情報を格納し
ておくものとするが、次に述べる識別モード処理と統合
して行うことも可能である。This is the end of the description of the registration mode. This registration mode processing is performed in advance to store format information such as structural data, but it is also possible to perform it in combination with the identification mode processing described below.

次に識別モードにおける入力対象文書の処理について説
明する。第４図に識別モードの処理の流れ図を示す。40
1は入力文書の画像の取り込み、402は正規化、403は線
パターンの抽出、404は輪郭抽出、405は長方形抽出、40
6は枠順序付け、407は関係コード抽出であり、第２図の
202〜207と同様である。408では見本文書の構造データ
及び物理データを順次メモリ53に読みだし、メモリ54の
入力文書との構造データ及び物理データと比較し、入力
文書がどの見本文書と一致するかを決定する。この見本
文書の構造データ及び物理データを用いて入力文書の読
み取りを以下のようにして行う。409は枠について以下
の410〜414を繰り返し実行するループである。410はデ
ータ枠についてのみ読み取りを行う判定である。411で
はこのデータ枠の内部の領域だけを抽出する。412では
マスクデータによりあらかじめ印刷された文字を消去す
る（マスクデータが存在するときのみ行う）。413では
属性コードにより定まる字種を選択し、認識辞書（文字
認識のための標準パターン）を設定する。414では枠内
部の領域を認識部８に送つて枠内部の文字を認識する。
415では属性コードにより定まる単語辞書を選択し、416
で単語照合を行つて文字認識結果の誤読あるいは不読を
検出あるいは修正する。417では文字認識結果と構造デ
ータ中の必要な部分編集し、出力装置９に書き出す。編
集の例として、たとえばデータ枠の属する項目枠の属性
が「人名」であり「前記知識の2.に相当）、この枠の認
識結果が‘山田太郎’であつたとすれば、 ‘人名’＝‘山田太郎’ のようにし、これを１レコードとして書き出す。また、
独立な項目枠・データ枠で項目名称＝‘時間：年月
日’、単位名称＝‘年月日’であり（知識4.に相当）単
位名称の部分がマスクデータにより消去されたため、認
識結果が58529だつたとすれば、編集出力は ‘日時’＝‘58年５月29日’ とする。後者の例では認識部８から文字の位置座標を取
り出し、マスクデータと座標の比較により位置を決定し
て編集する。Next, processing of the input target document in the identification mode will be described. FIG. 4 shows a flow chart of processing in the identification mode. 40
1 is the input document image capture, 402 is normalization, 403 is line pattern extraction, 404 is contour extraction, 405 is rectangle extraction, 40
6 is frame ordering, 407 is relation code extraction, and
Same as 202-207. In 408, the structural data and physical data of the sample document are sequentially read into the memory 53 and compared with the structural data and physical data of the input document in the memory 54 to determine which sample document the input document matches. The input document is read using the structural data and physical data of the sample document as follows. 409 is a loop for repeatedly executing the following 410 to 414 for the frame. 410 is a judgment to read only the data frame. At 411, only the area inside this data frame is extracted. In 412, the characters printed in advance by the mask data are deleted (only when the mask data exists). In 413, a character type determined by the attribute code is selected and a recognition dictionary (standard pattern for character recognition) is set. At 414, the area inside the frame is sent to the recognition unit 8 to recognize the character inside the frame.
At 415, select the word dictionary determined by the attribute code,
The word matching is performed to detect or correct misreading or nonreading of the character recognition result. At 417, a necessary part of the character recognition result and the structure data is edited and written to the output device 9. As an example of editing, if the attribute of the item frame to which the data frame belongs is “personal name” and “corresponds to 2. of the above knowledge” and the recognition result of this frame is “Taro Yamada”, then “personal name” = Write it out as one record like "Taro Yamada".
Independent item / data frame with item name = 'Time: year / month / day', unit name = 'year / month / day' (corresponding to knowledge 4.) If the output is 58529, the edited output will be "Date" = "May 29, 58". In the latter example, the position coordinate of the character is taken out from the recognition unit 8, and the position is determined by the comparison with the mask data and the coordinate to edit.

以上本発明の一実施例について説明した。本実施例に対
して通常の文字認識装置や図形処理装置で行われる処
理、たとえば枠の抽出結果の表示・修正、文字認識結果
の表示・修正などを付加することは容易である。また、
枠認識に際して線抽出や長方形決定の閾値を複数個設
け、自動判定の結果が艱難なときはマンマシン的に最適
値を選択したり、二値化レベルを変えて再試行するなど
の変更を加えてもよい。さらに、識別モードにおいて文
書の種類があらかじめ決まつている場合はキーボードか
ら見本文書の番号を与えるようにして、構造データなど
の照合を省略してもよい。The embodiment of the present invention has been described above. It is easy to add processing performed by a normal character recognition device or a graphic processing device to this embodiment, for example, display / correction of a frame extraction result, display / correction of a character recognition result, or the like. Also,
Multiple thresholds for line extraction and rectangle determination are provided for frame recognition, and when the result of automatic determination is difficult, the optimum value is selected in a man-machine manner or the binarization level is changed and retried is added. May be. Further, when the type of document is determined in advance in the identification mode, the number of the sample document may be given from the keyboard so that the collation of structural data and the like may be omitted.

なお、本実施例では文書から線パターンを抽出し、それ
から枠を抽出するとしたが原文書パターンから直接輪郭
を抽出し、それから枠を抽出するようにすることも可能
であり、この場合には枠以外の輪郭を後で他処理たとえ
ば文字切り出しや文字認識に利用できる。Although the line pattern is extracted from the document and the frame is extracted from the document in the present embodiment, it is also possible to directly extract the contour from the original document pattern and then extract the frame. In this case, the frame is extracted. Contours other than can be used later for other processing such as character cutting and character recognition.

また、本実施例では文書構造の記述として枠形式のもの
に限定したが、実線あるいは破線などの罫線や、円など
を用いるように拡張することも可能である。たとえば実
線からなる罫線は、細長い外輪郭を抽出することで検出
でき、罫線の上方の領域を切り出すことによつて下線を
引いた表題文字の認識が可能である。円の検出は輪郭座
標系列を極座標表示し、この空間で直線検出すればよ
い。Further, in the present embodiment, the description of the document structure is limited to the frame format, but it is also possible to extend it to use ruled lines such as a solid line or a broken line and circles. For example, a ruled line composed of a solid line can be detected by extracting a slender outer contour, and an underlined title character can be recognized by cutting out an area above the ruled line. The circle can be detected by displaying the outline coordinate series in polar coordinates and detecting a straight line in this space.

また、枠の外部の文字も読み取つて、構造記述に利用す
るこもできる。色情報を利用し構造記述に色属性を付加
することも有効である。Also, the characters outside the frame can be read and used for the structure description. It is also effective to add color attributes to the structure description using color information.

さらに、文書から直接光電変換する場合に限定せず、画
像フアイルから読みだした文書画像に対しても適用でき
る。Further, the present invention is not limited to the case where the photoelectric conversion is directly performed on the document, and the invention can be applied to the document image read from the image file.

〔The invention's effect〕

以上説明したごとく、本発明によれば簡単な構成で入力
文書の構造を自動的に理解して文書の書式を自動的に生
成することが可能で、従来のように人間による書式情報
の作成作業が不要であり、黒色で印刷された表形式の文
書が読み取れるのでドロツプアウトカラーで印刷し直す
必要がなく、準備作業の手間を簡略化できるとともに、
対象文書の範囲を大幅に拡大することができる。As described above, according to the present invention, it is possible to automatically understand the structure of the input document with a simple structure and automatically generate the document format. As in the conventional art, the work of creating the format information by humans is performed. Is unnecessary, and the tabular document printed in black can be read, so there is no need to reprint in dropout color, and the preparation work can be simplified.
The range of target documents can be greatly expanded.

また本発明によれば、定形的な入力文書の種類を自動的
に識別して予め記憶した書式情報を用いることができ、
異なる種類の定形的な文書の連続読み取りが可能とな
る。Further, according to the present invention, it is possible to automatically identify the type of a standard input document and use prestored format information,
It enables continuous reading of different types of standard documents.

[Brief description of drawings]

第１図は本発明の文書処理方式を実施する装置の構成を
示すブロツク図、第2,4図は第１図の制御部における処
理を説明するための流れ図、第３図は第２図における処
理内容を説明する図である。１……バス、２……制御部、３……文書、５……メモ
リ、６……キーボード、７……デイスプレイ、８……文
字認識部、９……出力装置。FIG. 1 is a block diagram showing the configuration of an apparatus for implementing the document processing system of the present invention, FIGS. 2 and 4 are flow charts for explaining the processing in the control unit of FIG. 1, and FIG. It is a figure explaining a processing content. 1 ... bus, 2 ... control unit, 3 ... document, 5 ... memory, 6 ... keyboard, 7 ... display, 8 ... character recognition unit, 9 ... output device.

───────────────────────────────────────────────────── フロントページの続き (72)発明者道野正雄神奈川県小田原市国府津2880番地株式会社日立製作所小田原工場内 (56)参考文献特開昭61−59568（ＪＰ，Ａ) 特開昭61−54569（ＪＰ，Ａ) 特開昭56−137480（ＪＰ，Ａ) ７ｔｈ．ＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＰａｔｔｅｒｎＲｅｃｏｇｎｉｔｉｏｎ，ＰｒｏｃｅｅｄｉｎｇｓＶｏｌ．１、Ｐ．347− 349，Ｊｕｌｙ30−Ａｕｇｕｓｔ２ 1984, “ＨＩＥＲＡＲＣＨＩＣＡＬＲＥＰＲＥＳＥＮＴＡＴＩＯＮＯＦＯＰＴＩＣＡＬＬＹＳＣＡＮＮＥＤＤＯＣＵＭＥＮＴＳ" ─────────────────────────────────────────────────── ─── Continuation of front page (72) Masao Dono Inventor Masao Dono 2880, Kozu, Odawara City, Kanagawa Prefecture Odawara Plant, Hitachi Ltd. (56) Reference JP 61-59568 (JP, A) JP 61 -54569 (JP, A) JP-A-56-137480 (JP, A) 7th. International Conference on Pattern Recognition, Proceedings Vol. 1, P.I. 347-349, July 30-August2 1984, "HIERARCHICAL REPRE SENTATION OF OPTICA LLY SCANNED DOCUMEN TS"

Claims

[Claims]

1. A means for holding inference information for determining the table structure of a tabular document, a means for inputting a document image, a means for extracting a frame in the document image, and a means for mutually extracting the extracted frames. Means for detecting positional relationship information of the frame, means for extracting an image of an area corresponding to the frame from the document image, means for displaying an image in a frame included in the area corresponding to the extracted frame, A means for displaying a plurality of attribute information for a frame as a selection item and selecting one attribute information from the plurality of displayed attribute information, the attribute information selected from the selection item, and the positional relationship information between the frames. And a means for applying the above inference information to determine table structure data of a tabular document, and a means for registering the table structure data.