JP4533422B2

JP4533422B2 - Image processing apparatus and method, and storage medium

Info

Publication number: JP4533422B2
Application number: JP2007305073A
Authority: JP
Inventors: 和之齋藤
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2007-11-26
Filing date: 2007-11-26
Publication date: 2010-09-01
Anticipated expiration: 2019-05-14
Also published as: JP2008109692A

Description

本発明は画像処理装置及び方法及び記憶媒体、より詳しくは文書画像を読み取り処理する画像処理装置及び方法及び記憶媒体に関するものである。 The present invention relates to an image processing apparatus and method and a storage medium, is rather more details to an image processing apparatus and method and a storage medium reading processing the document image.

従来、文書画像処理に関する技術としては、文書を光学的に入力し、全体の画像を印刷して出力する複写機や、文書を光学的に入力し、その文書画像を保存する文書データベースシステム、文書を光学的に入力し、ネットワーク或いは回線を介してその文書画像を出力するファクシミリ装置、文書を光学的に入力し、文字を認識してテキストコードを出力するＯＣＲ等の技術が存在する。 Conventionally, as a technique related to document image processing, a copying machine that optically inputs a document, prints and outputs the entire image, a document database system that optically inputs a document and stores the document image, and a document There are techniques such as a facsimile apparatus that optically inputs a document and outputs the document image via a network or a line, and an OCR that optically inputs a document, recognizes characters, and outputs a text code.

しかしながら、これまでの技術ではデジタル化やネットワーク化に適応できなくなっている。つまり、入力機器と出力機器がネットワークで接続されるようになったことや、文書のカラー化等により、
１）入力文書画像をそのまま保存、配信したのでは、データ量が大きくなる。
２）文書画像を一律に圧縮したのでは再利用に適した画質を保つことができない。
３）白黒プリンターやカラープリンター等の出力機器の違いによっては出力画質が劣化する。
４）文字認識処理（ＯＣＲ）を用いてテキストのみを配信すると図や写真等の情報の欠落がある。
５）ＯＣＲによって誤認識があった場合には、意味の欠落がある。 However, conventional technologies cannot adapt to digitization and networking. In other words, due to the fact that input devices and output devices are now connected via a network, and colorization of documents,
1) If the input document image is stored and distributed as it is, the amount of data increases.
2) If the document image is uniformly compressed, the image quality suitable for reuse cannot be maintained.
3) The output image quality deteriorates depending on the output device such as a monochrome printer or a color printer.
4) When only text is distributed using character recognition processing (OCR), there is a lack of information such as figures and photographs.
5) When there is a misrecognition by OCR, there is a lack of meaning.

という問題がクローズアップされるようになった。 The problem has been closed up.

第１の発明はかかる問題点に鑑みなされたものであり、原稿のレイアウトを保持しながら、データ量を少なくでき、しかも、再生出力する際に画質劣化することを抑えることを可能ならしめる画像処理装置及び画像再生装置及びシステム及び方法及び記憶媒体を提供しようとするものである。 The first invention has been made in view of such problems, and image processing that makes it possible to reduce the amount of data while maintaining the document layout and to suppress deterioration in image quality during reproduction and output. An apparatus, an image reproducing apparatus, a system, a method, and a storage medium are provided.

また、第２の発明は、上記第１の発明に加えてセキュリティーに優れた画像処理装置及び画像再生装置及びシステム及び方法及び記憶媒体を提供しようとするものである。 The second invention is intended to provide an image processing apparatus, an image reproducing apparatus, a system, a method, and a storage medium excellent in security in addition to the first invention.

そして、第３の発明は、上記第１の発明に加えて、画像中のテキストについて自然言語の違いを吸収できる画像処理装置及び画像再生装置及びシステム及び方法及び記憶媒体を提供しようとするものである。 In addition to the first invention, the third invention is to provide an image processing apparatus, an image reproducing apparatus, a system, a method, and a storage medium that can absorb a difference in natural language for text in an image. is there.

上記課題を解決するため、例えば第１，第２の発明の課題を解決する画像処理装置は以下の構成を備える。すなわち、
文書の多値画像データを入力する入力手段と、
前記入力した多値画像データから二値画像データを生成する二値画像生成手段と、
前記生成した二値画像データを属性毎の領域に分割し、当該分割した各領域のレイアウト情報を生成するレイアウト解析手段と、
前記レイアウト情報に基づいて、テキスト属性の領域について前記二値画像データから対応する領域の部分画像を切り取る第１部分画像切取手段と、
前記レイアウト情報に基づいて、非テキスト属性の領域について前記多値画像データから対応する領域の部分画像を切り取る第２部分画像切取手段と、
オペレータの指示に基づいて、暗号化対象とする属性を設定する暗号化対象属性設定手段と、
前記第１部分画像切取手段あるいは前記第２部分画像切取手段で切り取られる部分画像のうち、前記暗号化対象属性設定手段で暗号化対象として設定された属性の領域に対応する部分画像を暗号化して暗号化データを生成する暗号化手段と、
前記暗号化対象の領域に対応する前記暗号化手段で生成された暗号化データと、前記暗号化対象属性設定手段で暗号化対象として設定されなかった属性の領域に対応する前記第１部分画像切取手段あるいは前記第２部分画像切取手段で切り取られた部分画像と、前記レイアウト情報とを保存する保存手段とを備える。 In order to solve the above problems, for example, an image processing apparatus that solves the problems of the first and second inventions has the following configuration. That is,
Input means for inputting multivalued image data of a document;
Binary image generation means for generating binary image data from the input multi-value image data;
A layout analysis unit that divides the generated binary image data into regions for each attribute, and generates layout information for each of the divided regions;
First partial image cutting means for cutting a partial image of a corresponding area from the binary image data for a text attribute area based on the layout information;
Based on the layout information, second partial image cutting means for cutting a partial image of a corresponding area from the multi-valued image data for a non-text attribute area;
Encryption target attribute setting means for setting an attribute to be encrypted based on an instruction from the operator;
Of the partial images cut by the first partial image cutting unit or the second partial image cutting unit, a partial image corresponding to the attribute area set as the encryption target by the encryption target attribute setting unit is encrypted. An encryption means for generating encrypted data;
The first partial image cutout corresponding to the encrypted data generated by the encryption means corresponding to the encryption target area and the attribute area not set as the encryption target by the encryption target attribute setting means Or a saving means for saving the partial image cut by the second partial image cutting means and the layout information.

本発明によれば、原稿のレイアウトを保持しながら、データ量を少なくでき、しかも、再生出力する際に画質劣化することを抑えることが可能になる。更に、セキュリティーに優れたものとすることも可能になる。 According to the present invention, it is possible to reduce the amount of data while maintaining the original layout, and to suppress deterioration in image quality when reproducing and outputting. Furthermore, it is possible to make it excellent in security.

以下、添付図面に従って本発明に係る実施形態を詳細に説明する。 Hereinafter, embodiments according to the present invention will be described in detail with reference to the accompanying drawings.

［第１の実施形態］
図１は第１の実施形態における構成概念とデータの流れを示す図である。 [First Embodiment]
FIG. 1 is a diagram illustrating a configuration concept and a data flow in the first embodiment.

同図において、１０１は入力される対象となる印刷物やコンピュータ上で扱う画像データ等を含む入力文書、１０２は文書を画像データとして入力する画像入力部（例えばイメージスキャナ）、１０３は画像入力部から入力された元画像データ、１０４は入力文書画像から二値画像を作成する二値化処理部、１０５は作成された二値画像データ、１０６は二値画像１０５をもとにして文書を、図、テキスト、表等の各種属性毎の領域に分割した情報、すなわちレイアウト解析データを出力するレイアウト解析部、１０７はレイアウト解析データである。 In the figure, reference numeral 101 denotes an input document including a printed matter to be input and image data handled on a computer, 102 denotes an image input unit (for example, an image scanner) for inputting the document as image data, and 103 denotes from the image input unit. The input original image data, 104 is a binarization processing unit for generating a binary image from the input document image, 105 is the generated binary image data, 106 is a document based on the binary image 105, FIG. The layout analysis unit 107 outputs information divided into areas for each attribute such as text and table, that is, layout analysis data, and 107 is layout analysis data.

また、１０８はレイアウト解析データをもとにして、二値画像の任意の領域の文字を認識しコード情報等を文字認識データとして出力する文字認識部、１０９は文字認識データ、１１０はレイアウト解析データをもとにして、入力文書元画像及び二値画像から適応的に保存条件を変更しながら画像データおよび文字認識データを文書画像理解データとして保存する文書画像理解データ保存部、１１１は文書画像理解データを出力する文書画像理解データ出力部、１１２は文書画像理解データ、１１３は文書画像理解データを入力する文書画像理解データ入力部、１１４は文書画像理解データから適応的に再生条件を変更しながら再生文書画像を作成する文書画像理解再生部、１１５は再生文書画像、１１６は再生文書画像を出力する画像出力部、１１７は出力文書である。 A character recognition unit 108 recognizes characters in an arbitrary area of the binary image based on the layout analysis data and outputs code information or the like as character recognition data, 109 a character recognition data, and 110 a layout analysis data. A document image comprehension data storage unit 111 that stores image data and character recognition data as document image comprehension data while adaptively changing the storage conditions based on the input document original image and binary image, 111 is a document image comprehension Document image comprehension data output unit for outputting data, 112 is document image comprehension data, 113 is a document image comprehension data input unit for inputting document image comprehension data, and 114 is while adaptively changing the reproduction conditions from the document image comprehension data Document image understanding / reproducing unit for creating a reproduced document image, 115 a reproduced document image, and 116 an image output for outputting the reproduced document image , 117 is an output document.

また、図２は第１の実施形態を表わすシステム装置構成の例を示している。 FIG. 2 shows an example of a system apparatus configuration representing the first embodiment.

同図において、２０１は文書画像理解処理（例えば図１における文書画像理解データ出力部１１１までの処理）を行うコンピュータ、２０２はデータを保存する記憶装置、２０３は画像入力を専用的に行うスキャナ、２０４はカラープリンタ、２０５はファクシミリ、２０６はカラー画像の入力及びプリント出力を複合的に行うデジタルカラー複写機（カラースキャナとカラープリンタとしての機能を有する）、２０７はプリンタを管理するプリンタサーバ、２０８はモノクロプリンタ、２０９はデータベースを管理するファイルサーバ、２１０はデータべ−ス、２１１はモノクロ画像の入力及びプリント出力を複合的に行うデジタル複写機、２１２は各デジタル機器間を接続するネットワークである。 In the figure, 201 is a computer that performs document image understanding processing (for example, processing up to the document image understanding data output unit 111 in FIG. 1), 202 is a storage device that stores data, 203 is a scanner dedicated to image input, Reference numeral 204 denotes a color printer; 205, a facsimile; 206, a digital color copier (having functions as a color scanner and a color printer) that inputs and outputs color images in a composite manner; 207, a printer server that manages the printer; Is a monochrome printer, 209 is a file server that manages a database, 210 is a database, 211 is a digital copier that inputs and outputs monochrome images in combination, and 212 is a network that connects the digital devices. .

次に処理の流れについて図３乃至図１０のフローチャート、及び図１、図２、図１４乃至図１７に従って説明する。 Next, the processing flow will be described with reference to the flowcharts of FIGS. 3 to 10 and FIGS. 1, 2, and 14 to 17.

本実施形態では、原稿画像を２４ビット（ＲＧＢそれぞれ８ビット）多値画像データとしてスキャナ（２０３や２０６）より入力し、ネットワークで配信して、配信先でモノクロプリンター（２０８や２１１あるいは２０５）に出力する場合の画像理解処理システムについて説明する。 In this embodiment, a document image is input as 24-bit (RGB each 8 bits) multi-value image data from a scanner (203 or 206), distributed over a network, and sent to a monochrome printer (208, 211, or 205) at a distribution destination. An image understanding processing system for output will be described.

まず、コンピュータ２０１は、図３のステップＳ３０１において、画像入力部１０２（図１におけるスキャナ２０３）より文書画像を多値画像として入力する。 First, in step S301 in FIG. 3, the computer 201 inputs a document image as a multi-value image from the image input unit 102 (scanner 203 in FIG. 1).

次に、ステップＳ３０２で、二値化処理部１０４により二値画像を作成する。二値画像の作成方法は、読み取った画像データＲＧＢ（各８ビット）から輝度成分を算出し、その算出した輝度を所定の閾値と比較することで行うものとする。 In step S302, the binarization processing unit 104 creates a binary image. The binary image creation method is performed by calculating a luminance component from the read image data RGB (8 bits each) and comparing the calculated luminance with a predetermined threshold value.

ステップＳ３０３では、こうして作成された二値画像をもとに、レイアウト解析部１０６により、文書画像を図やテキストや表等の各種属性毎の領域に分割しレイアウト情報（図１５）を出力する。レイアウト情報は、図１５（ａ）に示す如く、１ページの原稿画像に対し、複数の矩形領域（部分画像）が設定され、部分画像の個数のデータが格納される。各部分画像は、識別情報（レイアウト領域ＩＤ）、切り取り幅、高さ、画像タイプ、圧縮方式、画像サイズ、そして切り取り画像データで構成されている。画像タイプには、同図（ｂ）の如く、二値画像か２４ビット多値画像かを識別情報、圧縮方式は同図（ｃ）の如くＭＭＲ，ＪＰＥＧ等がある。なお、画像タイプ、圧縮方式の種類及び数はこれ以外であっても良いのは勿論である。 In step S303, based on the binary image thus created, the layout analysis unit 106 divides the document image into regions for various attributes such as diagrams, texts, and tables, and outputs layout information (FIG. 15). In the layout information, as shown in FIG. 15A, a plurality of rectangular areas (partial images) are set for one page of document image, and data of the number of partial images is stored. Each partial image includes identification information (layout area ID), cutout width, height, image type, compression method, image size, and cutout image data. The image type includes identification information as to whether it is a binary image or a 24-bit multi-valued image as shown in FIG. 7B, and the compression method includes MMR, JPEG, etc. as shown in FIG. Of course, other types and numbers of image types and compression methods may be used.

次にステップＳ３０４においてテキストや表等の文字を含む領域の文字画像に対し文字認識（文字認識）処理を行い、ステップＳ３０５で、文書画像理解データ保存部（１０８）においてレイアウト解析データをもとにして、文字認識データならびに、入力文書元画像である多値画像及び二値画像から適応的に保存条件を変更しながら文書画像理解データを作成し保存する。 In step S304, character recognition (character recognition) processing is performed on a character image in an area including characters such as text and a table. In step S305, the document image understanding data storage unit (108) uses the layout analysis data as a basis. Then, the document image understanding data is created and stored while adaptively changing the storage conditions from the character recognition data and the multi-value image and binary image that are the input document original images.

ここで、具体的な文書画像理解データ保存処理の例を図５、図６、図７，図８のフローチャートに沿って説明する。 Here, a specific example of document image understanding data storage processing will be described with reference to the flowcharts of FIGS. 5, 6, 7, and 8.

本例では、文書画像理解データ保存処理（Ｓ３０５）に入ると、まずステップＳ５０１で各分割領域のレイアウト解析データを分析して各領域の領域属性ならびに文書レイアウトタイプを分類する。 In this example, when the document image understanding data storage process (S305) is entered, layout analysis data of each divided area is first analyzed in step S501 to classify area attributes and document layout types of each area.

文書タイプの分類の例については図６に一例を示す。 An example of document type classification is shown in FIG.

まず、ステップＳ６０１において、文書全体の面積（ＡＳ１）を算出する。 First, in step S601, the area (AS1) of the entire document is calculated.

次にステップＳ６０２において、領域属性が『テキスト』および『表』の領域面積の総和（ＤＳ１）を算出し、ステップＳ６０３において、総和面積（ＤＳ１）が文書全体の面積（ＡＳ１）に対して、あらかじめ定めた割合（ＴＳ１［％］）より大である。すなわち
ＤＳ１＞ＡＳ１×ＴＳ１／１００
であれば、その文書は「テキスト系」レイアウト文書と判断する（ステップＳ６０４）。 Next, in step S602, the total area area (DS1) of the area attributes “text” and “table” is calculated. In step S603, the total area (DS1) is preliminarily calculated with respect to the entire document area (AS1). It is larger than the set ratio (TS1 [%]). That is, DS1> AS1 × TS1 / 100
If so, the document is determined to be a “text-based” layout document (step S604).

また、ステップＳ６０３において、テキスト系と判断されない場合に、領域属性『線図形』および『絵、写真』の領域面積の総和（ＰＳ１）を算出し（ステップＳ６０５）、ステップＳ６０６において、先に算出した『テキスト』および『表』の領域面積の総和（ＤＳ１）が領域属性『線図形』および『絵、写真』の総和面積（ＰＳ１）よりも大の場合、
すなわち
ＤＳ１＞ＰＳ１
であれば、その文書は「テキスト系」レイアウト文書と判断する。（ステップＳ６０４）
上記以外の場合であれば、ステップＳ６０７において、その文書は「ピクチャー系」レイアウト文書と判断する。 In step S603, if the text type is not determined, the total area area (PS1) of the area attributes “line figure” and “picture, photograph” is calculated (step S605), and is previously calculated in step S606. When the total area (DS1) of “text” and “table” is larger than the total area (PS1) of area attributes “line figure” and “picture, photograph”,
That is, DS1> PS1
If so, the document is determined to be a “text-based” layout document. (Step S604)
Otherwise, in step S607, the document is determined to be a “picture system” layout document.

図５に戻って、上記のようにして文書レイアウトタイプが決定されると、ステップＳ５０２に進み、それぞれのレイアウトタイプによって分岐する。文書レイアウトタイプが「ピクチャー系」である場合は、ステップＳ５０３において、ピクチャー系文書画像理解データ保存処理を行う。 Returning to FIG. 5, when the document layout type is determined as described above, the process proceeds to step S502, and the process branches according to each layout type. If the document layout type is “picture type”, a picture type document image understanding data storage process is performed in step S503.

図７を用いて、このピクチャー系文書画像理解データ保存処理について更に詳細に説明する。 The picture-based document image understanding data storage process will be described in more detail with reference to FIG.

ピクチャー系文書画像理解データ保存処理においてはまず、ステップＳ７０１で各分割領域のレイアウト解析データを再度分析して、次に、ステップＳ７０２において領域属性が『テキスト』であれば、ステップＳ７０３で切り取り先全体画像として二値画像を選択し、ステップＳ７０５でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 In the picture-based document image comprehension data saving process, first, the layout analysis data of each divided area is analyzed again in step S701. Next, if the area attribute is “text” in step S702, the entire cut-out destination is determined in step S703. A binary image is selected as an image, and a partial image is cut out using the coordinate data of the area of the layout analysis data in step S705.

また、ステップＳ７０２において、領域属性が『テキスト』でなかった場合は、ステップＳ７０４において切り取り先全体画像として元画像である多値画像を選択し、ステップＳ７０５でレイアウト解析データの領域の座標データを用いてその部分画像である多値画像を切り取る。 If the region attribute is not “text” in step S702, a multi-valued image that is the original image is selected as the entire cut-out image in step S704, and the coordinate data of the region of the layout analysis data is used in step S705. The multi-valued image that is the partial image is cut out.

次にステップＳ７０６において切り取った部分画像の画像タイプを調べ、二値画像タイプであれば、ステップＳ７０７において二値画像用の圧縮処理を施し（例えばＭＭＲ等）、圧縮した部分画像をステップＳ７０９において文書画像理解データ（図１４）として保存処理を行う。 Next, the image type of the partial image cut out in step S706 is checked. If it is a binary image type, a binary image compression process is performed in step S707 (for example, MMR), and the compressed partial image is converted into a document in step S709. Storage processing is performed as image understanding data (FIG. 14).

このとき、文書画像理解データに圧縮方式も記録しておく事により、文書画像理解再生処理において圧縮画像を復号可能とする。 At this time, by recording the compression method in the document image understanding data, the compressed image can be decoded in the document image understanding reproduction process.

また、ステップＳ７０６において調べた結果が、多値画像タイプであったならば、ステップＳ７０８において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をステップＳ７０９において文書画像理解データ（図１４）として保存処理を行う。 If the result checked in step S706 is a multi-value image type, multi-value image compression processing is performed in step S708 (for example, JPEG), and the compressed partial image is converted into document image understanding data in step S709. The storage process is performed as shown in FIG.

さらに詳細な本例の文書画像理解データの例は、図１４、図１５および図１６に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび圧縮した切り取り画像を部分画像情報として保存する。 As shown in FIG. 14, FIG. 15 and FIG. 16, the example of the detailed document image comprehension data of this example uses the appearance order in the layout analysis data of the target layout area data as the layout area data ID. The cut width, the cut height, the image type, the compression method, the image size, and the compressed cut image are stored as partial image information.

次にステップＳ７１０において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ７０１に戻り、残りの分割領域について上記の処理を繰り返し、全ての分割領域に対して行うと、文書画像理解データ保存処理を終了する。 Next, in step S710, it is checked whether or not there is a remaining divided area. If there is, the process returns to step S701, and the above processing is repeated for the remaining divided areas. The document image understanding data storage process is terminated.

図５に戻る。同図のステップＳ５０２において文書レイアウトタイプが「テキスト系」である場合は、処理はステップＳ５０４に進み、テキスト系文書画像理解データ保存処理を行う。 Returning to FIG. If the document layout type is “text type” in step S502 in the figure, the process proceeds to step S504, and text type document image understanding data storage processing is performed.

この文書タイプがテキスト系の場合の文書画像理解保存処理を図８のフローチャート、及び図１３，１５に従って説明する。 Document image understanding storage processing when the document type is text-based will be described with reference to the flowchart of FIG. 8 and FIGS.

本例では、文書画像理解保存処理（図３のステップＳ３０５）に入ったら、まずステップＳ８０１で各分割領域のレイアウト解析データを再度分析して各領域の領域属性を求める。 In this example, when the document image understanding storage process (step S305 in FIG. 3) is entered, first, in step S801, the layout analysis data of each divided area is analyzed again to obtain the area attribute of each area.

次に、ステップＳ８０２において領域属性が『テキスト』であれば、ステップＳ８０３でその領域の文字認識データを選択し、ステップＳ８０４でその文字認識データを文書画像理解データとして（図１４）として保存する。 Next, if the area attribute is “text” in step S802, the character recognition data of the area is selected in step S803, and the character recognition data is stored as document image understanding data (FIG. 14) in step S804.

さらに詳細な本例の文書画像理解データの例は、図１７に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、行数、行間隔、第一位から第ｎ位までの文字コード、第一位から第ｎ位までの文字認識距離、文字フォントタイプ、文字サイズ、文字位置の座標、不確実フラグ、等を文字認識データとして保存する。 As shown in FIG. 17, the more detailed example of the document image comprehension data of this example uses the appearance order in the layout analysis data of the target layout area data as the ID of the layout area data, and the number of lines and the line spacing. , Character codes from 1st to nth, character recognition distance from 1st to nth, character font type, character size, character position coordinates, uncertainty flag, etc. are stored as character recognition data .

また、ステップＳ８０２で領域属性が『テキスト』でなかった場合は、ステップＳ８０５において切り取り先全体画像として元画像を選択し、ステップＳ８０６でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 If the area attribute is not “text” in step S802, the original image is selected as the entire cut-out image in step S805, and the partial image is cut using the coordinate data of the area of the layout analysis data in step S806.

ステップＳ８０７において、本例の場合、元画像は多値画像であると判定されるので、ステップＳ８０８において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をステップＳ８０９において文書画像理解データ（図１４）として保存処理を行う。 In step S807, in this example, it is determined that the original image is a multi-valued image. Therefore, in step S808, compression processing for a multi-valued image is performed (for example, JPEG), and the compressed partial image is converted into a document in step S809. Storage processing is performed as image understanding data (FIG. 14).

さらに詳細な本例の文書画像理解データの例は、図１６に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび圧縮した切り取り画像を部分画像情報として保存する。 As shown in FIG. 16, the more detailed example of the document image understanding data of this example uses the appearance rank in the layout analysis data of the target layout area data as the layout area data ID, and the cut width and cut height. The image type, compression method, image size, and compressed cut image are stored as partial image information.

次にステップＳ８１０において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ８０１に戻り、残りの分割領域について上記の処理を繰り返す。残りの分割領域が存在していなければ、文書画像理解保存処理を終了する。 Next, in step S810, it is checked whether or not there is a remaining divided area. If there is, the process returns to step S801, and the above processing is repeated for the remaining divided areas. If there is no remaining divided area, the document image understanding storage process is terminated.

以上のようにして、文書画像に対するレイアウトタイプ毎の理解データが保存されると、処理は図５のステップＳ５０５に進み、保存された文書画像理解データを出力する。 As described above, when the understanding data for each layout type for the document image is stored, the process proceeds to step S505 in FIG. 5 and the stored document image understanding data is output.

さらに、本実施形態では文書画像理解データ保存部（１１０）において作成・保存された文書画像理解データは文書画像理解データ出力部（１１１）によりネットワーク上に出力され、ネットワークを介して利用先に配信される。 Furthermore, in this embodiment, the document image understanding data created and stored in the document image understanding data storage unit (110) is output to the network by the document image understanding data output unit (111), and is distributed to the usage destination via the network. Is done.

利用先（クライアントでもある）では、図４に示すように、ステップＳ４０１において文書画像理解データ入力部（１１３）により文書画像理解データを入力し、ステップＳ４０２において文書画像理解データから適応的に再生条件を変更しながら再生文書画像を作成する。 As shown in FIG. 4, the user (also a client) inputs document image understanding data by the document image understanding data input unit (113) in step S401, and adaptively reproduces the reproduction conditions from the document image understanding data in step S402. Create a replay document image while changing.

この具体的な文書画像理解データ再生処理の例を図９に従って説明する。 A specific example of the document image understanding data reproduction process will be described with reference to FIG.

まず、ステップＳ９０１において再生文書画像の下地となる画像を白い下地となるように作成する。 First, in step S901, an image that is a background of a reproduced document image is created to be a white background.

次に、ステップＳ９０２において文書画像理解データを分析し、ステップＳ９０３で、部分領域属性の検査を行い、属性が『テキスト』であるならば、ステップＳ９０４において文字認識データが存在するか否かを検査する。 Next, in step S902, the document image understanding data is analyzed. In step S903, the partial region attribute is inspected. If the attribute is “text”, it is inspected in step S904 whether character recognition data exists. To do.

文字認識データが存在するならば、ステップＳ９０５において文書画像理解データから文字認識データ（文字コードを含む）を取り出し、ステップＳ９０６で、取り出した文字認識データから文字フォントを白い下地画像と合成し再生画像を作成する。 If character recognition data exists, character recognition data (including a character code) is extracted from the document image comprehension data in step S905. In step S906, a character font is combined with the white base image from the extracted character recognition data. Create

また、ステップＳ９０３の検査で部分領域の属性が非『テキスト』であるか、ステップＳ９０４の検査で文字認識データが存在しないならば、ステップＳ９０７において文書画像理解データから部分領域の画像データを取り出し、ステップＳ９０８において、取り出した部分画像とその座標データから白い下地画像と合成し再生画像を作成する。 If the attribute of the partial area is non- “text” in the inspection in step S903 or no character recognition data exists in the inspection in step S904, the image data of the partial area is extracted from the document image understanding data in step S907. In step S908, a reproduced image is created by combining the extracted partial image and its coordinate data with a white background image.

再生画像合成処理（ステップＳ９０８）の一例としては、図１０に示すように、ステップＳ１００１において文書画像理解データから部分画像の画像タイプを抽出し、ステップＳ１００２において、画像タイプが『二値画像タイプ』であれば、ステップＳ１００３において、二値画像の黒、白画素をそれぞれ２４ビット多値画像の黒、白画素に変換する疑似２４ビット化の処理を行う。 As an example of the reproduction image composition processing (step S908), as shown in FIG. 10, the image type of the partial image is extracted from the document image understanding data in step S1001, and the image type is “binary image type” in step S1002. If so, in step S1003, pseudo 24-bit conversion processing is performed to convert black and white pixels of the binary image into black and white pixels of a 24-bit multi-valued image, respectively.

本例では、二値画像の黒画素は１、白画素は０、２４ビット多値画像の黒画素は、Ｒ＝０，Ｇ＝０，Ｂ＝０であり、白画素はＲ＝２５５，Ｇ＝２５５，Ｂ＝２５５（ＲはＲｅｄ成分、ＧはＧｒｅｅｎ成分、ＢはＢｌｕｅ成分で各８ビットの値を持つ。）としている。 In this example, the black pixel of the binary image is 1, the white pixel is 0, the black pixel of the 24-bit multi-valued image is R = 0, G = 0, B = 0, and the white pixel is R = 255, G = 255, B = 255 (R is a Red component, G is a Green component, and B is a Blue component each having an 8-bit value).

またステップＳ１００２において、画像タイプが『２４ビット多値画像タイプ』であればそのままの部分画像を合成に用いる。 In step S1002, if the image type is “24-bit multi-value image type”, the partial image as it is is used for synthesis.

つぎに、ステップＳ１００４において下地画像と部分画像に対し部分画像の各画素の論理演算を行い合成画像を作成する。 Next, in step S1004, a logical operation is performed on each pixel of the partial image with respect to the base image and the partial image to create a composite image.

本例では、下地画像の白画素（Ｒ＝２５５，Ｇ＝２５５，Ｂ＝２５５）と部分画像の黒画素（Ｒ＝０，Ｇ＝０，Ｂ＝０）の合成画像が黒画素（Ｒ＝０，Ｇ＝０，Ｂ＝０）となるように論理演算を用いている。 In this example, a composite image of white pixels (R = 255, G = 255, B = 255) of the background image and black pixels (R = 0, G = 0, B = 0) of the partial image is a black pixel (R = 255). (0, G = 0, B = 0).

以上のような処理によって１つの部分画像に対して再生合成処理（Ｓ９０８）を終了したら、ステップＳ９０９において残りの部分領域が存在しているか否かを調べ、存在していればステップＳ９０２に戻り、残りの部分領域について上記の処理を繰り返す。残りの部分領域が存在していなければ、文書画像理解データ再生処理を終了する。 When the reproduction synthesis processing (S908) is completed for one partial image by the above processing, it is checked in step S909 whether or not there is a remaining partial area, and if it exists, the process returns to step S902. The above process is repeated for the remaining partial areas. If there is no remaining partial area, the document image understanding data reproduction process is terminated.

図４に戻って、上記のようにして再生処理が行われると、ステップＳ４０３において画像出力部（１１６）であるモノクロプリンターから再生画像を出力文書として出力する。 Returning to FIG. 4, when the reproduction process is performed as described above, the reproduction image is output as an output document from the monochrome printer which is the image output unit (116) in step S403.

［第２の実施形態］
文書画像理解データ保存処理（ステップＳ３０５）において、レイアウト解析結果で領域属性がテキストとなった文書でも、画像の状態が文字認識処理に不適切な状態（ノイズ存在や、低解像度画像等）である場合に、不確実な文字認識データではなく確実な画像データを保存することも可能である。 [Second Embodiment]
In the document image understanding data storage process (step S305), even in a document whose area attribute is text as a result of the layout analysis, the image state is inadequate for the character recognition process (the presence of noise, a low resolution image, etc.). In some cases, it is possible to store reliable image data instead of uncertain character recognition data.

以下に具体的な文書画像理解データ保存処理の他の例を図１１のフローチャートにしたがって説明する。 Another example of specific document image understanding data storage processing will be described below with reference to the flowchart of FIG.

本例では、文書画像理解データ保存処理（ステップＳ３０５）に入ったら、まずステップＳ１１０１で各分割領域のレイアウト解析データを分析して各領域の領域属性ならびに文書レイアウトタイプを分類する（図６）。 In this example, when the document image understanding data storage process (step S305) is entered, the layout analysis data of each divided area is first analyzed in step S1101 to classify the area attributes and document layout type of each area (FIG. 6).

ここで、ステップＳ１１０２においてピクチャー系文書レイアウトと判定された場合は、ステップＳ１１０６においてピクチャー系文書レイアウトとして文書画像理解データ保存処理（図７）を実行する。 If it is determined in step S1102 that the picture-type document layout is determined, a document image understanding data storage process (FIG. 7) is executed as a picture-type document layout in step S1106.

また、ステップＳ１１０２においてテキスト系文書レイアウトと判定された場合は、ステップＳ１１０３において全体文字認識信頼度（ＺＮｒ）を算出する。 If it is determined in step S1102 that the text-based document layout is determined, the overall character recognition reliability (ZNr) is calculated in step S1103.

全体文字認識信頼度の一例としては、各文字ごとの認識演算結果である認識距離値（Ｄ）が、あらかじめ定めた閾値（Ｔｄ）未満である場合、
すなわち
Ｄ＜Ｔｄ
である場合にその文字に対して不確実な文字であるとして文字認識データの不確実フラグを有効（＝“１”）にし、その不確実フラグが有効でない（＝“０”）文字数の和（ｎ）を求め、全体の文字数（Ｎ）に対する割合（ｎ／Ｎ）を全体文字認識信頼度（ＺＮｒ）としている。
すなわち
ＺＮｒ＝ｎ／Ｎ
として求める。 As an example of the overall character recognition reliability, when the recognition distance value (D) that is the recognition calculation result for each character is less than a predetermined threshold (Td),
That is, D <Td
If it is, the uncertain flag of the character recognition data is made valid (= “1”) and the uncertain flag is not valid (= “0”). n) is obtained, and the ratio (n / N) to the total number of characters (N) is defined as the total character recognition reliability (ZNr).
That is, ZNr = n / N
Asking.

次にステップＳ１１０３において、全体文字認識信頼度（ＺＮｒ）を検査し、全体文字認識信頼度があらかじめ定めた閾値（Ｔｒ）より大、
すなわち
ＺＮｒ＞Ｔｒ
であればＯＫとし、ステップＳ１１０５においてテキスト系文書レイアウトとして文書画像理解データ保存処理（図８）を実行する。 Next, in step S1103, the overall character recognition reliability (ZNr) is inspected, and the overall character recognition reliability is greater than a predetermined threshold (Tr).
That is, ZNr> Tr
If YES, the document image understanding data storage process (FIG. 8) is executed as a text document layout in step S1105.

また、
ＺＮｒ≦Ｔｒ
であればＮＧとし、ステップＳ１１０６においてピクチャー系文書レイアウトとして文書画像理解データ保存処理（図７）を実行する。 Also,
ZNr ≦ Tr
If YES, the document image understanding data storage process (FIG. 7) is executed as a picture document layout in step S1106.

以上の結果、原稿文書の品位が悪い、或いはそこに印刷されている文字品位が悪い場合には、誤認識される率が高くなるので、認識結果を保存せず、ピクチャー系文書とすることが可能になる。従って、クライアント側で再生する場合にはオリジナルの原稿画像に忠実な再生が可能になる。 As a result, if the quality of the original document is poor or the character quality printed on it is poor, the rate of misrecognition increases, so that the recognition result is not saved and the picture document can be made. It becomes possible. Accordingly, when reproducing on the client side, it is possible to reproduce the original document image faithfully.

［第３の実施形態］
文書画像理解データ保存処理（ステップＳ３０５）において、レイアウト解析結果で領域属性がテキストとなった文書でも、部分領域毎に画像の状態が文字認識処理に不適切な状態（ノイズ存在や、低解像度画像等）であるか否かを判断し、不適切な画像と判断した場合に、不確実な文字認識データだけではなく確実な画像データをも併せて保存することも可能である。 [Third Embodiment]
In the document image understanding data storage process (step S305), even in a document whose area attribute is text in the layout analysis result, the state of the image for each partial area is inappropriate for the character recognition process (the presence of noise or low resolution images). If it is determined that the image is inappropriate, it is possible to store not only unreliable character recognition data but also reliable image data.

以下に具体的な文書画像理解データ保存処理の例を図１２のフローチャートにしたがって説明する。 A specific example of document image understanding data storage processing will be described with reference to the flowchart of FIG.

本例では、文書画像理解データ保存処理（ステップＳ３０５）に入ったら、まず図５のように各分割領域のレイアウト解析データを分析して各領域の領域属性ならびに文書レイアウトタイプを分類する。 In this example, when the document image understanding data storage process (step S305) is entered, the layout analysis data of each divided area is first analyzed as shown in FIG. 5 to classify the area attributes and document layout type of each area.

つぎにテキスト系文書レイアウトと判断された場合に、ステップＳ１２０１においてレイアウト解析データを再度分析し、ステップＳ１２０２で部分領域属性の検査を行い、属性が『テキスト』であるならば、ステップＳ１２０３で文字認識データを選択し、ステップＳ１２０４で文字認識データ保存処理を行った上、さらにステップＳ１２０５で部分領域の文字認識信頼度（ＺＭｒ）を算出する。 Next, when it is determined that the layout is a text-based document layout, the layout analysis data is analyzed again in step S1201, and the partial region attribute is checked in step S1202. If the attribute is “text”, character recognition is performed in step S1203. Data is selected, character recognition data storage processing is performed in step S1204, and character recognition reliability (ZMr) of the partial area is calculated in step S1205.

部分領域文字認識信頼度の一例としては、各文字ごとの認識演算結果である認識距離値（Ｄ）が、あらかじめ定めた閾値（Ｔｄ）未満である場合、
すなわち
Ｄ＜Ｔｄ
である場合にその文字に対して不確実な文字であるとして文字認識データの不確実フラグを有効（１）にし、その部分領域内における、不確実フラグが有効でない（０）文字数の和（ｍ）を求め、その領域内の文字数（Ｍ）に対する割合（ｍ／Ｍ）を部分領域文字認識信頼度（ＺＭｒ）として用いる。
すなわち
ＺＭｒ＝ｍ／Ｍ
として求める。 As an example of the partial region character recognition reliability, when the recognition distance value (D) that is the recognition calculation result for each character is less than a predetermined threshold (Td),
That is, D <Td
In this case, the uncertain flag of the character recognition data is made valid (1) and the uncertain flag is not valid in the partial area (0). ) And the ratio (m / M) to the number of characters (M) in the region is used as the partial region character recognition reliability (ZMr).
That is, ZMr = m / M
Asking.

次にステップＳ１２０６において、部分領域文字認識信頼度（ＺＭｒ）を検査し、部分領域文字認識信頼度があらかじめ定めた閾値（Ｔｒ）より大、
すなわち
ＺＭｒ＞Ｔｒ
であればＯＫとし、画像データの保存処理は実行せずに、ステップＳ１２１４において残りの部分領域が存在しているか否かを調べ、存在していればステップＳ１２０１に戻り、残りの部分領域について上記の処理を繰り返す。 Next, in step S1206, the partial area character recognition reliability (ZMr) is inspected, and the partial area character recognition reliability is greater than a predetermined threshold (Tr).
That is, ZMr> Tr
If not, the image data storage process is not executed, and it is checked in step S1214 whether or not there is a remaining partial area. If there is, the process returns to step S1201, and the remaining partial areas are described above. Repeat the process.

残りの部分領域が存在していなければ、文書画像理解データ保存処理を終了する。
また、
ＺＭｒ≦Ｔｒ
であればＮＧとし、ステップＳ１２０７以降における画像データ保存処理を実行する。 If there is no remaining partial area, the document image understanding data storage process is terminated.
Also,
ZMr ≦ Tr
If not, it is determined as NG, and the image data storage processing in step S1207 and subsequent steps is executed.

［第４の実施形態］
文書画像理解再生処理時に文字認識データの信頼度に応じて文字フォントを合成するか、領域文字画像を合成するかを適応的に選択し処理することも可能である。 [Fourth Embodiment]
It is also possible to adaptively select whether to synthesize a character font or to synthesize a region character image in accordance with the reliability of the character recognition data during the document image understanding reproduction process.

以下に処理の流れについて図４，図１３のフローチャートに従って説明する。 The processing flow will be described below with reference to the flowcharts of FIGS.

本例では部分領域毎に文字認識信頼度を求め文字フォントで出力するか領域画像で出力するかを選択する。また、文書画像領域データとして文字認識データおよび画像データの両方を持っているものとする。 In this example, the character recognition reliability is obtained for each partial region, and whether to output in character font or region image is selected. Assume that the document image area data includes both character recognition data and image data.

本例ではまず、文書画像理解データの再生利用先において、ステップＳ４０１において文書画像理解データ入力部（１１３）により文書画像理解データを入力し、ステップＳ４０２において文書画像理解データから適応的に再生条件を変更しながら再生文書画像を作成する。 In this example, first, at the reproduction / use destination of the document image understanding data, the document image understanding data is input by the document image understanding data input unit (113) in step S401, and the reproduction condition is adaptively determined from the document image understanding data in step S402. Create a replay document image while changing.

この文書画像理解データ再生処理について図１３のフローチャートで説明する。 The document image understanding data reproduction process will be described with reference to the flowchart of FIG.

ステップＳ１３０１において再生文書画像の下地となる画像を白い下地となるように作成する。 In step S1301, an image serving as a background of the reproduced document image is created so as to be a white background.

次に、ステップＳ１３０２において文書画像理解データを分析し、ステップＳ１３０３で、部分領域の属性が『テキスト』であるならば、ステップＳ１３０４で、文字認識データの検査を行い、文字認識データがあるのであれば、ステップＳ１３０５で文書画像理解データから文字認識データを取り出し、ステップＳ１３０６において文字認識信頼度を調べる。 Next, the document image understanding data is analyzed in step S1302. If the attribute of the partial area is “text” in step S1303, the character recognition data is inspected in step S1304, and there is character recognition data. For example, character recognition data is extracted from the document image understanding data in step S1305, and the character recognition reliability is checked in step S1306.

本例における文字認識信頼度の調べ方の一例としては、部分領域の文字認識データの不確実フラグの割合を用いるものとする。 As an example of how to check the character recognition reliability in this example, the ratio of the uncertainty flag of the character recognition data in the partial area is used.

具体的には、その部分領域内における、不確実フラグが有効でない（０）文字数の和（ｍ）を求め、その領域内の文字数（Ｍ）に対する割合（ｍ／Ｍ）を部分領域文字認識信頼度（ＺＭｒ）としている。
すなわち
ＺＭｒ＝ｍ／Ｍ
として求める。 Specifically, the sum (m) of the number of characters (0) for which the uncertainty flag is not valid in the partial area is obtained, and the ratio (m / M) to the number of characters (M) in the area is determined as the partial area character recognition reliability. Degree (ZMr).
That is, ZMr = m / M
Asking.

次にステップＳ１３０７において、部分領域文字認識信頼度（ＺＭｒ）を検査し、部分領域文字認識信頼度があらかじめ定めた閾値（Ｔｒ２）より大、
すなわち
ＺＭｒ＞Ｔｒ２
である場合、ステップＳ１３０８で、取り出した文字認識データから文字フォントを白い下地画像と合成し再生画像を作成する。 Next, in step S1307, the partial area character recognition reliability (ZMr) is checked, and the partial area character recognition reliability is greater than a predetermined threshold (Tr2).
That is, ZMr> Tr2
In step S1308, a character font is combined with a white background image from the extracted character recognition data to create a reproduction image.

もし、文字認識信頼度があらかじめ定めた閾値以下であるならば文書画像理解データから部分画像データを取り出し、ステップＳ１３０９において、取り出した部分画像とその座標データから白い下地画像と合成し再生画像を作成する。 If the character recognition reliability is equal to or lower than a predetermined threshold, partial image data is extracted from the document image understanding data, and a reproduced image is generated by combining the extracted partial image and its coordinate data with a white background image in step S1309. To do.

また、ステップＳ１３０３で、部分領域の属性が非『テキスト』である場合も文書画像理解データから部分画像データを取り出し、ステップＳ１３０４において、取り出した部分画像とその座標データから白い下地画像と合成し再生画像を作成する。 In step S1303, partial image data is extracted from the document image understanding data even if the attribute of the partial area is non- "text". In step S1304, the extracted partial image and its coordinate data are combined with a white background image and reproduced. Create an image.

以上のような処理によって１つの部分画像に対して再生合成処理（ステップＳ１３０９）を終了したら、ステップＳ１３１０において残りの部分領域が存在しているか否かを調べ、存在していればステップＳ１３０２に戻り、残りの部分領域について上記の処理を繰り返す。 When the reproduction synthesis process (step S1309) is completed for one partial image by the above processing, it is checked in step S1310 whether or not there is a remaining partial area, and if it exists, the process returns to step S1302. The above processing is repeated for the remaining partial areas.

残りの部分領域が存在していなければ、文書画像理解データ再生処理を終了する。 If there is no remaining partial area, the document image understanding data reproduction process is terminated.

そして、ステップＳ４０３において画像出力部（ステップ１１４）から、合成した１ページの再生画像を出力文書として出力する。 In step S403, the synthesized output image of one page is output as an output document from the image output unit (step 114).

以上説明したように第１乃至第４の実施形態によれば、
１）文書画像の保存時における、データ量の削減、
２）文書画像の配信時における、ネットワークトラフィックへの負荷軽減、
３）文書画像の保存、配信時における、再利用に適した高画質の維持、
４）出力時の文書画像の画質劣化や情報欠落の防止、
５）ＤＴＰ（デスクトップパブリッシング）等の電子文書への再利用の容易化、
等がはかれる効果がある。 As described above, according to the first to fourth embodiments,
1) Reducing the amount of data when saving document images
2) Reducing the load on network traffic when delivering document images,
3) Maintaining high image quality suitable for reuse during storage and distribution of document images,
4) Prevention of document image quality degradation and information loss during output,
5) Facilitating reuse of electronic documents such as DTP (desktop publishing)
This has the effect of peeling off.

［第５の実施形態］
上記実施形態に加えて、本第５の実施形態では、セキュリティーを重視する例を説明する。 [Fifth Embodiment]
In addition to the above embodiment, in the fifth embodiment, an example in which security is emphasized will be described.

図１８は第５の実施形態における構成概念とデータの流れを示す図である。 FIG. 18 is a diagram showing a configuration concept and a data flow in the fifth embodiment.

同図において、２１０１は入力される対象となる印刷物やコンピュータ上で扱う画像データ等を含む入力文書、２１０２は文書を画像として入力する画像入力部、２１０３は画像入力部から入力された元画像データ、２１０４は入力文書画像から二値画像を作成する二値画像出力部、２１０５は作成された二値画像、２１０６は二値画像をもとにして文書を図やテキストや表等の各種属性毎の領域に分割した情報すなわちレイアウト解析データを出力するレイアウト解析部、２１０７はレイアウト解析データ、２１０８はレイアウト解析データをもとにして、入力文書元画像及び二値画像から適応的に保存条件を変更しながら画像データ及び暗号化データを文書画像理解データとして保存する文書画像理解データ保存部、２１０９は文書画像理解データを出力する文書画像理解デーグ出力部、２１１０は文書画像理解データ、２１１１は文書画像理解データを入力する文書画像理解データ入力部、２１１２は文書画像理解データから適応的に再生条件を変更しながら再生文書画像を作成する文書画像理解再生部、２１１３は再生文書画像、２１１４は再生文書画像を出力する画像出力部、２１１５は出力文書、２１１６は本第５の実施形態における特徴である、任意の領域の文字認識データならびに画像データを暗号化する暗号化処理部である。 In the figure, reference numeral 2101 denotes an input document including a printed matter to be input and image data handled on a computer, 2102 denotes an image input unit for inputting the document as an image, and 2103 denotes original image data input from the image input unit. Reference numeral 2104 denotes a binary image output unit that creates a binary image from the input document image, 2105 denotes the created binary image, and 2106 denotes a document based on the binary image for each attribute such as a figure, text, or table. Layout analysis unit that outputs information divided into areas, that is, layout analysis data, 2107 is layout analysis data, 2108 is based on layout analysis data, and adaptively changes storage conditions from the input document original image and binary image The document image understanding data storage unit 2109 stores the image data and the encrypted data as document image understanding data. Document image comprehension tag output unit for outputting solution data, 2110 is document image comprehension data, 2111 is a document image comprehension data input unit for inputting document image comprehension data, and 2112 is adapted to change reproduction conditions adaptively from the document image comprehension data. While, a document image comprehension / reproducing unit that creates a reproduced document image, 2113 is a reproduced document image, 2114 is an image output unit that outputs a reproduced document image, 2115 is an output document, and 2116 is a feature of the fifth embodiment. This is an encryption processing unit for encrypting character recognition data and image data in the area.

上記のデータの流れ及び処理が行われるシステムは、図２と同様である。 The system in which the above data flow and processing are performed is the same as in FIG.

次に処理の流れについて図１９、図２０、図２１のフローチャート、及び図１８、図２、図１４、図１５、図２７、図２８、図２９に従って説明する。 Next, the flow of processing will be described with reference to the flowcharts of FIGS. 19, 20, and 21, and FIGS. 18, 2, 14, 15, 27, 28, and 29.

本実施形態でも、２４ビット多値画像をスキャナ（２０３や２０６）より入力し、『テキスト』属性の領域に対して自動的に暗号化して保存し、ネットワークで配信して、配信先でモノクロプリンター（２０８や２１１あるいは２０５）に出力する場合の画像理解処理システムについて示す。 Also in this embodiment, a 24-bit multi-valued image is input from a scanner (203 or 206), is automatically encrypted and stored in an area of “text” attribute, is distributed over a network, and is a monochrome printer at a distribution destination. An image understanding processing system when outputting to (208, 211, or 205) will be described.

まず、ステップＳ２３０１において、画像入力部（２１０２）であるスキャナより文書画像を多値画像データとして入力する。ステップＳ２３０２では、二値画像出力部（２１０４）により二値画像を作成し、それをもとにステップＳ２３０３において、レイアウト解析部（２１０６）により、文書画像を図やテキストや表等の各種属性毎の領域に分割しレイアウト情報を出力する。このレイアウト情報は先に説明した第１の実施形態における図１５と同様である。 First, in step S2301, a document image is input as multi-value image data from a scanner which is an image input unit (2102). In step S2302, a binary image is generated by the binary image output unit (2104), and in step S2303, the layout analysis unit (2106) generates a document image for each attribute such as a figure, text, or table. The layout information is output by dividing the area. This layout information is the same as that in FIG. 15 in the first embodiment described above.

次に、ステップＳ２３０４で文書画像理解保存部（２１０８）においてレイアウト解析データをもとにして、入力文書元画像である多値画像及び二値画像から適応的に保存条件を変更しながら文書画像理解データを作成し保存する。その際に、あらかじめ定められた特定領域のデータの暗号化処理も行う。 Next, in step S2304, the document image understanding storage unit (2108) understands the document image while adaptively changing the storage conditions from the multi-value image and the binary image that are the input document original images based on the layout analysis data. Create and save data. At that time, encryption processing of data in a predetermined specific area is also performed.

本実施形態では、『テキスト』領域の画像データを暗号化するものとする。 In the present embodiment, it is assumed that image data in the “text” area is encrypted.

文書画像理解保存処理の例を図２０のフローチャート、及び図１４、図１５、図２７に従って説明する。 An example of document image understanding storage processing will be described with reference to the flowchart of FIG. 20 and FIGS.

本例では、文書画像理解保存処理（ステップＳ２３０４）に入ったら、まずステップＳ２５０１で各分割領域のレイアウト解析データを分析して各領域の領域属性を分類する。 In this example, when the document image understanding storage process (step S2304) is entered, first, in step S2501, the layout analysis data of each divided area is analyzed to classify the area attributes of each area.

次に、ステップＳ２５０２において領域属性が『テキスト』であれば、ステップＳ２５０３で切り取り先全体画像として二値画像を選択し、ステップＳ２５０４でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 Next, if the area attribute is “text” in step S2502, a binary image is selected as the cut-out whole image in step S2503, and a partial image is cut out using the coordinate data of the area of the layout analysis data in step S2504.

次にステップＳ２５０５で、その部分画像に対して暗号化処理を行い、暗号化データを作成し、ステップＳ５０６で、暗号化データを文書画像理解データとして保存する。 In step S2505, the partial image is encrypted to create encrypted data. In step S506, the encrypted data is stored as document image understanding data.

さらに詳細な本例の文書画像理解データの例は図２７に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび暗号化した切り取り画像を部分画像暗号化データとして保存する。この時暗号化フラグを有効（＝１）にすることでデータが暗号化されていることを示すようにする。 In a more detailed example of the document image understanding data of this example, as shown in FIG. 27, the appearance rank in the layout analysis data of the target layout area data is used as the ID of the layout area data, and the cut width and the cut height. The image type, the compression method, the image size, and the encrypted cut image are stored as the partial image encryption data. At this time, the encryption flag is enabled (= 1) to indicate that the data is encrypted.

また、ステップＳ２５０２において、領域属性が非『テキスト』であるならば、ステップＳ２５０７で切り取り先全体画像として元画像を選択し、ステップＳ２５０８でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 In step S2502, if the area attribute is non- "text", an original image is selected as the entire cut-out image in step S2507, and a partial image is cut using the coordinate data of the area of the layout analysis data in step S2508. .

次にステップＳ２５０９において切り取った部分画像の画像タイプを調べ、二値画像タイプであれば、ステップＳ２５１０において二値画像用の圧縮処理を施し（例えばＭＭＲ等）、圧縮した部分画像をステップＳ２５１１において文書画像理解データ（図２７）として保存処理を行う。 Next, the image type of the partial image cut out in step S2509 is checked, and if it is a binary image type, a binary image compression process is performed in step S2510 (for example, MMR), and the compressed partial image is converted into a document in step S2511. Storage processing is performed as image understanding data (FIG. 27).

この時文書画像理解データに圧縮方式も記録しておく事により、文書画像理解再生処理において圧縮画像を復号可能とする。 At this time, by recording the compression method in the document image understanding data, the compressed image can be decoded in the document image understanding reproduction process.

また、ステップＳ２５０９において調べた結果が、多値画像タイプであったならば、ステップＳ２５１２において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をステップＳ２５１１において文書画像理解データ（図２７）として保存処理を行う。 If the result checked in step S2509 is a multi-valued image type, multi-valued image compression processing is performed in step S2512 (for example, JPEG), and the compressed partial image is converted into document image understanding data in step S2511. The storage process is performed as shown in FIG.

本例の文書画像理解データの例は図２７に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび圧縮した切り取り画像を部分画像データとして保存する。 In the example of the document image understanding data of this example, as shown in FIG. 27, the appearance order in the layout analysis data of the target layout area data is taken as the ID of the layout area data, the cut width, the cut height, the image type The compression method, the image size, and the compressed cut image are stored as partial image data.

次にステップＳ２５１３において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ２５０１に戻り、残りの分割領域について上記の処理を繰り返す。 Next, in step S2513, it is checked whether or not there is a remaining divided area. If it exists, the process returns to step S2501, and the above processing is repeated for the remaining divided areas.

残りの分割領域が存在していなければ、文書画像理解データ保存処理を終了する。そして図１９のステップＳ２３０５に進んで、文書画像理解データを出力する。 If there is no remaining divided area, the document image understanding data storage process is terminated. Then, the process proceeds to step S2305 in FIG. 19 to output document image understanding data.

ここで、本実施形態では文書画像理解保存部（２１０８）において作成・保存された文書画像理解データは文書画像理解データ出力部（２１０９）によりネットワーク上に出力され、ネットワークを介して利用先に配信される。 Here, in this embodiment, the document image understanding data created and stored in the document image understanding storage unit (2108) is output to the network by the document image understanding data output unit (2109) and distributed to the users via the network. Is done.

利用先では、先の第１の実施形態で説明したように、図４のフローチャートに従って処理することになる。つまり、ステップＳ４０１において文書画像理解データ入力部（２１１１）により文書画像理解データを入力し、ステップＳ４０２において文書画像理解データから適応的に再生条件を変更しながら再生文書画像を作成する。 At the user, processing is performed according to the flowchart of FIG. 4 as described in the first embodiment. That is, in step S401, the document image understanding data input unit (2111) inputs the document image understanding data, and in step S402, a reproduction document image is created while changing the reproduction condition adaptively from the document image understanding data.

本第５の実施形態における具体的な文書画像理解再生処理（ステップＳ４０２）の例を図２１のフローチャートに従って説明する。 An example of specific document image understanding reproduction processing (step S402) in the fifth embodiment will be described with reference to the flowchart of FIG.

本例では、まず、ステップＳ２６０１において再生文書画像の下地となる画像を白い下地となるように作成する。次に、ステップＳ２６０２において文書画像理解データを分析し、ステップＳ２６０３で、部分領域の暗号化データにおける暗号化フラグが有効（＝１）であるならば、ステップＳ２６０４で、文書画像理解データから暗号化データを取り出し、暗号化解除をおこない、部分画像に戻す。 In this example, first, in step S2601, an image that is a background of a reproduced document image is created so as to be a white background. Next, the document image understanding data is analyzed in step S2602. If the encryption flag in the partial area encrypted data is valid (= 1) in step S2603, the document image understanding data is encrypted in step S2604. Retrieve data, decrypt, and return to partial image.

また、暗号化フラグが有効になっていないのであるならば、ステップＳ２６０５で文書画像理解データから画像データを取り出す。そして、ステップＳ２６０６において、取り出した部分画像とその座標データから白い下地画像と合成し再生画像を作成する。 If the encryption flag is not valid, the image data is extracted from the document image understanding data in step S2605. In step S2606, a reproduced image is created by combining the extracted partial image and its coordinate data with a white background image.

再生合成処理（ステップＳ２６０６）の一例としては、第１の実施形態で説明した図１０のフローチャートに従うものである。 An example of the reproduction synthesis process (step S2606) follows the flowchart of FIG. 10 described in the first embodiment.

つまり、ステップＳ１００１において文書画像理解データから部分画像の画像タイプを抽出し、ステップ１００２において、画像タイプが『二値画像タイプ』であれば、ステップＳ１００３において、二値画像の黒、白画素をそれぞれ２４ビット多値画像の黒、白画素に変換する疑似２４ビット化の処理を行う。 That is, in step S1001, the image type of the partial image is extracted from the document image understanding data. If the image type is “binary image type” in step 1002, the black and white pixels of the binary image are respectively determined in step S1003. A pseudo 24-bit conversion process is performed to convert the 24-bit multi-valued image into black and white pixels.

本例では、二値画像の黒画素は１、白画素は０、２４ビット多値画像の黒画素は、Ｒ＝０、Ｇ＝０、Ｂ＝０であり、白画素はＲ＝２５５、Ｇ＝２５５、Ｂ＝２５５（ＲはＲｅｄ成分、ＧはＧｒｅｅｎ成分、ＢはＢｌｕｅ成分で各８ビットの値を持つ。）としている。 In this example, the black pixel of the binary image is 1, the white pixel is 0, the black pixel of the 24-bit multi-valued image is R = 0, G = 0, and B = 0, and the white pixel is R = 255, G = 255, B = 255 (R is a Red component, G is a Green component, and B is a Blue component each having an 8-bit value).

またステップＳ１００２において、画像タイプが『２４ビット多値画像タイプ』であればそのままの部分画像を合成に用いる。つぎに、ステップＳ１００４において下地画像と部分画像に対し部分画像の各画素の論理演算を行い合成画像を作成する。 In step S1002, if the image type is “24-bit multi-value image type”, the partial image as it is is used for synthesis. Next, in step S1004, a logical operation is performed on each pixel of the partial image with respect to the base image and the partial image to create a composite image.

本例では、下地画像の白画素（Ｒ＝２５５，Ｇ＝２５５、Ｂ＝２５５）と部分画像の黒画索（Ｒ＝０，Ｇ＝０，Ｂ＝０）の合成画像が黒画素（Ｒ＝０，Ｇ＝０，Ｂ＝０）となるように論理演算を用いている。 In this example, the composite image of the white pixels (R = 255, G = 255, B = 255) of the background image and the black image (R = 0, G = 0, B = 0) of the partial image is a black pixel (R = 0, G = 0, B = 0).

以上のような処理によって１つの部分画像に対して再生合成処理（Ｓ２６０６）を終了したら、ステップＳ２６０７において残りの部分画像が存在しているか否かを調べ、存在していればステップＳ２６０２に戻り、残りの部分領域について上記の処理を繰り返す。残りの部分領域が存在していなければ、文書画像理解データ再生処理を終了する。 When the reproduction / combination processing (S2606) is completed for one partial image by the above processing, it is checked in step S2607 whether the remaining partial images exist. If there is, the process returns to step S2602. The above process is repeated for the remaining partial areas. If there is no remaining partial area, the document image understanding data reproduction process is terminated.

そして、図４のステップＳ４０３において画像出力部（２１１４）であるモノクロプリンターから再生画像を出力文書として出力する。 In step S403 in FIG. 4, the reproduction image is output as an output document from the monochrome printer as the image output unit (2114).

［第６の実施形態］
上記第５の実施形態では、テキスト領域を暗号化させるものであったが、オペレータが選択した特定属性のみを暗号化させることも可能である。この例を第６の実施形態として説明する。 [Sixth Embodiment]
In the fifth embodiment, the text area is encrypted, but it is also possible to encrypt only the specific attribute selected by the operator. This example will be described as a sixth embodiment.

具体例を図１９を再度用い、更に図２２、図２３を用いて説明する。なお、図２３は図１８の変形例でもある。 A specific example will be described with reference to FIG. 19 and with reference to FIGS. FIG. 23 is also a modification of FIG.

先ず、ステップＳ２３０１で画像入力部（２８０２）であるスキャナより文書画像を多値画像として入力する。 First, in step S2301, a document image is input as a multivalued image from a scanner which is an image input unit (2802).

次に、オペレータ指示部（２８１６）により、暗号化する領域属性を指示する。例えば、入力した画像を表示させ、その中の『表』属性の領域に対して暗号化するように指示する。 Next, the operator instruction section (2816) instructs the area attribute to be encrypted. For example, the input image is displayed, and an instruction is given to encrypt the “table” attribute area in the image.

次いで、ステップＳ３０２で、二値画像出力部（２８０４）により二値画像を作成し、それをもとにステップＳ２３０３において、レイアウト解析部（２８０６）により、文書画像を図やテキストや表等の各種属性毎の領域に分割しレイアウト情報（図１５）を出力する。 Next, in step S302, a binary image is generated by the binary image output unit (2804), and in step S2303, the layout analysis unit (2806) converts the document image into various forms such as figures, texts, and tables. The layout information is divided into areas for each attribute and layout information (FIG. 15) is output.

次に、ステップＳ２３０５で文書画像理解保存部（２８０８）においてレイアウト解析データならびにオペレータの指示をもとにして、入力文書元画像である多値画像及び二値画像から適応的に保存条件を変更しながら文書画像理解データを作成し保存する。 In step S2305, the document image understanding storage unit (2808) adaptively changes the storage conditions from the multi-value image and the binary image that are the input document original images based on the layout analysis data and the operator's instruction. While creating and saving document image understanding data.

さらに具体的な文書画像理解保存処理の例を図２３のフローチャート、及び図１４、図１５、図２７に従って説明する。 A more specific example of document image understanding storage processing will be described with reference to the flowchart of FIG. 23 and FIGS.

本例では、文書画像理解保存処理（ステップＳ２３０５）に入ったら、まずステップＳ２９０１でオペレータが設定した属性をセットする。図３０は、この時の表示画面の例を示しており、ウインドウ右上の暗号化指定のコンボボックスを指定し、その中で適当な領域属性について暗号化を指定することになる。ここでは、ペレータは『テキスト』を指示したものとする。設定するのはポインティングデバイス等を領域内を指定することで行えばよい。 In this example, after entering the document image understanding storage process (step S2305), first, the attribute set by the operator is set in step S2901. FIG. 30 shows an example of the display screen at this time. An encryption designation combo box at the upper right of the window is designated, and encryption is designated for an appropriate area attribute. Here, it is assumed that the operator indicates “text”. Setting may be performed by designating a pointing device or the like in the area.

次にステップＳ２９０２で、各分割領域のレイアウト解析データを分析して各領域の領域属性を分類する。そして、ステップＳ２９０３において領域属性が暗号化対象として指定されたものであれば、つまり、上記の例の場合には『テキスト』であれば、ステップＳ２９０４で切り取り先全体画像として二値画像を選択し、ステップＳ２９０５でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 In step S2902, the layout analysis data of each divided area is analyzed to classify the area attributes of each area. If the region attribute is designated as the encryption target in step S2903, that is, in the case of the above example, “text”, a binary image is selected as the entire cut-out image in step S2904. In step S2905, the partial image is cut out using the coordinate data of the layout analysis data area.

さらに、ステップＳ２９０６で、その部分画像にたいして暗号化処理を行い、暗号化データを作成し、ステップＳ２９０７で、暗号化データを文書画像理解データとして保存する。 In step S2906, the partial image is encrypted to create encrypted data. In step S2907, the encrypted data is stored as document image understanding data.

この時暗号化フラグを有効（＝１）にすることでデータが暗号化されていることを示している。 At this time, the encryption flag is enabled (= 1) to indicate that the data is encrypted.

また、ステップＳ２９０３において、領域属性が非暗号化領域を示す場合には、ステップＳ２９０８で切り取り先全体画像として元画像を選択し、ステップＳ２９０９でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 In step S2903, if the area attribute indicates a non-encrypted area, the original image is selected as the entire cut-out image in step S2908, and the partial image is selected using the coordinate data of the area of the layout analysis data in step S2909. cut out.

次にステップＳ２９１０において切り取った部分画像の画像タイプを調べ、二値画像タイプであれば、ステップＳ２９１１において二値画像用の圧縮処理を施し（例えばＭＭＲ等）、圧縮した部分画像をステップＳ２９１３において文書画像理解データ（図２７）として保存処理を行う。 Next, the image type of the partial image cut out in step S2910 is checked. If the image type is a binary image type, binary image compression processing is performed in step S2911 (for example, MMR), and the compressed partial image is converted into a document in step S2913. Storage processing is performed as image understanding data (FIG. 27).

また、ステップＳ２９１０において調べた結果が、多値画像タイプであったならば、ステップＳ２９１２において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をステップＳ２９１３において文書画像理解データ（図２７）として保存処理を行う。 If the result checked in step S2910 is a multi-value image type, multi-value image compression processing is performed in step S2912 (for example, JPEG), and the compressed partial image is converted into document image understanding data in step S2913. The storage process is performed as shown in FIG.

さらに詳細な本例の文書画像理解データの例は、図２７に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび圧縮した切り取り画像を部分画像データとして保存する。 As shown in FIG. 27, the more detailed example of the document image understanding data of the present example uses the appearance rank in the layout analysis data of the target layout area data as the layout area data ID, and the cut width and cut height. The image type, compression method, image size, and compressed cut image are stored as partial image data.

次にステップＳ２９１４において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ２９０２に戻り、残りの分割領域について上記の処理を繰り返す。残りの分割領域が存在していなければ、文書画像理解データ保存処理を終了する。こうして、ステップＳ２３０５で文書画像理解データを出力する。 Next, in step S2914, it is checked whether or not there is a remaining divided area. If it exists, the process returns to step S2902, and the above processing is repeated for the remaining divided areas. If there is no remaining divided area, the document image understanding data storage process is terminated. In this way, document image understanding data is output in step S2305.

［第７の実施形態］
上記第６の実施形態では、属性毎に暗号化するしないを設定するものであったが、オペレータが入力時の画像およびレイアウト解析結果を確認し、その解析結果から所望とする領域を選択して暗号化を指示することも可能である。 [Seventh Embodiment]
In the sixth embodiment, encryption is not set for each attribute. However, the operator checks the input image and layout analysis result, and selects a desired region from the analysis result. It is also possible to instruct encryption.

具体例を図１９、図２４、図２５を用いて説明する。 A specific example will be described with reference to FIGS.

まず、ステップ２０３１で、画像入力部（３００２）であるスキャナより文書画像を多値画像として入力する。 First, in step 2031, a document image is input as a multi-value image from a scanner which is an image input unit (3002).

次に、ステップＳ２３０２で、二値画像出力部（３００４）により二値画像を作成し、それをもとにステップＳ２３０３において、レイアウト解析部（３００６）により、文書画像を図やテキストや表等の各種属性毎の領域に分割しレイアウト情報（図１５）を出力する。 In step S2302, a binary image is generated by the binary image output unit (3004). In step S2303, the layout analysis unit (3006) converts the document image into a figure, text, table, or the like. The layout information (FIG. 15) is output after being divided into areas for various attributes.

そのレイアウト情報はレイアウト解析データ表示部（３０１６）により入力画像と重ねて表示されることにより、オペレータは、オペレータ指示部（３０１７）において、ポインティングデバイス等で暗号化する領域を指示することが可能となる。図３１は、この時の操作画面であり、適当な領域を先ず、指定し、その後でその領域について暗号化の設定を“ＯＮ”にしていることを示している。暗号化対象となった領域（図示の場合にはテキスト領域）は、他の領域と区別して表示することで操作者にどの領域が暗号化されることになるのかを知らしめる。 The layout information is displayed on the layout analysis data display unit (3016) so as to be superimposed on the input image, so that the operator can instruct an area to be encrypted with a pointing device or the like in the operator instruction unit (3017). Become. FIG. 31 is an operation screen at this time, and shows that an appropriate area is designated first, and then the encryption setting is turned “ON” for that area. The area to be encrypted (in the illustrated case, the text area) is displayed separately from the other areas, thereby informing the operator which area is to be encrypted.

次に、ステップＳ２３０５で文書画像理解保存部（３００８）においてレイアウト解析データならびにオペレータの指示をもとにして、入力文書元画像である多値画像及び二値画像から適応的に保存条件を変更しながら文書画像理解データを作成し保存する。 In step S2305, the document image understanding storage unit (3008) adaptively changes the storage conditions from the multi-valued image and the binary image that are the input document source images based on the layout analysis data and the operator's instruction. While creating and saving document image understanding data.

さらに具体的な文書画像理解保存処理の例を図２５のフローチャート、及び図１４、図１５、図２７に従って説明する。 A more specific example of document image understanding storage processing will be described with reference to the flowchart of FIG. 25 and FIGS.

本例では、文書画像理解保存処理（ステップＳ２３０５）に入ったら、まずステップＳ３１０１でオペレータが設定した領域のＩＤをセットする。次にステップＳ３１０２で、各分割領域のレイアウト解析データを分析して各領域の領域属性を分類する。 In this example, when the document image understanding storage process (step S2305) is entered, first, the ID of the area set by the operator is set in step S3101. In step S3102, the layout analysis data of each divided area is analyzed to classify the area attributes of each area.

そして、ステップＳ３１０３において領域属性が『テキスト』であれば、ステップＳ３１０４で切り取り先全体画像として二値画像を選択し、ステップＳ３１０６でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 If the area attribute is “text” in step S3103, a binary image is selected as the entire cut-out image in step S3104, and a partial image is cut using the coordinate data of the area of the layout analysis data in step S3106.

また、ステップＳ３１０３において、領域属性が『テキスト』でないならば、ステップＳ３１０５で切り取り先全体画像として元画像を選択し、ステップＳ３１０６でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 If the region attribute is not “text” in step S3103, the original image is selected as the entire cut-out image in step S3105, and the partial image is cut using the coordinate data of the region of the layout analysis data in step S3106.

そして、ステップＳ３１０７で領域ＩＤを検査し、暗号化対象領域か否かを調べる。 In step S3107, the area ID is checked to determine whether it is an encryption target area.

暗号化対象領域であるならば、ステップＳ３１０８でその部分画像にたいして暗号化処理を行い、暗号化データを作成し、ステップＳ３１０９で、暗号化データを文書画像理解データとして保存する。 If it is the encryption target area, the partial image is encrypted in step S3108 to create encrypted data, and in step S3109, the encrypted data is stored as document image understanding data.

また、ステップＳ３１０７で暗号化対象領域でないならば、ステップＳ３１１０において切り取った部分画像の画像タイプを調べ、二値画像タイプであれば、ステップＳ３１１１において二値画像用の圧縮処理を施し（例えばＭＭＲ等）、圧縮した部分画像をステップＳ３１１３において文書画像理解データ（図２７）として保存処理を行う。 If it is not the encryption target area in step S3107, the image type of the partial image cut out in step S3110 is checked. If it is a binary image type, binary image compression processing is performed in step S3111 (for example, MMR or the like). In step S3113, the compressed partial image is stored as document image understanding data (FIG. 27).

また、ステップＳ３１１０において調べた結果が、多値画像（写真画像等の非２値画像）タイプであったならば、ステップＳ３１１２において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をステップＳ３１１３において文書画像理解データ（図２７）として保存処理を行う。 Also, if the result of the examination in step S3110 is a multi-value image (non-binary image such as a photographic image) type, a compression process for multi-value image (for example, JPEG) is performed and compressed in step S3112. In step S3113, the partial image is stored as document image understanding data (FIG. 27).

文書画像理解データの例は、図２７に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび圧縮した切り取り画像を部分画像データとして保存する。 In the example of the document image understanding data, as shown in FIG. 27, the appearance order in the layout analysis data of the target layout area data is set as the ID of the layout area data, and the cut width, the cut height, the image type, and the compression The method, image size, and compressed cut image are stored as partial image data.

次にステップＳ３１１４において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ３１０２に戻り、残りの分割領域について上記の処理を繰り返す。 Next, in step S3114, it is checked whether or not there is a remaining divided area. If there is, the process returns to step S3102 and the above processing is repeated for the remaining divided areas.

残りの分割領域が存在していなければ、文書画像理解データ保存処理を終了する。こうして、ステップＳ２３０５で文書画像理解データを出力する。 If there is no remaining divided area, the document image understanding data storage process is terminated. In this way, document image understanding data is output in step S2305.

［第８の実施形態］
領域属性が『テキスト』など文字認識可能な領域の場合において、文字認識処理を用いて、画像データではなく文字認識データを保存する湯合や、画像データと文字認識データの両方を保存する場合等においても、文字認識データならびに画像データを暗号化することによって、暗号化データを保存することが可能である。 [Eighth Embodiment]
When the area attribute is an area that can recognize characters, such as “text”, using character recognition processing to save character recognition data instead of image data, or to save both image data and character recognition data Also, the encrypted data can be stored by encrypting the character recognition data and the image data.

以下に具体例を第８の実施形態として説明する。ここでは、一例として、『テキスト』の場合は文字認識結果を暗号化した暗号化データ、他の領域は画像データを保存させる例を、図１８、図１９、図２６、図１４、図１５、図２７、図２８、図２９を用いて説明する。 A specific example will be described below as an eighth embodiment. Here, as an example, in the case of “text”, the encrypted data obtained by encrypting the character recognition result and the image data in the other area are stored as examples. FIG. 18, FIG. 19, FIG. 26, FIG. This will be described with reference to FIGS. 27, 28, and 29. FIG.

まず、ステップＳ２３０１において、画像入力部（２１０２）であるスキャナより文書画像を多値画像として入力する。 First, in step S2301, a document image is input as a multi-valued image from a scanner which is an image input unit (2102).

ステップＳ２３０２で、二値画像出力部（２１０４）により二値画像を作成し、それをもとにステップＳ２３０３において、レイアウト解析部（２１０６）により、文書画像を図やテキストや表等の各種属性毎の領域に分割しレイアウト情報（図１５）を出力する。 In step S2302, a binary image is generated by the binary image output unit (2104), and in step S2303, the layout analysis unit (2106) converts the document image into various attributes such as diagrams, texts, and tables. The layout information (FIG. 15) is output.

次に、ステップＳ２３０４で文書画像理解保存部（２１０８）においてレイアウト解析データをもとにして、入力文書元画像である多値画像及び二値画像から適応的に保存条件を変更しながら文書画像理解データを作成し保存する。 Next, in step S2304, the document image understanding storage unit (2108) understands the document image while adaptively changing the storage conditions from the multi-value image and the binary image that are the input document original images based on the layout analysis data. Create and save data.

その際に、あらかじめ定められた特定領域のデータの暗号化処理も行う。 At that time, encryption processing of data in a predetermined specific area is also performed.

さらに具体的な文書画像理解保存処理の例を図２６のフローチャート、及び図１４、図１５、図２７、図２８、図２９に従って説明する。 A more specific example of document image understanding storage processing will be described with reference to the flowchart of FIG. 26 and FIGS. 14, 15, 27, 28, and 29. FIG.

本例では、文書画像理解保存処理（ステップＳ２３０５）に入ったら、まずステップＳ３２０１で各分割領域のレイアウト解析データを分析して各領域の領域属性を分類する。そして、ステップＳ３２０２において領域属性が『テキスト』であれば、ステップＳ３２０３で認識用先全体画像として二値画像を選択し、ステップＳ３２０４でレイアウト解析データの領域の座標データを用いて部分画像に対して文字認識処理を行う。 In this example, when the document image understanding storage process (step S2305) is entered, first, in step S3201, the layout analysis data of each divided area is analyzed to classify the area attributes of each area. If the region attribute is “text” in step S3202, a binary image is selected as the recognition destination entire image in step S3203, and the partial image is selected using the coordinate data of the region of the layout analysis data in step S3204. Perform character recognition processing.

次に、ステップＳ３２０５において文字認識した結果の文字認識データを暗号化処理し、暗号化データを作成して、ステップＳ３２０６で、暗号化データを文書画像理解データ（図２９）として保存する。ここで暗号化フラグが有効（＝１）にすることは、そのデータが暗号化されていることを示している。 Next, the character recognition data resulting from the character recognition in step S3205 is encrypted to create encrypted data, and in step S3206, the encrypted data is stored as document image understanding data (FIG. 29). Here, setting the encryption flag to valid (= 1) indicates that the data is encrypted.

また、ステップＳ３２０２において、領域属性が『テキスト』でないならば、ステップＳ３２０７で切り取り先全体画像として元画像を選択し、ステップＳ３２０８でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 If the region attribute is not “text” in step S3202, the original image is selected as the entire cut-out image in step S3207, and the partial image is cut using the coordinate data of the region of the layout analysis data in step S3208.

そして、ステップＳ３２０９において切り取った部分画像の画像タイプを調べ、二値画像タイプであれば、ステップＳ３２１０において二値画像用の圧縮処理を施し（例えばＭＭＲ等）、圧縮した部分画像をステップＳ３２１１において文書画像理解データ（図２７）として保存処理を行う。 Then, the image type of the partial image cut out in step S3209 is checked. If the image type is a binary image type, binary image compression processing is performed in step S3210 (for example, MMR), and the compressed partial image is converted into a document in step S3211. Storage processing is performed as image understanding data (FIG. 27).

また、ステップＳ３２１０において調べた結果が、多値画像タイプであったならば、ステップＳ３２１２において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をステップＳ３２１１において文書画像理解データ（図２７）として保存処理を行う。 If the result checked in step S3210 is a multi-value image type, multi-value image compression processing is performed in step S3212 (for example, JPEG), and the compressed partial image is converted into document image understanding data in step S3211. The storage process is performed as shown in FIG.

次にステップＳ３２１３において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ３２０１に戻り、残りの分割領域について上記の処理を繰り返す。残りの分割領域が存在していなければ、文書画像理解データ保存処理を終了する。こうして、ステップＳ２３０５で文書画像理解データを出力する。 Next, in step S3213, it is checked whether or not there is a remaining divided area. If there is, the process returns to step S3201, and the above processing is repeated for the remaining divided areas. If there is no remaining divided area, the document image understanding data storage process is terminated. In this way, document image understanding data is output in step S2305.

なお、上記処理に更に、入力した多値画像を表示し、暗号化しようとする領域を指定していき、その中でテキスト領域についてはそのテキスト画像及び文字認識結果の両方を暗号化させるようにしてもよい。解読できるユーザ以外（解除できるキーを入力しなければ）は、その領域は「暗号化されているため表示できません」というメッセージを表示し、その領域をマウス等で指示したときに解除するキー入力を促すキー入力ダイアログボックスを表示し、設定された番号キーと一致した場合にそれを表示するようにする。なお、印刷する場合にも解除しなければその領域の印刷は行わない。 In addition to the above processing, the input multi-valued image is displayed and an area to be encrypted is designated, and in the text area, both the text image and the character recognition result are encrypted. May be. A non-decipherable user (unless you enter a key that can be unlocked) displays a message saying that the area is "encrypted and cannot be displayed", and presses the key input to cancel when the area is instructed with a mouse etc. A prompt key input dialog box is displayed, and when it matches the set number key, it is displayed. Even when printing, if the area is not canceled, the area is not printed.

以上説明したように第５乃至第８の実施形態によれば、
１）文書画像の保存時における、データ量の削減
２）文書画像の配信時における、ネットワークトラフィックへの負荷軽減
３）文書画像の保存、配信時における、再利用に適した高画質の維持
４）出力時の文書画像の画質劣化や情報欠落の防止
５）ＤＴＰ（デスクトップパプリッシング）等の電子文書への再利用の容易化
６）文書画像の保存、配信時におけるセキュリテイーの向上
等がはかれる効果がある。 As described above, according to the fifth to eighth embodiments,
1) Reduction of data amount when storing document images 2) Reduction of network traffic load when distributing document images 3) Maintaining high image quality suitable for reuse when storing and distributing document images 4) Prevention of image quality degradation and information loss during output 5) Facilitation of reuse for electronic documents such as DTP (desktop publishing) 6) Improvement of security during storage and distribution of document images .

［第９の実施形態］
本第９の実施形態では、例えば文書画像中のあるテキストの言語を他の言語に翻訳する機能を付加する例について説明する。 [Ninth Embodiment]
In the ninth embodiment, an example of adding a function of translating a language of a certain text in a document image into another language will be described.

図３２は第９の実施形態における構成概念及びデータの流れを示している。 FIG. 32 shows the configuration concept and the data flow in the ninth embodiment.

同図において、４１０１は入力される対象となる印刷物やコンピュータ上で扱う画像データ等を含む入力文書、４１０２は文書を画像として入力する画像入力部、４１０３は画像入力部から入力された元画像データ、４１０４は入力文書画像から二値画像を作成する二値画像出力部、４１０５は作成された二値画像、４１０６は二値画像をもとにして文書を図やテキストや表等の各種属性毎の領域に分割した情報すなわちレイアウト解析データを出力するレイアウト解析部、４１０７はレイアウト解析データ、４１０８はレイアウト解析データをもとにして、入力文書元画像及び二値画像から適応的に保存条件を変更しながら画像データ、文字認識データ、および翻訳データを文書画像理解データとして保存する文書画像理解データ保存部である。 In the figure, reference numeral 4101 denotes an input document including a printed matter to be input and image data handled on a computer, 4102 denotes an image input unit for inputting the document as an image, and 4103 denotes original image data input from the image input unit. Reference numeral 4104 denotes a binary image output unit that creates a binary image from the input document image. 4105 denotes the created binary image. 4106 denotes a document based on the binary image for each attribute such as a figure, text, or table. Layout analysis unit that outputs information divided into regions, ie, layout analysis data, 4107 is layout analysis data, and 4108 is an adaptive change of storage conditions from the input document original image and binary image based on the layout analysis data The document image understanding data storage unit stores image data, character recognition data, and translation data as document image understanding data. That.

４１０９は文書画像理解データを出力する文書画像理解デーグ出力部、４１１０は文書画像理解保存データ、４１１１は文書画像理解保存データを入力する文書画像理解データ入力部、４１１２は文書画像理解データから適応的に再生条件を変更しながら再生文書画像を作成する文書画像理解再生部、４１１３は再生文書画像、４１１４は再生文書画像を出力する、画像出力部である。４１１５は出力文書、４１１６は任意の領域の文字を認識しコード情報等を文字認識データとして出力する文字認識処理部、４１１７は任意の種類の言語を他の任意の種類の言語に翻訳した情報を翻訳データとして出力する翻訳処理部（翻訳のための各種構文解析用の辞書等が含まれる）である。 4109 is a document image comprehension tag output unit that outputs document image comprehension data, 4110 is document image comprehension storage data, 4111 is a document image comprehension data input unit that inputs document image comprehension storage data, and 4112 is adaptive from the document image comprehension data. A document image understanding / reproducing unit for generating a reproduced document image while changing the reproduction condition is denoted by 4113, a reproduced document image, and 4114 is an image output unit for outputting the reproduced document image. 4115 is an output document, 4116 is a character recognition processing unit that recognizes characters in an arbitrary region and outputs code information as character recognition data, and 4117 is information obtained by translating an arbitrary type of language into another arbitrary type of language. This is a translation processing unit (including a dictionary for various syntax analysis for translation) that is output as translation data.

上記構成におけるシステムは図２と同様であるので、その説明は省略する。 Since the system in the above configuration is the same as that in FIG.

次に処理の流れについて図１９、図４、図３３、図１０のフローチャート、及び図３２、図２、図４０、図１５、図１６、図１７、図４１に従って説明する。 Next, the flow of processing will be described with reference to the flowcharts of FIGS. 19, 4, 33, and 10, and FIGS. 32, 2, 40, 15, 16, 17, and 41.

本第９の実施形態では２４ビット多値画像をスキャナ（２０３や２０６）より入力し、『テキスト』属性の領域に対して自動的に翻訳処理して保存し、ネットワークで配信して、配信先でモノクロプリンター（２０８や２１１あるいは２０５）に出力する場合の画像理解処理システムについて示す。 In the ninth embodiment, a 24-bit multi-valued image is input from a scanner (203 or 206), automatically translated into a “text” attribute area, stored, distributed over a network, and distributed to An image understanding processing system when outputting to a monochrome printer (208, 211, or 205) will be described.

まず、ステップＳ２３０１において、画像入力部（４１０２）であるスキャナより文書画像を多値画像として入力する。 First, in step S2301, a document image is input as a multivalued image from a scanner which is an image input unit (4102).

ステップＳ２３０２で、二値画像出力部（４１０４）により二値画像を作成し、それをもとにステップＳ２３０３において、レイアウト解析部（４１０６）により、文書画像を図やテキストや表等の各種属性毎の領域に分割しレイアウト情報（図１５）を出力する。 In step S2302, a binary image is generated by the binary image output unit (4104), and in step S2303, the layout analysis unit (4106) generates a document image for each attribute such as a figure, text, or table. The layout information (FIG. 15) is output.

次に、ステップＳ２３０４で文書画像理解保存部（４１０８）においてレイアウト解析データをもとにして、入力文書元画像である多値画像及び二値画像から適応的に保存条件を変更しながら文書画像理解データを作成し保存する。その際に、あらかじめ定められた特定領域のデータの文字認識処理ならびに翻訳処理も行う。 Next, in step S2304, the document image understanding storage unit (4108) understands the document image while adaptively changing the storage conditions from the multi-value image and the binary image that are the input document original images based on the layout analysis data. Create and save data. At that time, character recognition processing and translation processing of data in a predetermined specific area are also performed.

本実施形態では、『テキスト』領域の画像データを文字認識し翻訳（日本語→英語）する例を示す。 In the present embodiment, an example in which image data in the “text” region is recognized and translated (from Japanese to English) is shown.

具体的な文書画像理解保存処理の例を図３３のフローチャート、及び図４０、図１５、図１６に従って説明する。 A specific example of document image understanding storage processing will be described with reference to the flowchart of FIG. 33 and FIGS. 40, 15, and 16.

本例では、文書画像理解保存処理（Ｓ２３０４）に入ったら、まずステップＳ４５０１で各分割領域のレイアウト解析データを分析して各領域の領域属性を分類する。 In this example, when the document image understanding storage process (S2304) is entered, first, in step S4501, the layout analysis data of each divided area is analyzed to classify the area attributes of each area.

そして、ステップＳ４５０２において領域属性が『テキスト』であれば、ステップＳ４５０３で認識用先全体画像として二値画像を選択し、ステップＳ４５０４でレイアウト解析データの領域の座標データを用いて部分画像に対して文字認識処理を行う。 If the region attribute is “text” in step S4502, a binary image is selected as the entire image for recognition in step S4503, and the partial image is selected using the coordinate data of the region of the layout analysis data in step S4504. Perform character recognition processing.

ステップＳ４５０５で文字認識データを文書画像理解データ（図１６）として保存する。 In step S4505, the character recognition data is stored as document image understanding data (FIG. 16).

次に、ステップＳ４５０６において文字認識した結果の文字認識データを翻訳処理し、翻訳データを作成して、ステップＳ４５０７で翻訳データも文書画像理解データ（図４１）として保存する。 Next, the character recognition data obtained as a result of character recognition in step S4506 is translated to create translation data. In step S4507, the translation data is also stored as document image understanding data (FIG. 41).

また、ステップＳ４５０２において、領域属性が『テキスト』でないならば、ステップＳ４５０８で切り取り先全体画像として元画像を選択し、ステップＳ４５０９でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 If the region attribute is not “text” in step S4502, the original image is selected as the entire cut-out image in step S4508, and the partial image is cut using the coordinate data of the region of the layout analysis data in step S4509.

そして、ステップＳ４５１０において切り取った部分画像の画像タイプを調べ、二値画像タイプであれば、ステップＳ４５１１において二値画像用の圧縮処理を施し（例えばＭＭＲ等）、圧縮した部分画像をステップＳ４５１３において文書画像理解データ（図１６）として保存処理を行う。 Then, the image type of the partial image cut out in step S4510 is checked. If the image type is a binary image type, binary image compression processing is performed in step S4511 (for example, MMR), and the compressed partial image is converted into a document in step S4513. Storage processing is performed as image understanding data (FIG. 16).

また、ステップＳ４５１０において調べた結果が、多値画像タイプであったならば、ステップＳ４５１２において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をステップＳ４５１３において文書画像理解データ（図１６）として保存処理を行う。 If the result checked in step S4510 is a multi-value image type, multi-value image compression processing is performed in step S4512 (for example, JPEG), and the compressed partial image is converted into document image understanding data in step S4513. The storage process is performed as shown in FIG.

本例の文書画像理解データの例も図１５に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび圧縮した切り取り画像を部分画像データとして保存する。 As shown in FIG. 15, the document image comprehension data example of this example is the layout area data ID, which is the appearance order in the layout analysis data of the target layout area data, and the cut width, the cut height, and the image type. The compression method, the image size, and the compressed cut image are stored as partial image data.

次にステップＳ４５１４において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ４５０１に戻り、残りの分割領域について上記の処理を繰り返す。残りの分割領域が存在していなければ、文書画像理解データ保存処理を終了する。こうして、ステップＳ２３０５で文書画像理解データを出力する。 Next, in step S4514, it is checked whether or not there is a remaining divided area. If there is, the process returns to step S4501, and the above processing is repeated for the remaining divided areas. If there is no remaining divided area, the document image understanding data storage process is terminated. In this way, document image understanding data is output in step S2305.

本実施形態では文書画像理解保存部（４１０８）において作成・保存された文書画像理解データは文書画像理解データ出力部（４１０９）によりネットワーク上に出力され、ネットワークを介して利用先に配信される。 In the present embodiment, the document image understanding data created and stored in the document image understanding storage unit (4108) is output on the network by the document image understanding data output unit (4109), and is distributed to the usage destination via the network.

利用先では、先の第１の実施形態で説明した図４と同様の処理を行うことになる。 At the use destination, the same processing as in FIG. 4 described in the first embodiment is performed.

先ず、ステップＳ４０１において文書画像理解データ入力部（４１１１）により文書画像理解データを入力し、ステップＳ４０２において文書画像理解データから適応的に再生条件を変更しながら再生文書画像を作成する。 First, in step S401, document image understanding data is input by the document image understanding data input unit (4111). In step S402, a reproduction document image is created while changing reproduction conditions adaptively from the document image understanding data.

具体的な文書画像理解再生処理（Ｓ４０２）の例を図３４、図１０を用いて説明する。 A specific example of document image understanding reproduction processing (S402) will be described with reference to FIGS.

本例では、あらかじめ定めた言語の翻訳データがある場合は自動的に翻訳データを取り出し、再生する例を示す。 In this example, when there is translation data in a predetermined language, the translation data is automatically extracted and reproduced.

まず、ステップＳ４６０１において再生文書画像の下地となる画像を白い下地となるように作成する。 First, in step S4601, an image serving as a background of the reproduced document image is created so as to be a white background.

次に、ステップＳ４６０２において文書画像理解データを分析し、ステップＳ４６０３で、部分領域の属性が『テキスト』であるならば、ステップＳ４６０４で、文書画像理解データに翻訳データが存在するか否かを検査する。 Next, the document image understanding data is analyzed in step S4602, and if the attribute of the partial area is “text” in step S4603, it is checked in step S4604 whether translation data exists in the document image understanding data. To do.

翻訳データが存在するならば、ステップＳ４６０５において、文書画像理解データから翻訳データを取り出し、ステップＳ４６０６で翻訳文を構成している文字コードに対応するフォントパターンを、白い下地画像と合成し再生画像を作成する。 If the translation data exists, in step S4605, the translation data is extracted from the document image understanding data, and in step S4606, the font pattern corresponding to the character code constituting the translated sentence is combined with the white background image to generate the reproduced image. create.

また、翻訳データが存在しないならば、ステップＳ４６０７において、文書画像理解データから原文の文字認識データを取り出す。 If there is no translation data, original character recognition data is extracted from the document image understanding data in step S4607.

ステップＳ４６０８において、取り出した原文を白い下地画像と合成し再生画像を作成する。 In step S4608, the extracted original text is combined with a white background image to create a reproduction image.

また、ステップＳ４６０３において、部分領域の属性が『テキスト』でないのであれば、ステップＳ４６０９で文書画像理解データから画像データを取り出し、ステップＳ４６１０において、取り出した部分画像とその座標データから白い下地画像と合成し再生画像を作成する。 If the attribute of the partial area is not “text” in step S4603, image data is extracted from the document image understanding data in step S4609. In step S4610, a white background image is synthesized from the extracted partial image and its coordinate data. Then create a playback image.

再生合成処理（ステップＳ４６１０）の一例としては、第１の実施形態で説明した図１０に従って処理すれば良い。つまり、ステップＳ１００１において文書画像理解データから部分画像の画像タイプを抽出し、ステップＳ１００２において、画像タイプが『二値画像タイプ』であれば、ステップＳ１００３において、二値画像の黒、白画素をそれぞれ２４ビット多値画像の黒、白画素に変換する疑似２４ビット化の処理を行う。 As an example of the reproduction synthesis process (step S4610), the process may be performed according to FIG. 10 described in the first embodiment. That is, in step S1001, the image type of the partial image is extracted from the document image understanding data. If the image type is “binary image type” in step S1002, the black and white pixels of the binary image are respectively determined in step S1003. A pseudo 24-bit conversion process is performed to convert the 24-bit multi-valued image into black and white pixels.

本例では、二値画像の黒画素は１、白画素は０であるとしたとき、それを２４ビット多値画像の黒画素に対応させるため、黒画素はＲ＝０、Ｇ＝０、Ｂ＝０とし、白画素はＲ＝２５５、Ｇ＝２５５、Ｂ＝２５５（ＲはＲｅｄ成分、ＧはＧｒｅｅｎ成分、ＢはＢｌｕｅ成分で各８ビットの値を持つ。）としている。 In this example, when the black pixel of the binary image is 1 and the white pixel is 0, the black pixel is associated with the black pixel of the 24-bit multi-valued image, so that the black pixel has R = 0, G = 0, B The white pixels are R = 255, G = 255, and B = 255 (R is a Red component, G is a Green component, and B is a Blue component, each having a value of 8 bits).

以上のような処理によって１つの部分画像に対して再生合成処理（図３４のステップＳ４６０６，Ｓ４６０８，Ｓ４６１０）を終了したら、ステップＳ４６１１において残りの部分画像が存在しているか否かを調べ、存在していればステップＳ４６０２に戻り、残りの部分領域について上記の処理を繰り返す。 When the reproduction / combination processing (steps S4606, S4608, and S4610 in FIG. 34) is completed for one partial image by the above processing, it is checked in step S4611 whether the remaining partial images are present. If so, the process returns to step S4602, and the above processing is repeated for the remaining partial areas.

次に、図４におけるステップＳ４０３において画像出力部（４１１４）であるモノクロプリンターから再生画像を出力文書として出力する。 Next, in step S403 in FIG. 4, the reproduced image is output as an output document from the monochrome printer as the image output unit (4114).

［第１０の実施形態］
オペレータが入力時の画像およびレイアウト解析結果を確認し、その解析結果から特定領域を選択して翻訳を指示することも可能である。 [Tenth embodiment]
It is also possible for the operator to check the input image and layout analysis result, select a specific area from the analysis result, and instruct translation.

具体例を図１９（第５の実施形態で説明した）、図３５、図３６を用いて説明する。 A specific example will be described with reference to FIG. 19 (described in the fifth embodiment), FIG. 35, and FIG.

まず、ステップ２３０１で、画像入力部（４８０２）であるスキャナより文書画像を多値画像として入力する。 First, in step 2301, a document image is input as a multi-value image from a scanner which is an image input unit (4802).

次に、ステップＳ２３０２で、二値画像出力部（４８０４）により二値画像を作成し、それをもとにステップＳ２３０３において、レイアウト解析部（４８０６）により、文書画像を図やテキストや表等の各種属性毎の領域に分割しレイアウト情報（図１５）を出力する。 Next, in step S2302, a binary image is generated by the binary image output unit (4804), and in step S2303, the layout analysis unit (4806) converts the document image into a figure, text, table, or the like. The layout information (FIG. 15) is output after being divided into areas for various attributes.

そのレイアウト情報はレイアウト解析データ表示部（４８１６）により入力画像と重ねて表示されることにより、オペレータは、オペレータ指示部（４８１７）において、ポインティングデバイス等で翻訳処理する領域を指示することが可能とある。図３８は、この翻訳処理を行う領域の指定画面の一例を示している。図示では、第２カラムの上段部分をその翻訳対象として設定している（領域の指定をＯＮする）様を示している。 The layout information is displayed so as to be superimposed on the input image by the layout analysis data display unit (4816), so that the operator can instruct an area for translation processing with a pointing device or the like in the operator instruction unit (4817). is there. FIG. 38 shows an example of an area designation screen for performing this translation processing. In the figure, the upper part of the second column is set as the translation target (area specification is turned ON).

次に、ステップＳ２３０５で文書画像理解保存部（４８０８）においてレイアウト解析データならびにオペレータの指示をもとにして、入力文書元画像である多値画像及び二値画像から適応的に保存条件を変更しながら文書画像理解データを作成し保存する。 In step S2305, the document image understanding storage unit (4808) adaptively changes the storage conditions from the multi-value image and the binary image that are the input document source images based on the layout analysis data and the operator's instruction. While creating and saving document image understanding data.

具体的な文書画像理解保存処理の例を図３６のフローチャート、及び図４０、図１５、図１６に従って説明する。 A specific example of document image understanding storage processing will be described with reference to the flowchart of FIG. 36 and FIGS. 40, 15, and 16.

本例では、文書画像理解保存処理（ステップＳ２３０５）に入ったら、まずステップＳ４９０１でオペレータが設定した領域のＩＤをセットする。 In this example, when the document image understanding storage process (step S2305) is entered, first, the ID of the area set by the operator is set in step S4901.

次にステップＳ４９０２で、各分割領域のレイアウト解析データを分析して各領域の領域属性を分類する。そして、ステップＳ４９０３において領域属性がテキストを含むかどうかを判断し、テキストを含む場合にはステップＳ４９０４で認識用画像として二値画像を選択し、ステップＳ４９０５でレイアウト解析データの領域の座標データを用いて部分領域に対して文字認識処理を行い文字認識データを得る。そして、ステップＳ４９０６で文字認識データを文書画像理解データとして保存する。なお、誤って写真画像等、テキストを含まない画像については最初から翻訳不可にしておくので問題は発生しない。 In step S4902, the layout analysis data of each divided area is analyzed to classify the area attributes of each area. In step S4903, it is determined whether the region attribute includes text. If the region attribute includes text, a binary image is selected as a recognition image in step S4904, and the coordinate data of the region of the layout analysis data is used in step S4905. Then, character recognition processing is performed on the partial area to obtain character recognition data. In step S4906, the character recognition data is stored as document image understanding data. It should be noted that an image that does not include text, such as a photographic image, is made untranslatable from the beginning, so no problem occurs.

次に、ステップＳ４９０７で部分領域の領域ＩＤを検査することにより翻訳対象領域か否かを調べる。 In step S4907, the region ID of the partial region is checked to determine whether it is a translation target region.

翻訳対象領域でないならば、原文の文字認識データを保存しただけでこの領域に関する処理は終了となる。 If it is not an area to be translated, the processing relating to this area is completed simply by storing the original character recognition data.

翻訳対象領域であるならば、ステップＳ４９０８において翻訳処理を行い、ステップＳ４９０９で翻訳データを文書画像理解データとして保存する。 If it is an area to be translated, translation processing is performed in step S4908, and the translation data is stored as document image understanding data in step S4909.

また、ステップＳ４９０３において、領域属性が翻訳対象とはならない、つまり、テキストを含んでいないならば、ステップＳ４９１０で切り取り先全体画像として元画像を選択し、ステップＳ４９１１でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 In step S4903, if the region attribute is not to be translated, that is, if it does not include text, the original image is selected as the entire cut-out image in step S4910, and the coordinate data of the region of the layout analysis data is selected in step S4911. Cut out the partial image using.

ステップＳ４９１２において切り取った部分画像の画像タイプを調べ、二値画像タイプであれば、ステップＳ４９１３において二値画像用の圧縮処理を施し（例えばＭＭＲ等）、圧縮した部分画像をステップＳ４９１５において文書画像理解データ（図１６）として保存処理を行う。 The image type of the partial image cut out in step S4912 is checked. If it is a binary image type, the binary image is compressed in step S4913 (for example, MMR), and the compressed partial image is understood in step S4915. Storage processing is performed as data (FIG. 16).

また、ステップＳ４９１０において調べた結果が、多値画像タイプであったならば、ステップＳ４９１４において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をステップＳ４９１５において文書画像理解データ（図１６）として保存処理を行う。 If the result checked in step S4910 is a multi-value image type, multi-value image compression processing is performed in step S4914 (for example, JPEG), and the compressed partial image is converted into document image understanding data in step S4915. The storage process is performed as shown in FIG.

本例の文書画像理解データの例は図１６に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび圧縮した切り取り画像を部分画像データとして保存する。 In the example of the document image understanding data of this example, as shown in FIG. 16, the appearance order in the layout analysis data of the target layout area data is taken as the ID of the layout area data, the cut width, the cut height, the image type The compression method, the image size, and the compressed cut image are stored as partial image data.

次にステップＳ４９１６において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ４９０２に戻り、残りの分割領域について上記の処理を繰り返す。 Next, in step S4916, it is checked whether or not there is a remaining divided area. If it exists, the process returns to step S4902, and the above processing is repeated for the remaining divided areas.

［第１１の実施形態］
複数言語の翻訳手段をもつことにより、１つの原文の文字認識データに対して複数の翻訳データを併せ持たせることも可能である。 [Eleventh embodiment]
By providing a plurality of language translation means, it is possible to have a plurality of translation data for one original text recognition data.

［第１２の実施形態］
複数の翻訳先から、オペレータが選択した言語に翻訳させることも可能である。 [Twelfth embodiment]
It is also possible to translate from a plurality of translation destinations into the language selected by the operator.

具体例を再び図１９を用い、更に、図３５、図３７を用いて説明する。 A specific example will be described with reference to FIG. 19 again, and further with reference to FIGS.

まず、ステップＳ２３０１、画像入力部（４８０２）であるスキャナより文書画像を多値画像として入力する。 First, in step S2301, a document image is input as a multi-value image from a scanner which is an image input unit (4802).

次に、オペレータ指示部（４８１７）により、翻訳先言語を指示する。 Next, the translation language is instructed by the operator instruction unit (4817).

例えばオペレータは原文が英語の文書に対して、翻訳先言語として『日本語』を指示したものとする。本例では翻訳先は１つであるが複数の言語を指定することも可能である。 For example, it is assumed that the operator designates “Japanese” as the translation target language for a document whose original text is English. In this example, there is one translation destination, but a plurality of languages can be designated.

次いで、ステップＳ２３０２で、二値画像出力部（４８０４）により二値画像を作成し、それをもとにステップＳ２３０３において、レイアウト解析部（４８０６）により、文書画像を図やテキストや表等の各種属性毎の領域に分割しレイアウト情報（図１５）を出力する。 In step S2302, a binary image is generated by the binary image output unit (4804). Based on the binary image output unit (4804), the layout analysis unit (4806) converts the document image into various kinds of figures, texts, tables, and the like. The layout information is divided into areas for each attribute and layout information (FIG. 15) is output.

ここでのオペレータの指示としては、レイアウト解析された内容を図３９に示すように表示し、翻訳先言語を選択する処理である。本実施形態では翻訳対象はテキスト領域であるが、表等の文字を含むものをもその対象にしてもよい。 The instruction from the operator here is a process of displaying the layout-analyzed contents as shown in FIG. 39 and selecting a translation destination language. In this embodiment, the object to be translated is a text area, but an object including characters such as a table may be the object.

具体的な文書画像理解保存処理の例を図３７のフローチャート、図４０、図１５、図１６、図１７、図４１に従って説明する。 A specific example of document image understanding storage processing will be described with reference to the flowchart of FIG. 37, FIG. 40, FIG. 15, FIG. 16, FIG.

本例では、文書画像理解保存処理（ステップＳ２３０５）に入ったら、まずステップＳ５００１でオペレータが設定した翻訳先言語をセットし、翻訳処理の前準備を行う。 In this example, when the document image understanding storage process (step S2305) is entered, first, the translation destination language set by the operator in step S5001 is set, and preparation for the translation process is performed.

次にステップＳ５００２で、各分割領域のレイアウト解析データを分析して各領域の領域属性を分類する。 In step S5002, the layout analysis data of each divided area is analyzed to classify the area attributes of each area.

そして、ステップＳ５００３において領域属性が『テキスト』であるかどうかを判断する。もしそうなら、ステップＳ５００４に進み、その領域に対する認識用画像として二値画像を選択し、ステップＳ５００５でレイアウト解析データの領域の座標データを用いて部分領域に対して文字認識処理を行い、文字認識データを得る。さらに、ステップＳ５００６で、その文字認識データを文書画像理解データとして保存する。次いで、ステップ５００８に進み、翻訳処理を行い、ステップ５００９で翻訳結果も保存する（図４１）。 In step S5003, it is determined whether the region attribute is “text”. If so, the process advances to step S5004 to select a binary image as a recognition image for the area, and in step S5005, character recognition processing is performed on the partial area using the coordinate data of the area of the layout analysis data. Get the data. In step S5006, the character recognition data is stored as document image understanding data. Next, the process proceeds to step 5008 to perform translation processing, and the translation result is also saved in step 5009 (FIG. 41).

一方、ステップＳ５００３において、領域属性が『テキスト』でないと判断した場合には、ステップＳ５０１０で切り取り先全体画像として元画像を選択し、Ｓ５０１１でレイアウト解析データの領域の座標データを用いて部分画像を切り取る。 On the other hand, if it is determined in step S5003 that the region attribute is not “text”, the original image is selected as the entire cut-out image in step S5010, and the partial image is selected using the coordinate data of the region of the layout analysis data in step S5011. cut out.

次にステップＳ５０１２において切り取った画像部分の画像タイプを調べ、二値画像タイプであれば、ステップＳ５０１３において二値画像用の圧縮処理を施し（例えばＭＭＲ等）、圧縮した部分画像をステップＳ５０１５において文書画像理解データ（図１６）として保存処理を行う。 Next, the image type of the image portion cut out in step S5012 is checked. If the image type is a binary image type, a binary image compression process is performed in step S5013 (for example, MMR), and the compressed partial image is converted into a document in step S5015. Storage processing is performed as image understanding data (FIG. 16).

この時、文書画像理解データに圧縮方式も記録しておく事により、文書画像理解再生処理において圧縮画像を復号可能とする。 At this time, by recording the compression method in the document image understanding data, the compressed image can be decoded in the document image understanding reproduction process.

また、ステップＳ５０１２において調べた結果が、多値画像タイプであったならば、ステップＳ５０１４において多値画像用の圧縮処理を施し（例えばＪＰＥＧ等）、圧縮した部分画像をＳ１０１５において文書画像理解データ（図１６）として保存処理を行う。 If the result checked in step S5012 is a multi-valued image type, multi-valued image compression processing (for example, JPEG) is performed in step S5014, and the compressed partial image is converted into document image understanding data (in step S1015). The storage process is performed as shown in FIG.

さらに詳細な本例の文書画像理解データ（非テキスト画像）の例は、図１６に示すように、対象となったレイアウト領域データのレイアウト解析データ内での出現順位をレイアウト領域データのＩＤとし、切り取り幅、切り取り高さ、画像タイプ、圧縮方式、画像サイズおよび圧縮した切り取り画像を部分画像データとして保存する。 In a more detailed example of document image understanding data (non-text image) of this example, as shown in FIG. 16, the appearance order in the layout analysis data of the target layout area data is set as the layout area data ID, The cut width, the cut height, the image type, the compression method, the image size, and the compressed cut image are stored as partial image data.

次にステップＳ５０１６において残りの分割領域が存在しているか否かを調べ、存在していればステップＳ５００２に戻り、残りの分割領域について上記の処理を繰り返す。残りの分割領域が存在していなければ、文書画像理解データ保存処理を終了する。こうして、ステップＳ２３０５で文書画像理解データを出力する。 Next, in step S5016, it is checked whether or not there is a remaining divided area. If there is, the process returns to step S5002, and the above processing is repeated for the remaining divided areas. If there is no remaining divided area, the document image understanding data storage process is terminated. In this way, document image understanding data is output in step S2305.

以上説明したように本第９乃至第１２の実施形態によれば、
１）文書画像の保存時における、データ量の削減
２）文書画像の配信時における、ネットワークトラフィックへの負荷軽減
３）文書画像の保存、配信時における、再利用に適した高画質の維持
４）出力時の文書画像の画質劣化や情報欠落の防止
５）ＤＴＰ（デスクトップパプリッシング）等の電子文書への再利用の容易化
６）地球規模の文書配信時における利便性の向上
等がはかれる効果がある。 As described above, according to the ninth to twelfth embodiments,
1) Reduction of data amount when storing document images 2) Reduction of network traffic load when distributing document images 3) Maintaining high image quality suitable for reuse when storing and distributing document images 4) Prevention of image quality degradation and information loss at the time of output 5) Ease of reuse for electronic documents such as DTP (desktop publishing) 6) Improvement of convenience at the time of global document distribution, etc. .

なお、本発明は、複数の機器（例えばホストコンピュータ、インタフェイス機器、リーダ、プリンタなど）から構成されるシステムに適用しても、一つの機器からなる装置（例えば、複写機、ファクシミリ装置など）に適用してもよい。 Note that the present invention can be applied to a system including a plurality of devices (for example, a host computer, an interface device, a reader, and a printer), and a device (for example, a copying machine and a facsimile device) including a single device. You may apply to.

また、図２ではネットワークを介して情報の授受を行う例を示したが、このネットワークとしてインターネットを採用しても良いのは勿論であるし、ネットワークの形態そのものによって本願発明が限定されるものではない。 FIG. 2 shows an example in which information is exchanged via a network. However, the Internet may be adopted as this network, and the present invention is not limited by the form of the network itself. Absent.

また、上記の各実施形態を大きく分けると、第１乃至第４の実施形態、第５乃至第８の実施形態、第９乃至第１２の実施形態に分けられるが、それらを如何様にむ見合わせても良いのは勿論である。 The above embodiments can be broadly divided into the first to fourth embodiments, the fifth to eighth embodiments, and the ninth to twelfth embodiments. Of course, it may be.

また、本発明の目的は、前述した各実施形態の機能を実現するソフトウェアのプログラムコードを記録した記憶媒体（または記録媒体）を、システムあるいは装置に供給し、そのシステムあるいは装置のコンピュータ（またはCPUやMPU）が記憶媒体に格納されたプログラムコードを読み出し実行することによっても、達成されることは言うまでもない。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施形態の機能を実現することになり、そのプログラムコードを記憶した記憶媒体は本発明を構成することになる。また、コンピュータが読み出したプログラムコードを実行することにより、前述した実施形態の機能が実現されるだけでなく、そのプログラムコードの指示に基づき、コンピュータ上で稼働しているオペレーティングシステム(OS)などが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。 In addition, an object of the present invention is to supply a storage medium (or recording medium) that records a program code of software that realizes the functions of the above-described embodiments to a system or apparatus, and to perform a computer (or CPU) of the system or apparatus. Needless to say, this can also be achieved by reading and executing the program code stored in the storage medium. In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiments, and the storage medium storing the program code constitutes the present invention. Further, by executing the program code read by the computer, not only the functions of the above-described embodiments are realized, but also an operating system (OS) running on the computer based on the instruction of the program code. It goes without saying that a case where the function of the above-described embodiment is realized by performing part or all of the actual processing and the processing is included.

さらに、記憶媒体から読み出されたプログラムコードが、コンピュータに挿入された機能拡張カードやコンピュータに接続された機能拡張ユニットに備わるメモリに書込まれた後、そのプログラムコードの指示に基づき、その機能拡張カードや機能拡張ユニットに備わるCPUなどが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。 Furthermore, after the program code read from the storage medium is written into a memory provided in a function expansion card inserted into the computer or a function expansion unit connected to the computer, the function is determined based on the instruction of the program code. It goes without saying that the CPU or the like provided in the expansion card or the function expansion unit performs part or all of the actual processing and the functions of the above-described embodiments are realized by the processing.

以上説明したように第１の発明によれば、原稿のレイアウトを保持しながら、データ量を少なくでき、しかも、再生出力する際に画質劣化することを抑えることが可能になる。 As described above, according to the first invention, it is possible to reduce the amount of data while maintaining the document layout, and to suppress the deterioration of the image quality when reproducing and outputting.

また、第２の発明によれば、上記第１の発明に加えてセキュリティーに優れたものとすることができる。 Further, according to the second invention, in addition to the first invention, the security can be improved.

そして、第３の発明によれば、上記第１の発明に加えて、画像中のテキストについて自然言語の違いを吸収できるようにもなる。従って、国内外を問わずに情報を共有することができるようになる。 And according to 3rd invention, in addition to the said 1st invention, it also becomes possible to absorb the difference of a natural language about the text in an image. Therefore, it becomes possible to share information regardless of whether inside or outside the country.

第１の実施形態におけるデータの流れを示す図である。It is a figure which shows the flow of the data in 1st Embodiment. 第１の実施形態のシステム構成を示す図である。It is a figure which shows the system configuration | structure of 1st Embodiment. 第１の実施形態における画像入力から文書画像理解保存データの出力処理までの手順を示すフローチャートである。It is a flowchart which shows the procedure from the image input in 1st Embodiment to the output process of document image understanding preservation | save data. 各実施形態における文書画像理解保存データ入力から再生画像出力までの処理手順を示すフローチャートである。It is a flowchart which shows the process sequence from document image understanding preservation | save data input in each embodiment to reproduction | regeneration image output. 第１の実施形態の文書画像理解保存処理を示すフローチャートである。It is a flowchart which shows the document image understanding preservation | save process of 1st Embodiment. 第１の実施形態におけるテキスト判別処理を示すフローチャートである。It is a flowchart which shows the text discrimination | determination process in 1st Embodiment. 第１の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 1st Embodiment. 第１の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 1st Embodiment. 第１の実施形態の文書画像理解再生処理のフローチャートである。It is a flowchart of the document image understanding reproduction process of the first embodiment. 第１の実施形態の再生画像合成処理のフローチャートである。It is a flowchart of the reproduction | regeneration image composition process of 1st Embodiment. 第２の実施形態における文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process in 2nd Embodiment. 第３の実施形態における文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process in 3rd Embodiment. 第３、第４の実施形態の文書画像理解再生処理のフローチャートである。It is a flowchart of the document image understanding reproduction process of 3rd, 4th embodiment. 第１の実施形態の文書画像理解データの構造を示す図である。It is a figure which shows the structure of the document image understanding data of 1st Embodiment. 第１の実施形態のレイアウト解析データの構造を示す図である。It is a figure which shows the structure of the layout analysis data of 1st Embodiment. 第１の実施形態の画像データの構造を示す図である。It is a figure which shows the structure of the image data of 1st Embodiment. 第１の実施形態の文字認識データの構造を示す図である。It is a figure which shows the structure of the character recognition data of 1st Embodiment. 第５の実施形態におけるデータの流れを示す図である。It is a figure which shows the flow of the data in 5th Embodiment. 第５の実施形態の画像入力から文書画像理解データ出力までのフローチャートである。It is a flowchart from the image input of 5th Embodiment to document image understanding data output. 第５の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 5th Embodiment. 第５の実施形態の文書画像理解再生処理のフローチャートである。It is a flowchart of the document image understanding reproduction process of 5th Embodiment. 第６の実施形態におけるデータの流れを示す図である。It is a figure which shows the flow of the data in 6th Embodiment. 第６の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 6th Embodiment. 第７の実施形態におけるデータの流れを示す図である。It is a figure which shows the flow of the data in 7th Embodiment. 第７の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 7th Embodiment. 第８の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 8th Embodiment. 第５の実施形態の画像データの構造を示す図である。It is a figure which shows the structure of the image data of 5th Embodiment. 第５の実施形態の文字認識データの構造を示す図である。It is a figure which shows the structure of the character recognition data of 5th Embodiment. 第５の実施形態の暗号化文字認識データの構造を示す図である。It is a figure which shows the structure of the encryption character recognition data of 5th Embodiment. 第６の実施形態のＧＵＩの例を示す図である。It is a figure which shows the example of GUI of 6th Embodiment. 第７の実施形態のＧＵＩの例を示す図である。It is a figure which shows the example of GUI of 7th Embodiment. 第９の実施形態におけるデータの流れを示す図である。It is a figure which shows the flow of the data in 9th Embodiment. 第９の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 9th Embodiment. 第９の実施形態の文書画像理解再生処理のフローチャートである。It is a flowchart of the document image understanding reproduction process of 9th Embodiment. 第１０の実施形態におけるデータの流れを示す図である。It is a figure which shows the flow of the data in 10th Embodiment. 第１１の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 11th Embodiment. 第１１の実施形態の文書画像理解保存処理のフローチャートである。It is a flowchart of the document image understanding preservation | save process of 11th Embodiment. 第１０の実施形態のＧＵＩの例を示す図である。It is a figure which shows the example of GUI of 10th Embodiment. 第１１の実施形態のＧＵＩの例を示す図である。It is a figure which shows the example of GUI of 11th Embodiment. 第９の実施形態の文書画像理解データの構造を示す図である。It is a figure which shows the structure of the document image comprehension data of 9th Embodiment. 第９の実施形態の翻訳データの構造を示す図である。It is a figure which shows the structure of the translation data of 9th Embodiment.

Claims

Input means for inputting multivalued image data of a document;
Binary image generation means for generating binary image data from the input multi-value image data;
A layout analysis unit that divides the generated binary image data into regions for each attribute, and generates layout information for each of the divided regions;
First partial image cutting means for cutting a partial image of a corresponding area from the binary image data for a text attribute area based on the layout information;
Based on the layout information, second partial image cutting means for cutting a partial image of a corresponding area from the multi-valued image data for a non-text attribute area;
Encryption target attribute setting means for setting an attribute to be encrypted based on an instruction from the operator;
Of the partial images cut by the first partial image cutting unit or the second partial image cutting unit, the partial image corresponding to the attribute area set as the encryption target by the encryption target attribute setting unit is encrypted. An encryption means for generating encrypted data;
The first partial image cutout corresponding to the encrypted data generated by the encryption means corresponding to the encryption target area and the attribute area not set as the encryption target by the encryption target attribute setting means Means for storing the partial image cut by the second partial image cutting means, and the layout information;
An image processing apparatus comprising:

Input means for inputting multivalued image data of a document;
Binary image generation means for generating binary image data from the input multi-value image data;
A layout analysis unit that divides the generated binary image data into regions for each attribute, and generates layout information for each of the divided regions;
First partial image cutting means for cutting a partial image of a corresponding area from the binary image data for a text attribute area based on the layout information;
Character recognition means for recognizing characters for a partial image corresponding to the text attribute region cut by the first partial image cutting means;
Based on the layout information, second partial image cutting means for cutting a partial image of a corresponding area from the multi-valued image data for a non-text attribute area;
When the encryption target area is the text attribute area, both the character recognition data of the encryption target area recognized by the character recognition means and the partial image cut by the first partial image cutting means are used. Encryption means for encrypting and generating encrypted data;
Storage means for storing the encrypted data generated by the encryption means, the partial image cut by the second partial image cutting means, and the layout information;
An image processing apparatus comprising:

The image processing apparatus of the described the encrypted data and the partial image and the layout information stored by the storing means, an output means for outputting to another device, to claim 1 or 2, further comprising.

A binary image generation means for generating binary image data from multi-value image data of a document;
A layout analysis unit that divides the generated binary image data into regions for each attribute and generates layout information for each of the divided regions; and
A first partial image cutting step for cutting a partial image of a corresponding region from the binary image data for a text attribute region based on the layout information;
A second partial image cutting step for cutting a partial image of a corresponding area from the multi-valued image data for a non-text attribute area based on the layout information;
An encryption target attribute setting unit sets an attribute to be encrypted based on an instruction from an operator, and an encryption target attribute setting step;
Of the partial images cut by the first partial image cutting step or the second partial image cutting step, the portion corresponding to the attribute region set as the encryption target in the encryption target attribute setting step An encryption process for encrypting the image and generating encrypted data;
The storage means includes the encrypted data generated in the encryption step corresponding to the encryption target region and the attribute region not set as the encryption target in the encryption target attribute setting step . A storing step of storing the partial image cut out in the one partial image cutting step or the second partial image cutting step and the layout information;
An image processing method comprising:

A binary image generation means for generating binary image data from multi-value image data of a document;
A layout analysis unit that divides the generated binary image data into regions for each attribute and generates layout information for each of the divided regions; and
A first partial image cutting step for cutting a partial image of a corresponding region from the binary image data for a text attribute region based on the layout information;
A character recognizing step for recognizing characters for a partial image corresponding to the text attribute region cut out in the first partial image cutting step;
A second partial image cutting step for cutting a partial image of a corresponding area from the multi-valued image data for a non-text attribute area based on the layout information;
When the encryption unit is the text attribute region, the encryption unit recognizes the character recognition data of the encryption target region recognized in the character recognition step and the portion cut out in the first partial image cutting step An encryption process for encrypting both the image and generating encrypted data;
A storage unit that stores the encrypted data generated in the encryption step, the partial image cut out in the second partial image cutting step, and the layout information;
An image processing method comprising:

A storage medium for storing a computer program that can be read and executed by a computer,
Computer
Binary image generation means for generating binary image data from multivalued image data of a document;
Layout analysis means for dividing the generated binary image data into areas for each attribute and generating layout information for each of the divided areas;
First partial image cutting means for cutting a partial image of a corresponding area from the binary image data for a text attribute area based on the layout information;
Second partial image cutting means for cutting a partial image of a corresponding region from the multi-valued image data for a non-text attribute region based on the layout information;
Encryption target attribute setting means for setting an attribute to be encrypted based on an instruction from the operator,
Of the partial images cut by the first partial image cutting unit or the second partial image cutting unit, the partial image corresponding to the attribute area set as the encryption target by the encryption target attribute setting unit is encrypted. Encryption means for generating encrypted data;
The first partial image cutout corresponding to the encrypted data generated by the encryption means corresponding to the encryption target area and the attribute area not set as the encryption target by the encryption target attribute setting means Storing means for storing the partial image clipped by the second partial image cutting means and the layout information;
A storage medium for storing a computer program for functioning as a computer.

A storage medium storing a computer program that can be read and executed by a computer,
Computer
Binary image generation means for generating binary image data from multivalued image data of a document;
Layout analysis means for dividing the generated binary image data into areas for each attribute and generating layout information for each of the divided areas;
First partial image cutting means for cutting a partial image of a corresponding area from the binary image data for a text attribute area based on the layout information;
Character recognition means for recognizing characters for a partial image corresponding to the region of the text attribute cut by the first partial image cutting means;
A second partial image cutting means for cutting a partial image of a corresponding area from the multi-valued image data for a non-text attribute area based on the layout information;
When the encryption target area is the text attribute area, both the character recognition data of the encryption target area recognized by the character recognition means and the partial image cut by the first partial image cutting means are used. Encryption means for encrypting and generating encrypted data;
Storage means for storing the encrypted data generated by the encryption means, the partial image cut by the second partial image cutting means, and the layout information;
A storage medium storing a computer program for functioning as a computer.