JP3740271B2

JP3740271B2 - Text line entry area reading method and mail address recognition method

Info

Publication number: JP3740271B2
Application number: JP02807598A
Authority: JP
Inventors: 達彦影広; 昌史古賀; 好博嶋; 裕酒匂; 浩道藤澤; 成渡辺
Original assignee: Hitachi Omron Terminal Solutions Corp
Current assignee: Hitachi Channel Solutions Corp
Priority date: 1998-02-10
Filing date: 1998-02-10
Publication date: 2006-02-01
Anticipated expiration: 2018-02-10
Also published as: JPH11221527A

Description

【０００１】
【発明の属する技術分野】
本発明は、郵便物や宅配物を区分するために宛先を認識する際に、文字行を抽出し、抽出された文字行から宛名が書かれている部分を選択し、文字行認識を行い、宛名認識結果とする郵便物等宛名認識方法に関する。
【０００２】
【従来の技術】
郵便物自動読み取り装置においては、郵便物上の宛名記載領域を検出する方法として特許出願公開平５ー１５１３８９があり、これは郵便番号位置を検出してこの位置から宛名記載領域を検出する手法である。宛名記載領域から文字行を切出す方法としては特許出願公告昭６０ー４１３９６があり、これは記載されている文字等のブロックパターンの高さを検出し同じ高さのパターンを行として検出する手法である。切出された文字行の縦横書き判別法には特許出願公開昭６３ー１８７８５があり、これは宛名領域の文字群の水平方向周辺長と垂直方向周辺長を求め、これらの周辺長を比較することにより文字群の行方向を判別する。宛名行の文字認識方法では、宛名の都道府県名、市区群名、町名等の地名を読み取る手段が種々考案されており、例えば特許出願公開平２ー６４８８２では、住所の先頭から町名までの文字列と丁目番地部分の文字列において異なった文字切出し処理を行う手法で宛名行を認識している。
【０００３】
【発明が解決しようとする課題】
郵便物等の宛名行においては予め記載されている文字サイズ、文字間隔、記載手段（印刷、筆、ボールペン等）が不明である。そのため宛名として記載されている文字は多種多様な形態で記載されており、これらを予め既知の設定１つで、様々な文字で記載された宛名行を全て正確に抽出することは不可能である。宛名行を抽出する際のパラメータ設定のために、画像データから宛名行として記載されている文字の形態を判別するのは容易なことではない。
【０００４】
また、郵便物等の宛名の書式には、長辺を縦において宛名を縦方向に記載する縦置き縦書と、長辺を横にして宛名を横方向に記載する横置き横書、長辺を縦において宛名を横方向に記載する縦置き横書の書式が主に多い。これらの書式の種類の中で、郵便物の長辺を縦置きにして画像入力すると、宛名行の方向においては、縦置き縦書と横置き横書は同じ縦方向と判別される。これは、一文字一文字の方向が不明だからである。縦置き縦書の宛名は、大きな文字で記載されている場合が多く、横書横書の宛名は小さい文字で記載されている場合が多い。この２つの書式を同一のパラメータ設定で宛名行抽出処理を行い、両方とも正しい抽出結果を得るのは困難である。
【０００５】
【課題を解決するための手段】
本発明は、郵便物の宛名として記載されている文字の大きさ、文字間隔、記載方法が不明であっても、正確な宛名行抽出を可能にし、宛名認識精度の向上に有効な手法である。
【０００６】
図１で本発明で解決しようとしている課題の解決手段を解説する。101と105に郵便物の長辺を縦に置き電子化した宛名面の画像を示す。101の宛名面の書式は縦置き縦書で、105の宛名面の書式は横置き横書とである。102が101の宛名面に対し、大きい文字の文字行を抽出するパラメータ設定で文字行抽出を行った結果で、101では宛名が大きい文字で記載されているため、正しい文字行が抽出できている。この102の文字行抽出結果を用いて、宛名領域抽出を行うと103のような部分画像になり、これを用いて宛名認識を行うと正しい宛名認識結果が得られる。しかし、104のように小さい文字の文字行を抽出するパラメータ設定で文字行抽出を行うと、小さい文字で構成された文字行が存在しないため、何も抽出されない。
【０００７】
また、106が105の宛名面に対し、大きい文字用のパラメータ設定で文字行抽出した結果であり、小さい文字で記載されている文字行は複数行をまとめて１行として抽出している。この文字行抽出結果を元に宛名領域抽出を行うと107のような部分画像になる。この部分画像に対し、文字方向を判定し画像を回転すると108の部分画像が得られるが、この108の部分画像を用いて正しい宛名認識結果は得られない。また109は105の宛名面に対し、小さい文字用のパラメータ設定で文字行抽出した結果であり、宛名行は小さい文字で記載されていたため、正しく文字行が抽出されている。109の文字行抽出結果を元に宛名領域抽出を行うと、112の宛名領域が生成される。この宛名領域候補に対し文字方向判定し、回転した結果113の部分画像になる。113の部分画像に対し宛名認識処理を行うと、正しい宛名認識結果が得られる。
【０００８】
このように、宛名面に対し複数の設定値を用いて文字行抽出を行うと、複数の文字行抽出結果の中に正しい結果が含まれている可能性が高い。そこで、全ての文字行抽出結果を保管しておき、この文字行抽出結果から宛名行の記載領域の候補を複数生成する。この複数の記載領域の候補に対し、認識処理を行い認識結果を元に郵便物の宛名認識結果を決定する。
【０００９】
【発明の実施の形態】
以下、図面を参照して本発明の実施例を説明する。
【００１０】
図２に本発明を実現するハードウェア構成を示す。本発明の実施例の目的は郵便物の宛名認識による自動ソーティングである。よって、201で示すように郵便物の宛名面を入力画像として電子化するためにスキャナーがありバスにつながっている。205で示すように表示系がありここで処理中の宛名面の画像や宛名認識結果、処理モード、エラーコードを表示する。206に操作系があり、ここで操作員がマシンの操作を行う。202に宛名辞書を装備し、ここには宛名認識用に文字の標準パターンや、配達地域の住民住所データ、都道府県、市町村の階層構造情報を格納してある。203にメモリがあり、204にＣＰＵを備えてあり、このメモリとＣＰＵを用いて宛名認識用のプログラムを動作させる。このメモリとＣＰＵを用いて動作している宛名認識用のプログラムは宛名認識結果を207で示すソーティング部に入力され、認識結果に対応したビンに郵便物を誘導する。また、208で示すようにネットワーク接続部を備え、ネットワーク回線に接続され、遠隔地から宛名辞書、宛名認識用プログラム、宛名面の画像データ等のやり取りを行うことを可能としている。
【００１１】
図３に宛名認識用のプログラムの処理構成を示す。301で示すように郵便物の宛名面において長辺を縦にして、かつ左上を切手貼付位置としてスキャンした二値画像データを入力画像として、まず302で宛名行生成モジュールが処理を行い、宛名行領域候補を生成する。
【００１２】
この宛名行生成モジュール内には、303で示す文字行抽出部、304で示す宛名領域候補生成部、305で示す宛名領域候補選択部から構成されている。文字行抽出部では、宛名面の入力画像に対し近傍の連結成分毎に組み合わせを、連結成分同士の距離や、統合した際の面積の増加分、連結成分同士の直線性を検証し、統合処理を行い文字行を抽出する。その後、宛名領域候補生成部では、複数の文字行抽出結果を組み合わせる際に、文字行の大きさ、組み合わせた際の面積の増加分を検証し、宛名領域候補を生成する。次に宛名領域候補選択部では、生成された宛名領域候補の位置情報、サイズ、宛名領域候補内の文字行サイズ、文字行のばらつきを検証し、宛名領域候補毎に宛名領域らしさの得点付けを行い得点順に並べ替える。この並べ替えられた順に306の宛名行認識モジュールに入力し、宛名行認識処理を行う。この宛名行認識処理は例えば特許出願公開平２ー６４８８２で公知となっている処理を用いる。
【００１３】
次に307で示す宛名認識結果決定部で、宛名認識結果を付加された宛名領域候補を評価する。この宛名認識結果決定部では、入力された複数の宛名領域候補に付加されている認識結果を参照してアクセプトの結果が得られるまで宛名行認識処理を繰り返す。アクセプトの認識結果が得られた時点で、宛名行認識処理を止め、そのアクセプトであった認識結果を郵便物の宛先の認識結果として出力する。
【００１４】
図４に図３で示した宛名行生成モジュール内の細かい構成について説明する。402の文字行抽出部には、403のパラメータを大文字抽出用に設定した大文字用文字行抽出、404の大文字用に設定された大文字用文字行検定があり、405にはパラメータを小文字抽出用に設定した小文字用文字行抽出、406には小文字用に設定された小文字用文字行検定がある。この402の文字行抽出部内では、まず、大文字用文字行抽出処理を行い、その結果に対し大文字用文字行検定処理を行う。この検定処理を通過した文字行抽出結果のみが、大文字の文字行抽出結果としてテーブルに登録される。次に同様に小文字用文字行抽出処理を行い、その結果に対し小文字用文字行検定処理を行う。この検定処理を通過した文字行抽出結果が、小文字の文字行抽出結果としてテーブルに登録される。
【００１５】
これらの大文字用文字行抽出処理と小文字用文字行抽出処理は、文字行抽出処理内部のパラメータ設定が異なっているだけである。このパラメータ設定とは、連結成分を統合処理を行う際に参照する連結成分同士の距離、統合する際の面積の増加分の閾値の設定である。また、大文字用文字行検定と小文字用文字行検定も、文字行検定処理内部のパラメータ設定が異なっているだけである。このパラメータ設定とは、文字行抽出処理で得られた文字行矩形候補に対し検定処理を行う際に参照する、矩形の大きさ、矩形内の連結成分、矩形内の連結成分の大きさの平均、矩形内の連結成分のばらつきの閾値の設定である。
【００１６】
次に407で宛名行領域候補生成を行う。ここでは、文字行抽出部でテーブルに登録された文字行抽出結果を参照し、文字行同士の位置関係や、揃い具合を尺度にして、宛名行が記載されている領域として妥当な宛名領域候補を生成し、テーブルに登録する。
【００１７】
次に、408で宛名領域候補選択を行うが、ここではテーブルを参照し、生成された宛名領域候補のサイズ、位置、内部に含まれる文字行矩形の情報を用いて、宛名領域候補に対し宛名領域らしさの得点付けを行い得点順にソーティングし、宛名行認識部に入力する宛名領域候補が宛名として尤もらしい順に並べられて出力される。
【００１８】
図５に図４で示した文字行抽出の処理フローを示す。このフローはパラメータ設定を変えることにより、文字行抽出処理の対象となる文字サイズを変更したり、行方向を変更したりすることが可能である。この文字行抽出では、連結成分を２段階で統合し、文字行を抽出する。１段目の統合で生成される矩形を１次マージ矩形と呼び、２段目の統合で生成される矩形を行候補矩形と呼ぶ。
【００１９】
まず１段目の統合では、501で核になる連結成分を選び１次マージ矩形の上端、下端、右端、左端座標に設定する。次に502で他の連結成分と１次マージ矩形の座標を比較し最近傍の連結成分を選ぶ。503で選ばれた連結成分と１次マージ矩形との距離を判定し、一定値以下の場合、504でこの連結成分と１次マージ矩形を統合した際に面積の増加分を判定し、妥当な場合、505でこの連結成分を１次マージ矩形に登録し、１次マージ矩形の座標も更新する。次に506で１次マージ矩形が更新されたかどうかを判断し、更新されていた場合、もう一度他の連結成分と比較を行い処理を繰り返す。１次マージ矩形が更新されていなかった場合、統合可能な連結成分は残っていないため、この１次マージ矩形に関する処理は終わる。次に507で、全ての１次マージ矩形において登録されていない連結成分の存在をチェックし、存在する場合、508で核にする連結成分を登録されていない連結成分に設定し、近傍の連結成分を探索し処理を繰り返す。ここで、503と504の判定処理の閾値がパラメータ設定の値である。これらのパラメータを変更することにより、縦と横の抽出方向の切り替えや、抽出対象とする文字行内の文字サイズや文字間隔を変更できる。
【００２０】
次に前記の処理で生成された１次マージ矩形を元に行候補矩形を生成する処理フローを説明する。509で核にする１次マージ矩形を行候補矩形の上端、下端、右端、左端の座標に設定し、510で他の１次マージ矩形と縦か横で直線状に並んでいるかチェックし、並んでいる場合511で行候補矩形と１次マージ矩形との距離をチェックする。これらの２つチェックを満たした場合、512で行候補矩形を更新する。513で行候補矩形の更新をチェックし、更新されている場合は他の１次マージ矩形を比較を行い、同様に処理を繰り返す。更新されていなかった場合、この行候補矩形に統合される１次マージ矩形は存在しないため、514でこの行候補矩形を登録する。次に515で行候補に反映されていない１次マージ矩形をチェックし、反映されていないものがあれば核になる１次マージ矩形として設定し、同様に統合処理を繰り返す。ここで、510と511の判定部分の閾値がパラメータ設定である。これらのパラメータを変更することにより、縦と横の抽出方向の切り替えや、抽出対象とする文字行内の文字サイズや文字間隔を変更できる。
【００２１】
図６では文字行抽出のあと行候補矩形が文字行として適当かどうか検定を行う文字行検定の処理フローを示す。１つ１つの行候補矩形において601で横幅と602で高さをチェックし、妥当なサイズであったら、603で行候補矩形内に含まれる連結成分を探索する。ここで探索された連結成分において、604で連結成分数をチェックし、605で連結成分の平均サイズをチェックし、606で極端に大きな連結成分の存在をチェックする。これらのチェックを通過した行候補矩形は607でテーブルに文字行として登録される。608で行候補矩形が最後かチェックし、最後の行候補矩形まで同様のチェックを繰り返す。この文字行検定処理は601、602、604、605、606での判定処理での閾値がパラメータ設定となっている。これらのパラメータ設定を変更することにより、検定対象となる文字行の大きさや、文字行内の文字サイズ、文字サイズのばらつきを変えることができる。
【００２２】
図７に宛名行領域候補生成の処理フローを示す。まず701で核になる文字行を決め宛名領域候補矩形に設定する。702で宛名領域候補矩形と近傍の文字行を探索し、その近傍の文字行に対し、703で文字行が異質な大きさでないかチェックし、704で文字行を加えた時の宛名領域候補の面積増加分をチェックし、どちらのチェックでも妥当であった場合、705で文字行を追加して宛名行領域矩形を更新する。706で宛名領域矩形に更新の有無をチェックし、更新があった場合、近傍の文字行を探索し同様の処理を繰り返す。更新が生じていなかった場合709で宛名領域候補をテーブルに登録し、707で宛名領域候補に登録されていない文字行が存在するかチェックし、存在する場合708で核にする文字行を設定し直して処理を繰り返す。
【００２３】
図８では宛名領域候補選択の処理フローを示す。テーブルを参照し１つ１つの宛名領域候補に対し、801でサイズをチェックし妥当な場合802で得点を加点し、803で宛名領域の位置が差出人の記載部分に含まれているかチェックし、含まれている場合804で得点を減点する。次に805で宛名領域内の宛名行を探索し、806で宛名行数をチェックし妥当なら807で得点を加点し、808で宛名行の並び方をチェックし揃っているなら809で得点を加点し、810で宛名行のサイズが揃っているかチェックし揃っているなら811で得点を加点する。812で宛名領域候補が最後かチェックし、最後でなければ同様の得点付けを繰り返し、宛名領域子補全てに得点付けが行われる。全ての宛名領域候補に対する得点付けが終わると、813で宛名領域候補を得点の順にソーティングし、宛名行認識モジュールに入力する。宛名行認識モジュールでは、ソーティングされた順に認識処理を行い、妥当な認識結果が得られた時点で、処理中の郵便物の宛名認識結果を決定する。
【００２４】
【発明の効果】
以上説明したように本発明によると、郵便物の宛名行の抽出精度が向上し、結果として郵便物の宛名認識精度の向上につながる。多種多様な郵便物の宛名面の書式に対する処理として有効な発明である。
【図面の簡単な説明】
【図１】宛名行抽出の課題に対する解決方法の図である。
【図２】本発明の実施例におけるハードウェア構成の図である。
【図３】宛名認識処理のモジュール構成の図である。
【図４】宛名行抽出処理の内のモジュール構成の図である。
【図５】宛名行抽出処理の処理フローの図である。
【図６】宛名行検定処理の処理フローの図である。
【図７】宛名領域候補生成処理の処理フローの図である。
【図８】宛名領域候補選択処理の処理フローの図である。
【符号の説明】
301・・・宛名画像 302・・・宛名行生成モジュール 303・・・文字行抽出部 304・・・宛名領域候補生成部 305・・・宛名領域候補選択部 306・・・宛名行認識モジュール 307・・・宛名認識結果決定部 308・・・宛名認識結果[0001]
BACKGROUND OF THE INVENTION
The present invention extracts a character line when recognizing a destination in order to classify a mail item or a delivery product, selects a portion where the address is written from the extracted character line, performs character line recognition, The present invention relates to a method for recognizing a mail or the like as a result of address recognition.
[0002]
[Prior art]
In a mail automatic reading device, there is Japanese Patent Application Laid-Open No. 5-151389 as a method for detecting an address description area on a mail, and this is a technique for detecting a postal code position and detecting the address description area from this position. is there. As a method of cutting out a character line from an address description area, there is Japanese Patent Application Publication No. Sho 60-41396, which detects the height of a block pattern of a written character or the like and detects a pattern of the same height as a line. It is. Japanese Patent Application Publication No. Sho 63-18785 discloses a method for discriminating vertical and horizontal writing of cut out character lines, which calculates the horizontal and vertical peripheral lengths of character groups in the address area, and compares these peripheral lengths. Thus, the line direction of the character group is determined. In the address line character recognition method, various means for reading a place name such as a prefecture name, a city group name, and a town name of the address have been devised. For example, in Japanese Patent Application Publication No. 2-64882, the address from the head of the address to the town name is devised. The address line is recognized by a method of performing different character extraction processing between the character string and the character string at the chome address portion.
[0003]
[Problems to be solved by the invention]
In a mailing address, the character size, character spacing, and description means (printing, brush, ballpoint pen, etc.) described in advance are unknown. Therefore, the characters described as the address are described in a wide variety of forms, and it is impossible to accurately extract all the address lines described in various characters with one known setting in advance. . In order to set parameters when extracting an address line, it is not easy to determine the character form described as the address line from the image data.
[0004]
In addition, the format of the address of mails, etc. includes the vertical writing with the long side in the vertical direction and the address in the vertical direction, the horizontal writing with the long side in the horizontal direction and the horizontal writing in the horizontal direction, and the long side There are mainly many vertical and horizontal writing formats in which the address is written in the vertical direction. Among these types of formats, when an image is input with the long side of the mail piece placed vertically, in the direction of the address line, the vertically placed vertical book and the horizontally placed horizontal book are determined to be the same longitudinal direction. This is because the direction of each character is unknown. In many cases, the address of a vertically placed vertical book is written in large characters, and the address of a horizontal and horizontal book is often written in small characters. It is difficult to perform an address line extraction process for these two formats with the same parameter setting and to obtain a correct extraction result for both.
[0005]
[Means for Solving the Problems]
The present invention is an effective technique for enabling accurate address line extraction and improving address recognition accuracy even if the size, character spacing, and description method of the letters described as mail address are unknown. .
[0006]
FIG. 1 explains the means for solving the problem to be solved by the present invention. Figures 101 and 105 show the images of the mailing surface, which is the digitized long side of the mail. The format of the 101 address is vertical and the format of 105 is horizontal. As a result of performing character line extraction with a parameter setting that extracts a character line with a large character for the address surface with 102 as 101, the correct character line can be extracted because 101 is described with a character with a large address . If address area extraction is performed using the character line extraction result of 102, a partial image such as 103 is obtained. If address recognition is performed using this, a correct address recognition result is obtained. However, if character line extraction is performed with the parameter setting for extracting a character line of a small character like 104, nothing is extracted because there is no character line composed of small characters.
[0007]
Further, 106 is a result of extracting character lines with the parameter setting for large characters on the address surface of 105, and character lines described in small characters are extracted as one line by combining a plurality of lines. If the address area extraction is performed based on the character line extraction result, a partial image 107 is obtained. When the character direction is determined for this partial image and the image is rotated, 108 partial images are obtained. However, a correct address recognition result cannot be obtained using the 108 partial images. 109 is the result of character line extraction for 105 address planes with small character parameter settings. Since the address line was written in small characters, the character line was correctly extracted. If address area extraction is performed based on the 109 character line extraction results, 112 address areas are generated. The character direction is determined for this address area candidate, and the result of rotation is a partial image 113. When address recognition processing is performed on the 113 partial images, a correct address recognition result is obtained.
[0008]
As described above, when character line extraction is performed using a plurality of setting values for the address surface, there is a high possibility that a correct result is included in the plurality of character line extraction results. Therefore, all character line extraction results are stored, and a plurality of address area description area candidates are generated from the character line extraction results. A recognition process is performed on the plurality of description area candidates, and a mail address recognition result is determined based on the recognition result.
[0009]
DETAILED DESCRIPTION OF THE INVENTION
Embodiments of the present invention will be described below with reference to the drawings.
[0010]
FIG. 2 shows a hardware configuration for realizing the present invention. An object of the embodiment of the present invention is automatic sorting by mail address recognition. Therefore, as shown by 201, there is a scanner for digitizing the mail address as an input image and connected to the bus. As shown by 205, there is a display system that displays an image of the address plane being processed, an address recognition result, a processing mode, and an error code. There is an operation system at 206, where an operator operates the machine. A mail address dictionary is installed in 202, which stores standard patterns of characters for address recognition, resident address data of delivery areas, and hierarchical structure information of prefectures and municipalities. A memory 203 is provided, and a CPU 204 is provided. The address recognition program is operated using the memory and the CPU. The address recognition program operating using this memory and CPU is input to the sorting unit 207 indicating the address recognition result, and guides the mail to the bin corresponding to the recognition result. Further, as indicated by 208, a network connection unit is provided, which is connected to a network line, so that it is possible to exchange an address dictionary, an address recognition program, image data of the address surface, etc. from a remote location.
[0011]
FIG. 3 shows the processing configuration of the address recognition program. As shown in 301, the address line generation module first processes the address line generation module in 302 using binary image data scanned with the long side vertical and the upper left as the stamp attachment position as shown in 301. Generate region candidates.
[0012]
This address line generation module includes a character line extraction unit indicated by 303, an address area candidate generation part indicated by 304, and an address area candidate selection part indicated by 305. In the character line extraction unit, a combination for each connected component in the vicinity of the input image on the address plane is verified, the distance between the connected components, the increase in area when integrated, and the linearity between the connected components are verified, and the integration process To extract a character line. Thereafter, the address area candidate generation unit verifies the size of the character line and the increase in area when the character line is combined, and generates an address area candidate when combining a plurality of character line extraction results. Next, the address area candidate selection unit verifies the positional information and size of the generated address area candidates, the character line size in the address area candidates, and variations in the character lines, and assigns a score of the address area to each address area candidate. Sort by score. The information is input to the address line recognition module 306 in the rearranged order, and address line recognition processing is performed. For this address line recognition processing, for example, processing known in Japanese Patent Application Publication No. 2-6882 is used.
[0013]
Next, the address recognition result determination unit indicated by 307 evaluates the address area candidate to which the address recognition result is added. The address recognition result determination unit repeats the address line recognition process until an acceptance result is obtained with reference to the recognition results added to the plurality of input address area candidates. When the acceptance recognition result is obtained, the address line recognition process is stopped, and the recognition result that is the acceptance is output as the recognition result of the mail address.
[0014]
FIG. 4 illustrates a detailed configuration in the address line generation module shown in FIG. The 402 character line extraction unit has upper case character line extraction with 403 parameters set for upper case extraction, and upper case character line test set for 404 upper case extraction, and 405 has parameters for lower case extraction. There is a set of character line extraction for lowercase letters, and 406 has a character line test for lowercase letters set for lowercase letters. In the character line extraction unit 402, uppercase character line extraction processing is first performed, and uppercase character line verification processing is performed on the result. Only the character line extraction results that have passed this verification process are registered in the table as uppercase character line extraction results. Next, lowercase character line extraction processing is performed in the same manner, and lowercase character line verification processing is performed on the result. The character line extraction result that has passed this verification process is registered in the table as a lowercase character line extraction result.
[0015]
These upper-case character line extraction processing and lower-case character line extraction processing differ only in the parameter settings within the character line extraction processing. This parameter setting is a setting of a distance between connected components that are referred to when the connected components are integrated, and a threshold value for an increase in area when the connected components are integrated. Also, the character setting for uppercase characters and the character row verification for lowercase characters differ only in the parameter settings inside the character line verification process. This parameter setting refers to the size of the rectangle, the connected components in the rectangle, and the average size of the connected components in the rectangle that are referenced when performing the verification process on the character line rectangle candidates obtained in the character line extraction process. , Setting of a threshold of variation of connected components in a rectangle.
[0016]
Next, in 407, address line area candidate generation is performed. Here, by referring to the character line extraction result registered in the table by the character line extraction unit, the address area candidate that is appropriate as the area in which the address line is described on the basis of the positional relationship and alignment of the character lines Is created and registered in the table.
[0017]
Next, address area candidate selection is performed at 408. Here, the table is referred to, and the address area candidate is addressed using the generated address area candidate size, position, and information on the text line rectangle contained therein. Area-likeness scoring is performed, sorting is performed in the order of the scores, and addressable area candidates to be input to the addressed line recognition unit are arranged and output in the order that they are likely to be addressed.
[0018]
FIG. 5 shows a processing flow of character line extraction shown in FIG. In this flow, by changing the parameter setting, it is possible to change the character size to be subjected to the character line extraction process or change the line direction. In this character line extraction, connected components are integrated in two stages, and a character line is extracted. A rectangle generated by the first stage integration is called a primary merge rectangle, and a rectangle generated by the second stage integration is called a row candidate rectangle.
[0019]
First, in the integration of the first stage, the connected component that becomes the core is selected at 501 and set to the upper end, lower end, right end, and left end coordinates of the primary merge rectangle. Next, at 502, the coordinates of the primary merge rectangle are compared with other connected components, and the nearest connected component is selected. The distance between the connected component selected in 503 and the primary merge rectangle is determined. If the distance is less than a certain value, the increase in area is determined in 504 when the connected component and the primary merge rectangle are merged. In this case, the connected component is registered in the primary merge rectangle in 505, and the coordinates of the primary merge rectangle are also updated. Next, in 506, it is determined whether or not the primary merge rectangle has been updated. If the primary merge rectangle has been updated, it is compared again with other connected components and the process is repeated. If the primary merge rectangle has not been updated, there is no connectable component that can be integrated, and the processing related to the primary merge rectangle ends. Next, in 507, the existence of connected components that are not registered in all the primary merge rectangles is checked. If there is a connected component, the connected component to be used as a nucleus is set as an unregistered connected component in 508, and the neighboring connected components are set. And repeat the process. Here, the threshold value of the determination processing of 503 and 504 is a parameter setting value. By changing these parameters, the vertical and horizontal extraction directions can be switched, and the character size and character spacing in the character line to be extracted can be changed.
[0020]
Next, a processing flow for generating a row candidate rectangle based on the primary merge rectangle generated by the above processing will be described. In 509, the primary merge rectangle to be the core is set to the coordinates of the top, bottom, right, and left edges of the line candidate rectangle, and in 510, it is checked whether it is aligned with other primary merge rectangles vertically or horizontally and aligned. In step 511, the distance between the line candidate rectangle and the primary merge rectangle is checked. If these two checks are satisfied, the row candidate rectangle is updated at 512. In step 513, the update of the line candidate rectangle is checked. If the rectangle is updated, another primary merge rectangle is compared, and the process is repeated in the same manner. If it has not been updated, there is no primary merge rectangle to be integrated with this line candidate rectangle, so this line candidate rectangle is registered at 514. Next, in 515, the primary merge rectangle that is not reflected in the row candidates is checked, and if there is any that is not reflected, it is set as the primary merge rectangle, and the integration process is repeated in the same manner. Here, the threshold values of the judgment portions 510 and 511 are parameter settings. By changing these parameters, the vertical and horizontal extraction directions can be switched, and the character size and character spacing in the character line to be extracted can be changed.
[0021]
FIG. 6 shows a processing flow of a character line test for testing whether a line candidate rectangle after character line extraction is appropriate as a character line. In each row candidate rectangle, the width is checked at 601 and the height is checked at 602. If the size is appropriate, a connected component included in the row candidate rectangle is searched at 603. In the connected components searched here, the number of connected components is checked at 604, the average size of the connected components is checked at 605, and the presence of an extremely large connected component is checked at 606. The line candidate rectangles that have passed these checks are registered as character lines in the table at 607. In step 608, it is checked whether the line candidate rectangle is the last one, and the same check is repeated until the last line candidate rectangle. In this character line verification process, the threshold value in the determination process at 601, 602, 604, 605, and 606 is set as a parameter. By changing these parameter settings, it is possible to change the size of the character line to be verified, the character size in the character line, and the variation in character size.
[0022]
FIG. 7 shows a processing flow of address line area candidate generation. First, in 701, a character line as a core is determined and set as a candidate address area rectangle. In 702, the address area candidate rectangle and the adjacent character line are searched, and in the adjacent character line, the text line is checked for a different size in 703, and the address area candidate when adding the character line in 704 is checked. The area increase is checked, and if both checks are valid, a character line is added in 705 to update the address line area rectangle. In 706, it is checked whether or not the address area rectangle is updated. If there is an update, a nearby character line is searched and the same processing is repeated. If no update has occurred, register the address area candidate in the table in 709, check if there is a character line not registered in the address area candidate in 707, and if it exists, set the character line to be the core in 708 Correct and repeat the process.
[0023]
FIG. 8 shows a processing flow of address area candidate selection. Referring to the table, check the size of each addressable area candidate with 801, add a score with 802 if appropriate, and check with 803 whether the address area location is included in the sender's description. If it is, the score is deducted at 804. Next, search the address line in the address area at 805, check the number of address lines at 806, add a score at 807 if appropriate, add a score at 809 if check the arrangement of the address lines at 808 810, check if the address line size is complete, and if it is complete, score 811. In 812, it is checked whether the address area candidate is the last, and if it is not the last, the same scoring is repeated, and scoring is performed for all the address area assistants. When scoring is completed for all addressable area candidates, the addressable area candidates are sorted in the order of points in 813 and input to the address line recognition module. In the address line recognition module, recognition processing is performed in the sorted order, and when an appropriate recognition result is obtained, the address recognition result of the mail piece being processed is determined.
[0024]
【The invention's effect】
As described above, according to the present invention, the accuracy of extracting mail address lines is improved, and as a result, the mail address recognition accuracy is improved. This invention is effective as a process for address forms of various mail items.
[Brief description of the drawings]
FIG. 1 is a diagram of a solution to an address line extraction problem.
FIG. 2 is a diagram of a hardware configuration in an embodiment of the present invention.
FIG. 3 is a diagram showing a module configuration for address recognition processing;
FIG. 4 is a diagram showing a module configuration in address line extraction processing;
FIG. 5 is a flowchart of address line extraction processing;
FIG. 6 is a flowchart of address line verification processing;
FIG. 7 is a flowchart of address area candidate generation processing;
FIG. 8 is a flowchart of address area candidate selection processing;
[Explanation of symbols]
301: Address image 302 ... Address line generation module 303 ... Character line extraction unit 304 ... Address area candidate generation unit 305 ... Address area candidate selection unit 306 ... Address line recognition module 307 ..Address recognition result determination unit 308 ... Address recognition result

Claims

The image information on the character description surface is converted into an electrical signal and input, and the input image is repeatedly extracted multiple times with multiple parameter settings for at least one of the character size and the character spacing in the character line. A character line description area reading method comprising: extracting a plurality of character lines and extracting a character line description area by combining the plurality of character lines extracted with different parameter settings .

The image information of the address description surface such as postal items is converted into an electric signal and input, and the input image is set multiple times by setting a plurality of parameters for at least one of the character size and the character interval in the character line. Repeat extraction to extract a plurality of character lines, a combination of a plurality of character lines extracted the with different parameter settings to extract the address described region, the character identification character line of said extracted address described region A method for recognizing mail addresses, etc.

During the address area extraction candidate region address lines from the image data of one address plane is described to output a plurality extracted, the position of the region with respect to a plurality of area candidates issued extract, size Using the whole or a part of the size of the character line included in the area or the relative position information of the character line, the validity as the area where the address is described is evaluated, and the evaluation Area candidates are arranged in order of validity evaluation values, character lines in the area candidates of the arranged address lines are character-identified, and a mail address recognition result from areas candidates in which a plurality of address lines are described 3. A method for recognizing a mail or the like according to claim 2, wherein the mail address is determined.

A first step of scoring the likelihood of the address line description area based on the position information on the address surface of the area candidate in which the address line is described in the validity evaluation, and the area where the address line is described The second step of counting the number of character lines in the candidate and scoring the likelihood of the address line description area based on the number, and the coordinates of the character line rectangles in the candidate area of the address line A third step of scoring the likelihood of the address line description area according to the value deviation, and the address line description according to the size difference between the character line rectangles in the candidate area where the address line is described 4. A postal item according to claim 3, wherein the area candidates are rearranged in the order of scores, and the candidates for the areas where the addresses scored in the first to fourth steps are listed. Address recognition method.

In the extraction of the above character line, the parameter setting for the character line described in large characters with respect to the extraction result obtained by setting the character line extraction parameter setting as the extraction target for the character line extraction parameter setting. The character line extraction result that passed the character line verification process is stored in the memory, and the parameter setting of the character line extraction is set as the extraction target character line described in small characters Perform the character line verification process that is set for the character line described in small characters for the obtained extraction result, and store the character line extraction result that has passed the character line verification process in the memory, 3. A method for recognizing a mail address or the like according to claim 2, wherein a candidate for an area in which the address line is described is generated with reference to a character line extraction result stored in the memory.