JPS6343789B2 - - Google Patents

Info

Publication number
JPS6343789B2
Authority
JP
Japan
Prior art keywords
character
pattern
character string
pitch
projected
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP55125762A
Other languages
Japanese (ja)
Other versions
JPS5750076A (en)
Inventor
Yoshiaki Kurosawa
Kunio Murashige
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Tokyo Shibaura Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tokyo Shibaura Electric Co Ltd
Priority to JP55125762A
Publication of JPS5750076A
Publication of JPS6343789B2
Granted

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/148 Segmentation of character regions
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Input (AREA)

Description

DETAILED DESCRIPTION OF THE INVENTION
The present invention relates to a character reading device, and in particular to one that can effectively cut out and read the individual character patterns of a character string pattern whose average arrangement pitch is known.

In recent years, various character reading devices, beginning with OCR, have been developed and contribute greatly to business processing.
In general, a character string can be regarded as a group of characters, each written in one of a series of character frames arranged at a predetermined pitch. When such a string is read and recognized, the characters must first be detected and cut out. Cutting out the character frames accurately is very important for recognition, and conventionally it has been done, for example, as follows.

In one method, marks indicating the character positions are placed in advance on the medium, such as a form, and are detected in order to locate and cut out the characters. In another, the extent of each character itself is detected to determine the character position before cutting out.
In the former method, however, the extra marks on the medium complicate handling of the medium, for example a form, and when the marks are small, noise such as adhering dust may cause the position to be recognized incorrectly. Moreover, removing the influence of such noise requires a fairly complicated mechanism, so the method is of limited practical use. In the latter method, characters such as "1" and "I", which do not fill the character frame, produce large errors in the detected cut-out position, and as a result other characters may be clipped on one side (a so-called one-sided cut). Furthermore, because the position is detected from the very characters that are to be read, the device configuration and the detection processing become complicated, and the method is also vulnerable to noise.

The present invention was made in view of these circumstances. Its object is to provide a highly practical character reading device that can cut out a character string of known average arrangement pitch simply, stably, and accurately.

Embodiments of the present invention are described below with reference to the drawings.

FIG. 1 shows an example of a character string pattern whose average arrangement pitch is known in advance. Each character 1 can be regarded as written or printed within a character frame 2, the frames being arranged at a predetermined pitch. As the figure illustrates, the characters 1 are a mixture of those written only in the central part of frame 2, such as "I", and those that spread over the whole of frame 2, such as "W".

For such a character string pattern, a cut-out reference position is first set provisionally, and the string pattern is cut at the known average arrangement pitch starting from that position, as shown by the dash-dot lines in FIG. 1. The patterns cut out in this way are then superimposed as shown in FIG. 2a, and the projection component of the superimposed pattern is obtained as shown in FIG. 2b. The superimposition is performed over, for example, five or six patterns.
The projection component of the superimposed pattern reflects the provisional cut positions. Because each character 1 is usually split in two by a provisional cut, the projection tends to concentrate at both sides of the superimposed pattern, and a blank portion corresponding to the gap between character frames 2 appears between these two heavily populated sides. Incidentally, if the provisionally set reference position happens to coincide with a gap between frames 2, the projection component appears in the central part of the superimposed pattern and blank portions appear on both sides.
The blank portion is detected by, for example, discriminating the projection component at a predetermined level and treating components at or below that level as noise caused by dust or by strokes protruding from the frame. The blank portion found in this way corresponds to the true cut-out position of the character string pattern. Therefore, calculating the distance L between the blank portion and the end of the superimposed pattern gives the deviation L of the provisionally set reference position. The reference position is corrected to the true reference position on the basis of L, and cutting the string pattern of FIG. 1 as shown by the broken lines then cuts out each character 1 accurately.
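The procedure described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the patented circuit: the function names are hypothetical, the superimposition is taken to be a simple sum, the projection is taken before the cells are added together, the blank portion is searched for circularly so that a blank split across both ends of the superimposed pattern is found as one run, and the cut is placed at the centre of the blank. None of these details is fixed by the description.

```python
def column_projection(image):
    """Ink count per column of a binary image given as a list of equal-length rows."""
    return [sum(column) for column in zip(*image)]


def folded_projection(profile, pitch, origin, n_cells):
    """Cut `profile` into `n_cells` cells of width `pitch`, starting at the
    provisional reference position `origin`, and add the cells bin by bin."""
    folded = [0] * pitch
    for k in range(n_cells):
        for i in range(pitch):
            x = origin + k * pitch + i
            if 0 <= x < len(profile):
                folded[i] += profile[x]
    return folded


def longest_blank_run(folded, level):
    """Return (start, length) of the longest run of bins at or below `level`.
    The run may wrap around the end of the list (an assumption of this sketch)."""
    n = len(folded)
    blank = [value <= level for value in folded]
    if all(blank):
        return 0, n
    first_ink = blank.index(False)
    best_start, best_len = 0, 0
    run_start, run_len = 0, 0
    # Start just after an inked bin so that a wrapped run is counted once.
    for step in range(1, n + 1):
        i = (first_ink + step) % n
        if blank[i]:
            if run_len == 0:
                run_start = i
            run_len += 1
            if run_len > best_len:
                best_start, best_len = run_start, run_len
        else:
            run_len = 0
    return best_start, best_len


def estimate_offset(profile, pitch, origin, n_cells=6, level=0):
    """Deviation L of the provisional reference position, measured from the
    start of the superimposed pattern to the centre of the blank portion."""
    folded = folded_projection(profile, pitch, origin, n_cells)
    start, length = longest_blank_run(folded, level)
    if length == 0:
        return None  # no blank portion found at this threshold
    return (start + length // 2) % pitch
```

Under these assumptions the corrected reference position is `origin + estimate_offset(...)`, and the string is then cut at every multiple of the pitch from that position.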

This series of signal processing is performed, for example, by the position detection unit whose schematic configuration is shown in FIG. 3. The image signal of the character string pattern is first stored in an image memory 11 and, under the control of a cut-out position calculation circuit 12 consisting of a CPU or the like, is cut at the predetermined average arrangement pitch as described above. The patterns cut out in this way and output from the image memory 11 are fed to a projection component calculation circuit 13, which obtains their projection components and superimposes them across the patterns. Alternatively, the projection component calculation circuit 13 may superimpose the patterns first and then obtain the projection component. The projection component of the superimposed pattern obtained by circuit 13 is fed to the cut-out position calculation circuit 12, which determines the deviation L described above. The provisionally set reference position is corrected by this deviation L, and the resulting position signal is output. At the same time, the character string pattern is cut out using the corrected position as the reference.
The image memory 11 may instead store only the projection component of the character string pattern in advance and supply it to the calculation circuit 13 in cut-out segments.
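The remark that the projection may be taken either before or after superimposition can be checked with a small sketch. It assumes that superimposition means a pixel-wise sum of the cut-out cells; the description does not say whether a sum or a logical OR is intended, and with a logical OR the two orders would generally give different results. All function names here are hypothetical.

```python
def column_projection(image):
    """Ink count per column of an image given as a list of equal-length rows."""
    return [sum(column) for column in zip(*image)]


def cut_cells(image, pitch, origin, n_cells):
    """Slice the image into `n_cells` cell images, each `pitch` columns wide."""
    return [
        [row[origin + k * pitch: origin + (k + 1) * pitch] for row in image]
        for k in range(n_cells)
    ]


def superimpose(cells):
    """Pixel-wise sum of equally sized cell images (assumed superimposition rule)."""
    return [[sum(pixels) for pixels in zip(*rows)] for rows in zip(*cells)]


def project_then_superimpose(cells):
    """Project each cell first, then add the projections bin by bin."""
    projections = [column_projection(cell) for cell in cells]
    return [sum(values) for values in zip(*projections)]


def superimpose_then_project(cells):
    """Add the cell images first, then project the summed image."""
    return column_projection(superimpose(cells))
```

Because both the projection and the sum are plain additions, the two orders agree for any input under this assumption, which is also why storing only the projection component in the image memory is sufficient.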

Thus, according to this device, the blank portion of a character string pattern of known average arrangement pitch, that is, the inter-character gap, is found from the blank portion of the superimposed pattern and used to determine the cut-out position, and the string pattern is cut out accordingly, so the cut-out position is very accurate. Moreover, because the position is determined from the projection component of the superimposed, provisionally cut patterns, the position can be detected stably and reliably even when some characters do not fill the character frame. In addition, dust and protruding strokes have almost no effect on the projection component of the superimposed pattern, so detection with a good S/N ratio is possible. Consequently, the device does not need position detection marks on the medium as conventional devices do, and it is less susceptible to noise, so the specifications for media such as forms can be relaxed considerably and the media become easier to handle. If a function for automatically detecting the average arrangement pitch is added, the device can handle a variety of character string patterns, which greatly broadens its range of use. The device also simplifies the subsequent recognition processing of each character.
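The description does not say how an automatic pitch detection function would work. Purely as an illustration, one common approach is to take the lag at which the projection profile of the text line correlates best with a shifted copy of itself. The sketch below assumes that approach; the function name and the search bounds `min_pitch` and `max_pitch` are hypothetical and would have to be chosen so that the range excludes multiples of the true pitch.

```python
def estimate_pitch(profile, min_pitch, max_pitch):
    """Return the lag in [min_pitch, max_pitch] at which the unnormalised
    autocorrelation of the projection profile is largest."""
    def autocorrelation(lag):
        # Pairs profile[x] with profile[x + lag]; zip stops at the shorter list.
        return sum(a * b for a, b in zip(profile, profile[lag:]))

    return max(range(min_pitch, max_pitch + 1), key=autocorrelation)
```

On short or very irregular lines the peak can be weak, so in practice such an estimate would probably need averaging over several lines or a sanity check against an expected range.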

The present invention is not limited to the above embodiment. For example, various methods of recognizing the average arrangement pitch may be adopted, or the pitch may be determined in advance. The number of patterns that make up the superimposed pattern used for position detection may be chosen as appropriate, and the characters are not limited to the Latin letters shown in the embodiment. Position detection may be performed in the vertical direction as well as in the horizontal direction of the embodiment, and the two may be combined to read a two-dimensionally arranged character string pattern. In short, the invention can be practiced with various modifications without departing from its gist.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows an example of a character string pattern with a predetermined arrangement pitch; FIGS. 2a and 2b show a superimposed pattern and its projection component; and FIG. 3 is a schematic configuration diagram of the position detection unit.
1: character, 2: frame, 11: image memory, 12: cut-out position calculation circuit, 13: projection component calculation circuit.

Claims (1)

[Claims]
1. A character reading device comprising: means for provisionally setting a cut-out reference position and cutting out a character string pattern of known average arrangement pitch at that pitch; means for superimposing the plurality of patterns cut out by that means and obtaining the projection component of the superimposed pattern; means for removing the influence of noise from the projection component and finding a blank portion judged to contain no written characters; means for correcting the provisionally set reference position on the basis of the position of the blank portion and the position of the end of the superimposed pattern; and means for sequentially cutting out and recognizing the character string pattern at the average arrangement pitch from the corrected reference position.
JP55125762A 1980-09-10 1980-09-10 Character reader Granted JPS5750076A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP55125762A JPS5750076A (en) 1980-09-10 1980-09-10 Character reader

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP55125762A JPS5750076A (en) 1980-09-10 1980-09-10 Character reader

Publications (2)

Publication Number Publication Date
JPS5750076A (en) 1982-03-24
JPS6343789B2 (en) 1988-09-01

Family

ID=14918186

Family Applications (1)

Application Number Title Priority Date Filing Date
JP55125762A Granted JPS5750076A (en) 1980-09-10 1980-09-10 Character reader

Country Status (1)

Country Link
JP (1) JPS5750076A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0473589U (en) * 1990-11-07 1992-06-29

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60146376A (en) * 1984-01-11 1985-08-02 Nec Corp Device for segmenting character
JPS6139172A (en) * 1984-07-31 1986-02-25 Toshiba Corp Character detecting and cutting out system
JPS6172373A (en) * 1984-09-17 1986-04-14 Fujitsu Ltd Character recognizer
JPH0727541B2 (en) * 1985-06-26 1995-03-29 株式会社日立製作所 Optical character reading method

Also Published As

Publication number Publication date
JPS5750076A (en) 1982-03-24

Similar Documents

Publication Publication Date Title
US5229589A (en) Questionnaire scanning system employing expandable answer mark areas for efficient scanning and mark detection
JPS6077279A (en) Initiation of character image
JPS6343789B2 (en)
GB2027963A (en) Automatic mark recognition
JPS5824265A (en) Mark read system using facsimile transmitter
JPS6111886A (en) Character recognition system
JPH051510B2 (en)
JPS6027436B2 (en) Character recognition correction method
JPH07192087A (en) Optical character reader
JPS59180783A (en) Optical character reader
JPS61100877A (en) Method of detecting deficiency of graphic in graphic recognizing device
JP2511131B2 (en) Character recognition device
JP2663468B2 (en) Character recognition method
JPS5854431B2 (en) Misreading detection method for character recognition device
JPS6010671B2 (en) pattern reading device
JPH0221385A (en) printer device
JPS61165188A (en) Symbol recognizing system
JPS59206989A (en) Letter segmenting device
JPH0321950B2 (en)
JPH06301814A (en) Character reader
JPS5920082A (en) Character recognizing system
JP2511131C (en)
JPS6139171A (en) Character recognizing system
JPS61296484A (en) Reading method for optical character
JPH10222810A (en) Magnetic stripe reader