JPS629958B2 - - Google Patents

Info

Publication number
JPS629958B2
Authority
JP
Japan
Prior art keywords
category
value
minimum value
difference
dissimilarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP54002469A
Other languages
Japanese (ja)
Other versions
JPS5595190A (en)
Inventor
Yukio Hoshino
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
Nippon Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Electric Co Ltd
Priority to JP246979A
Publication of JPS5595190A
Publication of JPS629958B2
Granted

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Description

DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a character recognition device, and in particular to an improvement that reduces the rejection (unreadable) rate of a character recognition device that uses the dissimilarity to standard patterns as its decision criterion.

When recognizing a character, a common approach has been as follows. The dissimilarity (or, conversely, the similarity) between the input pattern and each of a plurality of standard patterns is computed. The smallest dissimilarity D1 (hereinafter, the minimum value) and its category name N1 are obtained. Then, among the categories whose names differ from N1 (a category does not necessarily have only one standard pattern), the smallest dissimilarity D2 (hereinafter, the next-smallest value) and its category name N2 are obtained. If the minimum value D1 is not greater than a predetermined threshold T and the difference between the next-smallest value D2 and the minimum value D1 is not less than a predetermined threshold Td1, category name N1 is taken as the decided category name; if these conditions are not satisfied, the character is treated as unreadable. Japanese Laid-Open Patent Publication No. S50-156322, "Pattern Recognition Device", is one example.

However, some characters are inherently similar in shape, such as the digit zero (0) and the letter O, or, among letters, O and D. When the digit 0 is input, for example, its dissimilarity to the digit 0 and its dissimilarity to the letter O differ only very slightly, so the character often cannot be discriminated. If the threshold Td1 is lowered to reduce the rejection rate, misreads increase. Adjusting the threshold Td1 is one of the most difficult tasks in the design of a character recognition device.
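The conventional decision rule described above can be sketched as follows. This is a minimal illustration, not code from the patent: the function name, the use of `None` for "unreadable", and the numeric values in the usage note are assumptions.

```python
from typing import Optional


def conventional_decision(n1: str, d1: float, d2: float,
                          t: float, td1: float) -> Optional[str]:
    """Accept category n1 only if D1 <= T and D2 - D1 >= Td1.

    Returns n1 when accepted, or None to stand for "unreadable"
    (the None convention is an assumption of this sketch).
    """
    if d1 <= t and (d2 - d1) >= td1:
        return n1
    return None
```

With hypothetical values T = 30 and Td1 = 5, an input whose two best candidates score D1 = 10 and D2 = 12 is rejected because the gap of 2 is below Td1, which is the 0 versus O situation the text describes.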

As described above, an object of the present invention is to provide a character recognition device that reduces rejections when similar characters are present, without increasing misreads.

In the present invention, even if the difference between the next-smallest value D2 and the minimum value D1 does not exceed the threshold Td1, a relaxed test is applied when the category pair formed by N1 and N2 (the category names that produced D1 and D2) is one of the category name pairs defined in advance as similar. In that case it suffices that the difference between the next-smallest value D2 and the minimum value D1 is not less than a threshold Td2 (smaller than Td1). In addition, let D3 (hereinafter, the third-smallest value) be the smallest dissimilarity among categories other than N1 and N2. If the difference between D3 and the minimum value D1 is not less than Td1, category name N1 is output as the decision result.
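A minimal sketch of the decision rule summarized above, combined with the condition on the threshold T stated earlier for the conventional rule. The function name, the `None` return for "unreadable", and the contents of `SIMILAR_PAIRS` are illustrative assumptions.

```python
from typing import Optional, Set, Tuple

# Hypothetical table of category pairs defined in advance as similar.
SIMILAR_PAIRS: Set[Tuple[str, str]] = {
    ("0", "O"), ("O", "0"), ("8", "B"), ("B", "8"),
}


def decide(n1: str, d1: float, n2: str, d2: float, d3: float,
           t: float, td1: float, td2: float) -> Optional[str]:
    """Return n1 if accepted, otherwise None ("unreadable")."""
    if d1 > t:
        return None  # best match is not close enough to any standard pattern
    if (n1, n2) in SIMILAR_PAIRS:
        # Relaxed gap Td2 against the similar runner-up, but the full gap
        # Td1 is still required against the third candidate.
        accepted = (d2 - d1) >= td2 and (d3 - d1) >= td1
    else:
        accepted = (d2 - d1) >= td1
    return n1 if accepted else None
```

Because D3 is never smaller than D2, a similar pair that already clears Td1 on D2 - D1 also clears it on D3 - D1, so the relaxed branch never rejects a character that the conventional rule would accept.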

The present invention is described in detail below.

The figure is a block diagram showing one embodiment of the character recognition device of the present invention. Scanning and quantization means 1 quantizes one character on a document, and the quantized input pattern is set in dissimilarity detection means 2. In dissimilarity detection means 2, the input pattern is matched against the plurality of standard patterns stored in standard pattern memory 3, and the dissimilarity to each standard pattern is calculated. This dissimilarity detection means may be realized by a method that obtains a dissimilarity, as in the aforementioned Japanese Laid-Open Patent Publication No. S50-156322, "Pattern Recognition Device", or by a method that obtains a similarity, as in Japanese Laid-Open Patent Publication No. S53-32658, "Character Reading Device". In either case, dissimilarity and similarity are essentially equivalent, since the minimum dissimilarity corresponds to the maximum similarity, so this application speaks in terms of dissimilarity. Dissimilarity ranking circuit 4 orders the dissimilarities from smallest to largest and sets the first two, together with their category names, in candidate category registers 51, 52 and 53, 54. The third dissimilarity is set in register 56. The standard patterns that yield these three dissimilarities are required to have mutually different category names.
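The ranking step, which keeps the three smallest dissimilarities while requiring their category names to differ, might look like the sketch below. The data layout (a list of category and dissimilarity pairs, one per standard pattern) and the function name are assumptions for illustration.

```python
from typing import Dict, List, Tuple


def rank_candidates(scores: List[Tuple[str, float]]) -> List[Tuple[str, float]]:
    """Return up to three (category, dissimilarity) pairs, smallest first.

    A category may have several standard patterns, so only the best
    (smallest) dissimilarity per category is kept before ranking.
    """
    best: Dict[str, float] = {}
    for category, dissimilarity in scores:
        if category not in best or dissimilarity < best[category]:
            best[category] = dissimilarity
    ranked = sorted(best.items(), key=lambda item: item[1])
    return ranked[:3]
```

For example, if the digit 0 has two standard patterns scoring 12.0 and 9.0, only the 9.0 entry competes, so the second and third candidates are guaranteed to come from other categories.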

Registers 51 and 52 hold the minimum value D1 and category name N1, registers 53 and 54 hold the next-smallest value D2 and category name N2, and register 56 holds the third-smallest value D3. In this way, ranking circuit 4 and registers 51, 52, 53, 54, and 56 determine and output the candidate categories.

Category name checking circuit 61 checks whether category name N1 set in register 51 and category name N2 set in register 53 match a category name pair stored in similar category pair table 62. If a matching pair is found, "1" is set in register 63.

(Register 63 is assumed to have been reset to "0" before the checking circuit operates.) The similar category pair table stores pairs of similar categories, such as (digit 0, letter O), (letter O, digit 0), (8, B), and (B, 8). For example, when the input pattern is the digit 0, it is often the case that the digit 0 is set in register 51 and the letter O in register 53 as the candidate categories.

In this case, the category pair (digit 0, letter O) exists in the similar category pair table, so "1" is set in register 63. Alternatively, the table may store each pair only once, as in (digit 0, letter O) and (8, B), with the checking circuit swapping categories N1 and N2 and searching again.

Comparator 71 compares the minimum value D1 set in register 52 with the threshold T set in threshold register 72, and outputs the signal "1" if D1 is not greater than T.
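The table-lookup variant mentioned above, in which each similar pair is stored only once and the check is retried with N1 and N2 swapped, can be sketched as follows. The table contents and the function name are hypothetical.

```python
from typing import Set, Tuple

# Each similar pair is stored once; the reverse order is handled by the lookup.
PAIR_TABLE: Set[Tuple[str, str]] = {("0", "O"), ("8", "B")}


def is_similar_pair(n1: str, n2: str,
                    table: Set[Tuple[str, str]] = PAIR_TABLE) -> bool:
    """Return True if (n1, n2) or its swapped form (n2, n1) is in the table."""
    return (n1, n2) in table or (n2, n1) in table
```

Storing each pair once halves the table size at the cost of a second lookup, which is the trade-off the text alludes to.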

Subtractor 81 subtracts the minimum value D1 set in register 52 from the next-smallest value D2 set in register 54, and the result D2 - D1 is input to comparators 83 and 84.

Comparator 83 compares this value with the threshold Td2 set in threshold register 87 and outputs the signal "1" on signal line 831 if D2 - D1 is not less than Td2. Comparator 84 compares it with the threshold Td1 set in threshold register 86 and outputs the signal "1" on signal line 841 if D2 - D1 is not less than Td1.

Subtractor 82 subtracts the minimum value D1 set in register 52 from the third-smallest value D3 set in register 56, and the result is set in comparator 85. Comparator 85 compares D3 - D1 with the threshold Td1 set in threshold register 86 and outputs the signal "1" on signal line 851 if D3 - D1 is not less than Td1.

It is easy to arrange for the comparison results of comparators 71, 83, 84, and 85 to be available by the time the result of checking the similar category pair table is set in register 63. If output signal line 631 of register 63 and the outputs of comparators 71, 83, 84, and 85 are all "1", the output of AND gate 91 becomes "1"; this signal passes through OR gate 93 and lets category name N1 pass through gate 94. That is, when category names N1 and N2 form a similar category pair, category name N1 is accepted as the decision result if the minimum value D1 is not greater than the threshold T, the difference between the next-smallest value D2 and the minimum value D1 is not less than the threshold Td2, and the difference between the third-smallest value D3 and the minimum value D1 is not less than the threshold Td1.

If the signal "1" has not been set in register 63, that is, if signal line 631 is "0", signal line 632 becomes "1". If the comparison results of comparators 71 and 83 are then "1", the output of AND gate 92 becomes "1", and this signal passes through OR gate 93 and lets category name N1 pass through gate 94. That is, if category names N1 and N2 are not in the similar category pair table, category N1 is taken as the decision result when the minimum value D1 is not greater than the threshold T and the difference between the next-smallest value D2 and the minimum value D1 is not less than the threshold Td1.
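The gate logic above can be modeled with booleans, as in the sketch below. It is wired from the acceptance conditions as they are summarized in the text for the similar and non-similar cases, not from the figure; the signal and function names are hypothetical, and `None` stands for "unreadable".

```python
from typing import Optional


def gate_output(similar: bool, d1_le_t: bool, gap2_ge_td2: bool,
                gap2_ge_td1: bool, gap3_ge_td1: bool) -> bool:
    """Model the AND/OR stage that enables the output gate."""
    line_631 = similar        # register 63 holds "1": similar pair found
    line_632 = not similar    # complementary signal
    and_91 = line_631 and d1_le_t and gap2_ge_td2 and gap3_ge_td1
    and_92 = line_632 and d1_le_t and gap2_ge_td1
    return and_91 or and_92   # OR gate 93; True lets N1 through gate 94


def recognize(n1: str, d1: float, d2: float, d3: float, similar: bool,
              t: float, td1: float, td2: float) -> Optional[str]:
    """Evaluate the comparator conditions and apply the gate model."""
    enable = gate_output(
        similar,
        d1 <= t,
        (d2 - d1) >= td2,
        (d2 - d1) >= td1,
        (d3 - d1) >= td1,
    )
    return n1 if enable else None
```

With hypothetical thresholds T = 30, Td1 = 5, and Td2 = 1, candidates scoring D1 = 10, D2 = 12, D3 = 30 are accepted when the top two categories form a similar pair and rejected otherwise.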

Thus a character recognition device is obtained in which, even when the characters to be read include inherently similar characters such as the digit 0 and the letter O, the read rate for those similar characters is not degraded.

The above description covers only printed alphanumeric characters, but the invention can also be applied to the recognition of printed kanji and katakana characters. It can also be used where features are extracted from handwritten characters and the decision is made from the distance to standard patterns, as described in "Machine Recognition of Handwritten Katakana Characters and Numerals: Phase Line Segment Method and Automatic Concept Formation", Transactions of the Institute of Electronics and Communication Engineers of Japan, 76/6, Vol. J59-D, No. 6.

[Brief description of the drawing]

The figure is a block diagram showing one embodiment of the character recognition device of the present invention. In the figure, 1 denotes scanning and quantization means; 2, dissimilarity detection means; 3, a standard pattern memory; 4, a ranking circuit; 51, 52, 53, 54, and 56, registers; 61, a category name search circuit; 62, a similar category name table; 81 and 82, subtractors; 71, 83, 84, and 85, comparators; 63, 86, 87, and 72, registers; 91 and 92, AND gates; 93, an OR gate; and 94, an AND gate.

Claims (1)

[Claims] 1. A character recognition device comprising: dissimilarity detection means for obtaining the dissimilarity between an input pattern and each of a plurality of standard patterns having category names; candidate category determination means for obtaining the minimum dissimilarity D1 among said dissimilarities and the category name N1 of its standard pattern, the smallest dissimilarity D2 among categories other than category name N1 (hereinafter, the next-smallest value D2) and the category name N2 of its standard pattern, and the smallest dissimilarity D3 among categories other than categories N1 and N2 (hereinafter, the third-smallest value D3); similar category pair checking means for checking whether the determined categories N1 and N2 constitute a similar category pair; and decision means which, when the similar category pair checking means finds that categories N1 and N2 constitute a similar category pair, accepts and outputs category name N1 as the decision result if three conditions are satisfied, namely that the minimum value D1 is not greater than a predetermined threshold T, that the difference between the third-smallest value D3 and the minimum value D1 is not less than a predetermined threshold Td1, and that the difference between the next-smallest value D2 and the minimum value D1 is not less than a predetermined threshold Td2 (< Td1), and which, when categories N1 and N2 are found not to constitute a similar category pair, accepts and outputs category name N1 as the decision result if two conditions are satisfied, namely that the minimum value D1 is not greater than the predetermined threshold T and that the difference between the next-smallest value D2 and the minimum value D1 is not less than the predetermined threshold Td1.
JP246979A 1979-01-10 1979-01-10 Character recognition unit Granted JPS5595190A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP246979A JPS5595190A (en) 1979-01-10 1979-01-10 Character recognition unit

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP246979A JPS5595190A (en) 1979-01-10 1979-01-10 Character recognition unit

Publications (2)

Publication Number Publication Date
JPS5595190A JPS5595190A (en) 1980-07-19
JPS629958B2 JPS629958B2 (en) 1987-03-03

Family

ID=11530167

Family Applications (1)

Application Number Title Priority Date Filing Date
JP246979A Granted JPS5595190A (en) 1979-01-10 1979-01-10 Character recognition unit

Country Status (1)

Country Link
JP (1) JPS5595190A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0264358U (en) * 1988-11-05 1990-05-15
JPH0411387U (en) * 1990-05-21 1992-01-30
JPH0411388U (en) * 1990-05-21 1992-01-30

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63263591A (en) * 1987-04-21 1988-10-31 Nec Corp Character recognizing circuit

Also Published As

Publication number Publication date
JPS5595190A (en) 1980-07-19

Similar Documents

Publication Publication Date Title
US4989258A (en) Character recognition apparatus
JPS6156553B2 (en)
US3810093A (en) Character recognizing system employing category comparison and product value summation
JPS629958B2 (en)
US4769851A (en) Apparatus for recognizing characters
Chaudhuri et al. OCR error detection and correction of an inflectional indian language script
JPH11232296A (en) Document filing system and document filing method
KR100332752B1 (en) Character recognition method
JPS60138689A (en) Character recognizing method
JP3157557B2 (en) Character recognition device
JPS59158482A (en) Character recognizing device
JPH01171080A (en) Recognizing device for error automatically correcting character
JPH0580710B2 (en)
JPS61114388A (en) Character input device
JPS6111886A (en) Character recognition system
JP3033904B2 (en) Character recognition post-processing method
JPH0540854A (en) Post-processing method for character recognizing result
JP3659688B2 (en) Character recognition device
JPH03224079A (en) Character recognizer
Oodaira et al. Detection of a Key String from Scene Images Using Saliency
JPH01191992A (en) character recognition device
JPH04242494A (en) Optical character recognizing device
JPS6235153B2 (en)
JPS60138688A (en) Character recognizing method
JPH06243294A (en) Character recognition postprocessing device