Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
JP2749692B2 - Standard pattern creation method - Google Patents
[go: Go Back, main page]

JP2749692B2 - Standard pattern creation method - Google Patents

Standard pattern creation method

Info

Publication number
JP2749692B2
JP2749692B2 JP2057993A JP5799390A JP2749692B2 JP 2749692 B2 JP2749692 B2 JP 2749692B2 JP 2057993 A JP2057993 A JP 2057993A JP 5799390 A JP5799390 A JP 5799390A JP 2749692 B2 JP2749692 B2 JP 2749692B2
Authority
JP
Japan
Prior art keywords
pattern
standard pattern
feature
character
sum
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
JP2057993A
Other languages
Japanese (ja)
Other versions
JPH03260888A (en
Inventor
伸二 松井
保夫 本郷
哲夫 木内
章子 紺野
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuji Electric Co Ltd
Fuji Facom Corp
Original Assignee
Fuji Electric Co Ltd
Fuji Facom Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuji Electric Co Ltd, Fuji Facom Corp filed Critical Fuji Electric Co Ltd
Priority to JP2057993A priority Critical patent/JP2749692B2/en
Publication of JPH03260888A publication Critical patent/JPH03260888A/en
Application granted granted Critical
Publication of JP2749692B2 publication Critical patent/JP2749692B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は、パターンマッチングによる文字認識処理に
用いる標準パターンの作成方法に関する。
Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for creating a standard pattern used for character recognition processing by pattern matching.

〔従来の技術〕[Conventional technology]

パターンマッチングを用いた一般的な文字認識処理手
順を第2図に示す。同図において、ステップS1は文字画
像入力、ステップS2は特徴抽出、ステップS3は字種判
別、を示す。
FIG. 2 shows a general character recognition processing procedure using pattern matching. In the figure, step S1 shows input of a character image, step S2 shows feature extraction, and step S3 shows character type determination.

ここで、字種判別部(ステップS3)では、認識対象と
なる文字ごとにあらかじめ用意された標準パターンと入
力画像から抽出された特徴パターンとの間で類似度が算
出され、入力文字の字種が判別される。
Here, the character type determination unit (step S3) calculates a similarity between a standard pattern prepared in advance for each character to be recognized and a feature pattern extracted from the input image, and calculates the character type of the input character. Is determined.

類似度Gは、標準パターンと入力画像の特徴パターン
それぞれをn次元のベクトルS(n),P(n)として、
以下の式から求められる。
The similarity G is obtained by using an n-dimensional vector S (n), P (n) as a standard pattern and a feature pattern of an input image, respectively.
It is obtained from the following equation.

ここで、(S,P)=ΣiS(i)P(i)であり、ベク
トルの内積をあらわす。(S,S)、(P,P)はそれぞれ、
標準パターン、特徴パターンの自己相関値と呼ばれる。
Here, (S, P) = Σ i S (i) P (i), which represents the inner product of the vectors. (S, S) and (P, P)
These are called autocorrelation values of standard patterns and feature patterns.

このようにして使用される標準パターンの一般的な作
成手順を第3図に示す。
FIG. 3 shows a general procedure for creating a standard pattern used in this manner.

同図に見られるように、ステップS1で得た複数のサン
プルからの特徴抽出をステップS2で行い、その結果をス
テップS3で各要素ごとに足し込み、ステップS4で得られ
る総和パターンに対し、必要に応じた濃度変換をステッ
プS5で施して標準パターンを得る。ここで言う濃度とは
特徴要素の大きさをさす。以後同様とする。
As can be seen from the figure, feature extraction from the multiple samples obtained in step S1 is performed in step S2, and the result is added for each element in step S3, and the necessary sum is obtained for the total pattern obtained in step S4. Is performed in step S5 to obtain a standard pattern. The density here refers to the size of the characteristic element. The same applies hereinafter.

一般に濃度変換としては、それぞれの標準パターンの
中の最大要素の所定の値Fに揃えるように、他の要素も
それに比例して一律的に大きさを変換する処理が行われ
る。複数ある字種の中のj番目字種の総和パターンHj
(n)の最大要素をMjとすれば、標準パターンS
j(n)のi番目の要素は、 Sj(i)=Hj(i)・F/Mj …(1) として得られる。
Generally, as the density conversion, a process of uniformly changing the size of other elements in proportion to the predetermined value F of the largest element in each standard pattern is performed. Sum pattern H j of the j-th character type among a plurality of character types
If the maximum element of (n) is M j , the standard pattern S
The i-th element of j (n) is obtained as Sj (i) = Hj (i) .F / Mj (1).

これは認識処理を行う装置の有効桁数や、標準パター
ンの格納領域の制約から、限られたビット幅を有効に使
うためである。または、入力画像の特徴パターンと標準
パターンの濃度のオーダを一致させるために、総和パタ
ーンの各要素をサンプル数で割る処理が行われる。
This is because a limited bit width is effectively used due to the number of significant digits of the device that performs the recognition process and restrictions on the storage area of the standard pattern. Alternatively, in order to match the order of the density of the characteristic pattern of the input image with the density of the standard pattern, a process of dividing each element of the total pattern by the number of samples is performed.

〔発明が解決しようとする課題〕 所で、このようにして作成された標準パターンが、結
果的に、少数の突出した値の特徴要素をもつことがあ
り、その場合、これらの突出した要素が字種の判別に大
きな影響を与えることが判明した。すなわち、文字の変
動に弱くなり、認識率が低くなる。
[Problems to be Solved by the Invention] By the way, the standard pattern created in this way may have a small number of feature elements of prominent values as a result, in which case, these prominent elements It has been found that this has a great effect on the discrimination of the character type. In other words, the character is less susceptible to fluctuations and the recognition rate is lower.

第4図は、従来手法で作成された標準パターンにおけ
る濃度分布を示したグラフであるが、横軸に沿って濃度
255、208及び207の所にそれぞれ、1個の画素が存在
し、少数の突出した値の特徴要素になっていることが分
かる。
FIG. 4 is a graph showing the density distribution in the standard pattern created by the conventional method.
It can be seen that there is one pixel at each of 255, 208 and 207, which is a feature element with a small number of prominent values.

本発明の目的は、かかる問題点を改善し、少数の突出
した値の特徴要素をもつ標準パターンの場合には、それ
らの特徴要素をうまく処理することにより、認識率の低
下を招かないようにした標準パターンの作成方法を提供
することにある。
An object of the present invention is to improve such a problem, and in the case of a standard pattern having a small number of feature elements having a prominent value, by processing those feature elements well so as not to reduce the recognition rate. Another object of the present invention is to provide a method for creating a standard pattern.

〔課題を解決するための手段〕[Means for solving the problem]

標準パターン作成時、各サンプルの特徴抽出結果を足
し込んだ総和パターンが、少数の突出した特徴要素をも
つとき、該総和パターンの各特徴要素に対し、a乗根
(a>1.0)をとる演算や対数に類似した関数f(x)
を用いた演算をほどこすことにより、第6図に示すよう
な濃度の非線形変換を行い、この後、必要に応じた濃度
の線形変換を行う。
When creating a standard pattern, if the sum pattern obtained by adding the feature extraction results of each sample has a small number of prominent feature elements, an operation that takes the a-th root (a> 1.0) for each feature element of the sum pattern Or a function f (x) similar to a logarithm
, A nonlinear conversion of density as shown in FIG. 6 is performed, and then a linear conversion of density is performed as necessary.

f(x)は、任意のx(≧0)に対して以下の条件を
満たすような凸関数である。
f (x) is a convex function that satisfies the following condition for an arbitrary x (≧ 0).

f(x)−f(x−Δx)≧f(x+Δx)−f(x)
≧0 …(2) さらに、以下の条件を満たす。
f (x) −f (x−Δx) ≧ f (x + Δx) −f (x)
≧ 0 (2) Further, the following condition is satisfied.

ここで、Δxは任意の正の実数であり、関数の増分を
表す。
Here, Δx is an arbitrary positive real number and represents an increment of a function.

〔作用〕[Action]

上記の如き非線形変換を施すことにより、突出した特
徴要素を、そうでない特徴要素よりも一段と押え込むこ
とが可能になるから、これにより、文字の変動に強い安
定した標準パターンを得ることができ、従来と同様の処
理を用いた認識処理において認識率を向上させることが
できる。
By performing the non-linear conversion as described above, it becomes possible to further press down the prominent feature elements more than the other feature elements, so that it is possible to obtain a stable standard pattern that is strong against character fluctuations, The recognition rate can be improved in the recognition processing using the same processing as the related art.

〔実施例〕〔Example〕

第1図は本発明の一実施例としての標準パターン作成
方法を示すフローチャートである。同図に見られるよう
に、各サンプル画像に対し順次特徴抽出処理が施され足
し込まれる。この総和パターンに対して、ステップS5に
見られる如く、各特徴要素のa乗根(aは1.0より大き
な実数)をとり、その結果で新たな総和パターンを得
る。すなわち、a乗根をとる演算は、下記に示すような
非線形関数f(x)を用いた演算により、非線形な濃度
変換を行うことを意味する。
FIG. 1 is a flowchart showing a standard pattern creating method as one embodiment of the present invention. As can be seen from the figure, feature extraction processing is sequentially performed on each sample image and added. For this sum pattern, as shown in step S5, the a-th root of each feature element (a is a real number larger than 1.0) is obtained, and a new sum pattern is obtained as a result. In other words, the calculation that takes the a-th root means performing a non-linear density conversion by a calculation using a non-linear function f (x) as described below.

f(x)=x1/a …(4) その後、更に、従来も行われていた濃度変換(第1図
のステップS6或いは第3図のステップS5)として、前記
(1)式を用いた濃度変換が行われる。ここでは、各特
徴要素のビット長を8ビットに圧縮するためF=255と
して一律的な濃度変換を行っている。従来手法と異なる
のは各特徴要素のa乗根をとって非線形変換を行う点で
ある。
f (x) = x 1 / a (4) Thereafter, the above-mentioned equation (1) is used as density conversion (step S6 in FIG. 1 or step S5 in FIG. 3) which has been conventionally performed. Density conversion is performed. Here, uniform density conversion is performed with F = 255 in order to compress the bit length of each feature element to 8 bits. The difference from the conventional method is that nonlinear transformation is performed by taking the a-th root of each feature element.

従来手法と本手法(a=4)を用いた場合の標準パタ
ーンの例を、“あ”という文字について、学習サンプル
数80個で、特徴抽出手法として、(16×16)メッシュに
正規化された画像の輪郭画素を白画素を挟んだ輪郭画素
間の相対距離で複数の特徴面(例えば15面)に割り付け
る手法(特願平成1年第71757号)を用いて作成し比較
してみると、後者の方が、突出した特徴要素を低く抑
え、その後、全体をカサ上げするように濃度変換してい
るので、全体に濃度が濃くなっているのが分かる。
An example of a standard pattern using the conventional method and the present method (a = 4) is as follows. The character “A” is normalized to a (16 × 16) mesh as a feature extraction method using 80 learning samples. Created using a method (Japanese Patent Application No. 71757) that assigns the contour pixels of the image to a plurality of feature planes (for example, 15 planes) based on the relative distance between the contour pixels sandwiching the white pixels. In the latter, since the prominent characteristic elements are suppressed low and thereafter the density is converted so that the whole is raised, it can be seen that the density is increased as a whole.

本発明による手法で標準パターンをひらがな75文字に
対して作成し、判別式として類似度を用いた従来と同様
の認識処理を学習サンプル80文字、ひらがな75字種に対
して行った場合、従来手法で作成された標準パターンを
用いた場合の認識率90.5%に対して、95.5%に認識率が
向上した。変数a(a乗根のa)と認識率の関係を第5
図に示したので参照されたい。
When a standard pattern is created for 75 characters of Hiragana by the method according to the present invention, and the same recognition processing as in the past using similarity as a discriminant is performed for 80 characters of a learning sample and 75 characters of Hiragana, the conventional method is used. The recognition rate was improved to 95.5%, compared to the recognition rate of 90.5% when using the standard pattern created in. The relationship between the variable a (a of the a-th root) and the recognition rate is the fifth.
Please refer to the figure.

本発明は、前記特徴抽出手法に依存するものではな
く、伝播停止処理を用いた背景特徴(特開昭59-792号公
報)でも、その有効性が認識されている。
The present invention does not depend on the above-described feature extraction method, and its effectiveness has been recognized even for background features using a propagation stop process (Japanese Patent Laid-Open No. 59-792).

ここでは前記(4)式において、a=4としたが、a
の最適値は認識対象とするフィールドや特徴抽出手法、
判別式の種類によって異なると考えられる。また、総和
パターンの各特徴要素の対数をとる非線形変換も、a乗
根をとる非線形変換と同様に有効である。すなわち、f
(x)として下記に示す関数を用いる。
Here, in the above equation (4), a = 4.
The optimal value of is the field to be recognized, the feature extraction method,
It is thought that it depends on the type of discriminant. Further, the non-linear transformation taking the logarithm of each feature element of the sum pattern is also effective as the non-linear transformation taking the a-th root. That is, f
The following function is used as (x).

f(x)=log10(x+1) …(5) ただし、前記(4)式におけるaを大きくしていくと
特徴パターンは2値画像に近づき、認識率は低下する。
そこで、サンプル画像から抽出された特徴パターンの各
要素をa乗して足し込み、前記総和パターンを得ること
で、総和パターンにおける濃度分布をある程度保存した
まま、突出した特徴要素の値を低く押さえることができ
る。
f (x) = log 10 (x + 1) (5) However, as a in the above equation (4) increases, the feature pattern approaches a binary image, and the recognition rate decreases.
Therefore, by adding the respective elements of the feature pattern extracted from the sample image to the power a and adding the sum to obtain the sum pattern, the value of the prominent feature element can be kept low while the density distribution in the sum pattern is preserved to some extent. Can be.

〔発明の効果〕〔The invention's effect〕

本発明によれば、標準パターン作成時に、各サンプル
文字画像から抽出した特徴要素の足し込み総和パターン
において、生じることのある突出した特徴要素を押え込
むような非線形変換を、各特徴要素に施して標準パター
ンを作成することにより、認識率が向上するという利点
がある。非線形変換として4乗根を用いる場合、認識率
は90.5%から95.5%に向上し、認識率にして53%の改善
がみられた。
According to the present invention, at the time of creating a standard pattern, in the sum total pattern of the characteristic elements extracted from each sample character image, a non-linear transformation is performed on each characteristic element such that a prominent characteristic element that may occur is suppressed. There is an advantage that the recognition rate is improved by creating the standard pattern. When the fourth root is used as the nonlinear transformation, the recognition rate is improved from 90.5% to 95.5%, and the recognition rate is improved by 53%.

【図面の簡単な説明】[Brief description of the drawings]

第1図は本発明の一実施例を示すフローチャート、第2
図はパターンマッチングを用いる一般的な文字認識手順
を示すフローチャート、第3図は従来の標準パターン作
成方法を示すフローチャート、第4図は従来手法により
得られる標準パターンの濃度分布を示すグラフ、第5図
は変数aと認識率の関係を示す特性図、第6図は本発明
による非線形変換の特性例を示す特性図、である。 符号の説明 S1〜S6……ステップ
FIG. 1 is a flowchart showing one embodiment of the present invention, and FIG.
FIG. 3 is a flowchart showing a general character recognition procedure using pattern matching, FIG. 3 is a flowchart showing a conventional standard pattern creation method, FIG. 4 is a graph showing a standard pattern density distribution obtained by a conventional method, and FIG. FIG. 6 is a characteristic diagram showing the relationship between the variable a and the recognition rate, and FIG. 6 is a characteristic diagram showing an example of the characteristic of the non-linear conversion according to the present invention. Explanation of reference symbols S1 to S6 ... Step

───────────────────────────────────────────────────── フロントページの続き (72)発明者 木内 哲夫 東京都日野市富士町1番地 富士ファコ ム制御株式会社内 (72)発明者 紺野 章子 東京都日野市富士町1番地 富士ファコ ム制御株式会社内 (56)参考文献 特開 平1−116892(JP,A) 特開 平2−242391(JP,A) 特開 昭60−160488(JP,A) 特開 昭49−103536(JP,A) 特開 昭63−313283(JP,A) (58)調査した分野(Int.Cl.6,DB名) G06K 9/68──────────────────────────────────────────────────続 き Continuing on the front page (72) Inventor Tetsuo Kiuchi 1 Fujimachi, Hino-shi, Tokyo Fujifacom Control Co., Ltd. (72) Akiko Konno 1 Fujimachi, Hino-shi, Tokyo Fujifacom Control Ltd. (56) References JP-A-1-116892 (JP, A) JP-A-2-242391 (JP, A) JP-A-60-160488 (JP, A) JP-A-49-103536 (JP, A) JP-A-63-313283 (JP, A) (58) Fields investigated (Int. Cl. 6 , DB name) G06K 9/68

Claims (1)

(57)【特許請求の範囲】(57) [Claims] 【請求項1】パターンマッチングによる文字認識処理の
ために用いる標準パターンを複数のサンプル文字画像か
ら作成する標準パターン作成方法において、 各サンプル文字画像からの特徴抽出結果を各特徴要素毎
に足し込んでその総和パターンを得た後、得られた該総
和パターンを構成する各特徴要素に対し、その大きさの
a乗根(但し、a>1.0)をとる演算又は対数をとる演
算の如き、非線形演算を行う濃度変換を施して、その演
算結果から新たな総和パターンを得ることにより、突出
した特徴要素の値を押さえ込む段階を含むことを特徴と
する標準パターン作成方法。
In a standard pattern creation method for creating a standard pattern used for character recognition processing by pattern matching from a plurality of sample character images, a feature extraction result from each sample character image is added for each feature element. After obtaining the sum pattern, a non-linear operation such as an operation for taking the a-th root (a> 1.0) of the size or an operation for taking a logarithm is performed on each of the characteristic elements constituting the obtained sum pattern. And performing a density conversion to obtain a new total pattern from the calculation result, thereby suppressing the value of the prominent feature element.
JP2057993A 1990-03-12 1990-03-12 Standard pattern creation method Expired - Lifetime JP2749692B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2057993A JP2749692B2 (en) 1990-03-12 1990-03-12 Standard pattern creation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2057993A JP2749692B2 (en) 1990-03-12 1990-03-12 Standard pattern creation method

Publications (2)

Publication Number Publication Date
JPH03260888A JPH03260888A (en) 1991-11-20
JP2749692B2 true JP2749692B2 (en) 1998-05-13

Family

ID=13071535

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2057993A Expired - Lifetime JP2749692B2 (en) 1990-03-12 1990-03-12 Standard pattern creation method

Country Status (1)

Country Link
JP (1) JP2749692B2 (en)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60160488A (en) * 1984-02-01 1985-08-22 Nec Corp Framing system of standard pattern in pattern recognition
JPS63313283A (en) * 1987-06-17 1988-12-21 Agency Of Ind Science & Technol Image normalizing method
JPH0789362B2 (en) * 1987-10-30 1995-09-27 工業技術院長 Non-linear normalization method

Also Published As

Publication number Publication date
JPH03260888A (en) 1991-11-20

Similar Documents

Publication Publication Date Title
Yang et al. A local multiple patterns feature descriptor for face recognition
EP4109350B1 (en) Neural network compression method, data processing method, and related apparatuses
US20230386002A1 (en) Shadow elimination method and apparatus for text image, and electronic device
Vishwakarma et al. A novel non-linear modifier for adaptive illumination normalization for robust face recognition
Ojala et al. Texture discrimination with multidimensional distributions of signed gray-level differences
CN1459761B (en) Character Recognition Technology Based on Gabor Filter Bank
US10373022B1 (en) Text image processing using stroke-aware max-min pooling for OCR system employing artificial neural network
CN111949805B (en) Subtitle generation method, device, equipment and storage medium based on artificial intelligence
CN115240203A (en) Business data processing method, device, equipment and storage medium
CN114882278A (en) Tire pattern classification method and device based on attention mechanism and transfer learning
CN118981511A (en) Method, system, device and medium for improving retrieval performance of RAG knowledge base
CN117173409A (en) A real-time interactive image segmentation method
CN112035701A (en) Internet short video source tracing method and system
Lin et al. A genetic algorithm approach to Chinese handwriting normalization
Hirata et al. Iterative design of morphological binary image operators
JP2749692B2 (en) Standard pattern creation method
CN113705571B (en) Method and device for removing red seal based on RGB threshold, readable medium and electronic equipment
Krupiński et al. Improved two-step binarization of degraded document images based on Gaussian mixture model
CN117036711A (en) A weakly supervised semantic segmentation method based on attention regulation
CN119180774A (en) Industrial defect sample generation method and device, electronic equipment and storage medium
CN114943975A (en) Multi-modal question searching method and system based on deep learning
Borusyak et al. Development of an algorithm for adaptive compression of indexed images using contextual simulation
EP4068081A1 (en) Method and system for measuring a feature of persons
CN119887586B (en) Image restoration method for incomplete text image
EP4571671B1 (en) Fpga-based nlmeans noise-reduction simplification method and system for infrared image, and medium and device