Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
JPH0766274B2 - Word speech recognizer - Google Patents
[go: Go Back, main page]

JPH0766274B2 - Word speech recognizer - Google Patents

Word speech recognizer

Info

Publication number
JPH0766274B2
JPH0766274B2 JP62132903A JP13290387A JPH0766274B2 JP H0766274 B2 JPH0766274 B2 JP H0766274B2 JP 62132903 A JP62132903 A JP 62132903A JP 13290387 A JP13290387 A JP 13290387A JP H0766274 B2 JPH0766274 B2 JP H0766274B2
Authority
JP
Japan
Prior art keywords
word
words
distance
group
alternative
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP62132903A
Other languages
Japanese (ja)
Other versions
JPS63294600A (en
Inventor
隆夫 渡辺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP62132903A priority Critical patent/JPH0766274B2/en
Publication of JPS63294600A publication Critical patent/JPS63294600A/en
Publication of JPH0766274B2 publication Critical patent/JPH0766274B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Description

【発明の詳細な説明】 (産業上の利用分野) 本発明は音声認識技術、特に単語音声認識の技術の改良
に関する。
Description: TECHNICAL FIELD The present invention relates to a speech recognition technique, and more particularly to improvement of a word speech recognition technique.

(従来の技術とその問題点) 単語音声認識装置では、利用者のアプリケーシヨンに従
つて認識語いを設定し、語いの各単語を標準パタンとし
て登録してこれを用いて認識を行つているところが、従
来の単語音声認識装置は、設定する語いの如何によつて
は、誤認識を起こしやすい。これは、認識対象となる単
語の中に互いに類似している単語が存在することがある
からである。
(Prior art and its problems) With a word voice recognition device, a recognition word is set according to the application of the user, each word of the word is registered as a standard pattern, and recognition is performed using this. However, the conventional word voice recognition device is prone to erroneous recognition depending on the set word. This is because there may be words that are similar to each other among the words to be recognized.

本発明は、利用者が使いたい単語名を、利用者に代替可
能な複数の単語群として設定してもらい、これらの単語
群の中で最も誤認識を起こしにくい組合せを自動的に決
定する手段を装置の中に組み入れることにより、認識誤
りの起こりにくい音声認識装置を実現することを目的と
している。
The present invention is a means for allowing a user to set a word name that a user wants to use as a plurality of word groups that can be substituted, and for automatically determining a combination that is most unlikely to cause misrecognition in these word groups. The object of the present invention is to realize a voice recognition device in which recognition errors are unlikely to occur by incorporating the above into the device.

(問題点を解決するための手段) また、本願の発明は、認識対象となる語いを設定するに
際して、対象となる単語機能の各各について互いに代替
可能な複数の単語からなる単語群(代替単語群)を定義
し、各代替単語群の中から一つの単語を選択することに
よつて語いを決定する手段を含んでなり、この語い決定
手段における前記単語の選択は代替単語群中の単語の音
形記述により算出された各単語間距離を用いて行われる
ことを特徴とする。
(Means for Solving Problems) Further, according to the invention of the present application, when setting a vocabulary to be recognized, a word group consisting of a plurality of words that can be mutually replaced for each of the target word functions (alternative (Group of words) and determining a vocabulary by selecting one word from each of the alternative word groups. Is performed by using the distance between words calculated by the phonetic description of the word.

(作 用) 本発明の基本的な原理を以下に説明する。利用者の必要
とする単語の種類をKとする。ここではこれをK種類の
カテゴリがあると呼ぶことにする。利用者は各単語の代
替として可能な単語名を与える。即ち、利用者は各カテ
ゴリ毎に可能な単語の群を与える。一方、単語の間での
誤認識の起こりやすさを表すものとして、単語間での距
離D(X,Y)を考える。ここでX,Yは単語を表す。
(Operation) The basic principle of the present invention will be described below. Let K be the type of word required by the user. Here, this will be called that there are K types of categories. The user gives a possible word name as a substitute for each word. That is, the user gives a group of possible words for each category. On the other hand, the distance D (X, Y) between words is considered as a measure of the likelihood of misrecognition between words. Here, X and Y represent words.

問題は、各カテゴリに属する単語群の内から、単語を選
んだとき、選択された単語相互間の距離がなるべく大き
くなるように、単語を選択することである。誤認識の点
からは、単語間距離の小さい単語対の存在を避けること
が必要である。このためには、選択された各単語間の距
離の最小値が最大となるように、単語を選択を行えばよ
い。選択の組合せが少ないときはすべての組合せについ
て単語間距離を計算すればよいが、語いが増えるに従つ
て処理量も著しく増加する。ここでは、最適ではない
が、単語間距離をできるだけ大きく保つより効果的な次
の方法について説明する。
The problem is to select the words from the group of words belonging to each category so that the distance between the selected words becomes as large as possible. In terms of misrecognition, it is necessary to avoid the presence of word pairs with small interword distances. For this purpose, the words may be selected so that the minimum value of the distances between the selected words becomes the maximum. When the number of selected combinations is small, the inter-word distance may be calculated for all the combinations, but the processing amount increases remarkably as the number of words increases. Here, the following method, which is not optimal but more effective than keeping the distance between words as large as possible, will be described.

基本的な手順は、すべてカテゴリの代替単語群の中か
ら、他の単語との単語間距離の小さい単語を順次削除
し、最終的に各カテゴリ1ケの単語を残すというもので
ある。具体的には次のようになる。
The basic procedure is to sequentially delete words having a small interword distance from other words from the alternative word group of each category, and finally leave one word for each category. Specifically, it is as follows.

(1) 全カテゴリの代替単語群中の各単語Wについて
自分の属するカテゴリ以外のすべての単語(但し、既に
削除されている単語を除く)との距離を計算し、これら
の距離の最小値をd(w)とする。
(1) For each word W in the alternative word group of all categories, calculate the distance from all the words other than the category to which it belongs (excluding the words that have already been deleted), and calculate the minimum value of these distances. Let d (w).

(2) 得られたd(w)のうちで最小値を与えるW
(=w)を選択し、これを削除する。但し、wがそ
のカテゴリに属する唯一の単語であるときには、w
かわりに、2番目に小さい値を与えるwを選択する。
(2) W giving the minimum value of the obtained d (w)
Select (= w * ) and delete it. However, when w * is the only word belonging to that category, w is selected instead of w * , which gives the second smallest value.

D(X,Y)を決定する方法として、ここでは次の方法を
示す。
The following method is shown here as a method of determining D (X, Y).

単語の音形記述、すなわち、単語を音素(ないし音節)
の列としたものを使う方法である。音素(ないし音節)
間の距離をあらかじめ音声学の知識に基いて定義してお
けば、通常のDPマツチングの方法においてパタンのベツ
トル間距離のかわりにこの音素間距離を用いることによ
つて、単語間距離が算出される。すなわち比較する2つ
の単語の音形記述をそれぞれA={a1…a}、B=
{b1…b}とし、音素X,Y間距離をd(x,y)とする
と、単語間距離D(A,B)は次の漸化式により計算する
ことができる。
Phonetic description of a word, that is, a phoneme (or syllable) of a word
This is a method of using the columns of. Phonemes (or syllables)
If the distance between words is defined in advance based on the knowledge of phonetics, the distance between words can be calculated by using this distance between phonemes instead of the distance between patterns in the normal DP matching method. It That is, the phonetic descriptions of the two words to be compared are A = {a 1 ... A M }, B =
If {b 1 ... b N } and the phoneme X, Y distance is d (x, y), the word distance D (A, B) can be calculated by the following recurrence formula.

初期条件g(m,n)=0,m=0,n=0のとき=∽(m>0,n
=0)又は(m=0,u>0)のとき m=1,…,M;n=1,…,N D(A,B)=g(M,N)/(M+N) (実施例) 第1図は本発明を実現した装置の一実施例を示すブロツ
ク図である。参照数字1はパタンバツフアであり各カテ
ゴリに属する単語群の音形記述が格納される。参照数字
2はパタン間距離計算部でありパタンバツフア1内の各
カテゴリkに属する単語群W(1)…W(J
(K))のそれぞれの間の距離の群{d(j1,j2)}が
計算され、パタン間距離バツフア3に格納される。但し
ここでd(j1,j2)は単語W(j1)とW(j2)の間
のパタン間距離であり、前述の方法により算出されるも
のである。参照数字4はパタン選択部であり、上記の距
離バツフア3の内容を読み出し、前述した単語を順次削
除する手順を実行し、最終的に標準パタンとして使用す
る単語を選択し選択結果を出力する。認識部5は、上記
により選択された単語について標準パタン登録を行い、
これを用いて単語認識を行う。ここで、単語認識の実現
形態としては標準パタンとのマツチングによるものであ
ればいかなるものでも使用可能である。
When initial condition g (m, n) = 0, m = 0, n = 0 = ∽ (m> 0, n
= 0) or (m = 0, u> 0) m = 1, ..., M; n = 1, ..., N D (A, B) = g (M, N) / (M + N) (Embodiment) FIG. 1 shows an embodiment of an apparatus realizing the present invention. It is a block diagram showing. Reference numeral 1 is a pattern and stores the phonetic description of a word group belonging to each category. Reference numeral 2 is an inter-pattern distance calculation unit, which is a word group W K (1) ... W K (J belonging to each category k in the pattern buffer 1
A group {d (j 1 , j 2 )} of distances between (K)) is calculated and stored in the inter-pattern distance buffer 3. Here, d (j 1 , j 2 ) is the inter-pattern distance between the words W K (j 1 ) and W K (j 2 ) and is calculated by the method described above. Reference numeral 4 is a pattern selection unit, which reads the contents of the distance buffer 3 described above, executes the above-described procedure for sequentially deleting words, and finally selects a word to be used as a standard pattern and outputs the selection result. The recognition unit 5 performs standard pattern registration for the word selected above,
Word recognition is performed using this. Any word recognition can be used as long as it is based on matching with a standard pattern.

(発明の効果) 以上述べたように本発明によれば、複数の単語群の中で
最も誤認識を起こしにくい組合せを自動的に決定する手
段を装置の中に組み入れることが可能となり、認識精度
の高い音声認識装置を実現できる。
(Effects of the Invention) As described above, according to the present invention, it is possible to incorporate a means for automatically determining a combination that is most unlikely to cause misrecognition in a plurality of word groups into the device, and to improve recognition accuracy. It is possible to realize a high-quality voice recognition device.

【図面の簡単な説明】[Brief description of drawings]

第1図は本発明による一実施例を示すブロツク図であ
り、図中、1……パタンバツフア、2……距離計算部、
3……距離バツフア、4……選択部、5……認識部であ
る。
FIG. 1 is a block diagram showing an embodiment according to the present invention, in which 1 ... Pattern buffer, 2 ... distance calculator,
3 ... Distance buffer, 4 ... selection unit, 5 ... recognition unit.

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】認識対象となる語いを設定するに際して、
対象となる単語機能の各々について互いに代替可能な複
数の単語からなる単語群を代替単語群として定義し、各
代替単語群の中から一つの単語を選択することによって
語いを決定する機能を含んでなり、この語い決定手段に
おける前記単語の選択は前記代替単語群中の単語の音形
記述により算出された各単語間距離を用いて行われるこ
とを特徴とする単語音声認識装置。
1. When setting a vocabulary to be recognized,
Includes a function that defines a word group consisting of multiple words that can be substituted for each target word function as an alternative word group, and selects a word from each alternative word group The word speech recognition apparatus characterized in that the word selection in the vocabulary determination means is performed by using the inter-word distances calculated by the phonetic description of the words in the alternative word group.
JP62132903A 1987-05-27 1987-05-27 Word speech recognizer Expired - Fee Related JPH0766274B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP62132903A JPH0766274B2 (en) 1987-05-27 1987-05-27 Word speech recognizer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP62132903A JPH0766274B2 (en) 1987-05-27 1987-05-27 Word speech recognizer

Publications (2)

Publication Number Publication Date
JPS63294600A JPS63294600A (en) 1988-12-01
JPH0766274B2 true JPH0766274B2 (en) 1995-07-19

Family

ID=15092236

Family Applications (1)

Application Number Title Priority Date Filing Date
JP62132903A Expired - Fee Related JPH0766274B2 (en) 1987-05-27 1987-05-27 Word speech recognizer

Country Status (1)

Country Link
JP (1) JPH0766274B2 (en)

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS57129497A (en) * 1981-02-03 1982-08-11 Sanyo Electric Co Voice register

Also Published As

Publication number Publication date
JPS63294600A (en) 1988-12-01

Similar Documents

Publication Publication Date Title
US6912499B1 (en) Method and apparatus for training a multilingual speech model set
US6269335B1 (en) Apparatus and methods for identifying homophones among words in a speech recognition system
US7555430B2 (en) Selective multi-pass speech recognition system and method
CN1315809A (en) Apparatus and method for spelling speech recognition in mobile communication
US6549883B2 (en) Method and apparatus for generating multilingual transcription groups
JPS592040B2 (en) Voice recognition device
US6408271B1 (en) Method and apparatus for generating phrasal transcriptions
JPH0766274B2 (en) Word speech recognizer
JPH049320B2 (en)
JPS6325366B2 (en)
Scagliola et al. Continuous speech recognition via diphone spotting a preliminary implementation
JP3100208B2 (en) Voice recognition device
JP2574242B2 (en) Voice input device
JPH0437997B2 (en)
JPS6131878B2 (en)
JPS6134599A (en) Pattern recognition system
JPH0449719B2 (en)
JPS617890A (en) Word inspection system for voice to be recognized
JPH0574838B2 (en)
JPH0632021B2 (en) Japanese speech recognizer
JPH08305389A (en) Voice recognition device
JPS632100A (en) Voice recognition equipment
JPH05232987A (en) Method for recognizing japanese voice
JPH0638198B2 (en) Continuous speech recognizer
JPS6140686A (en) Registration system for standard pattern

Legal Events

Date Code Title Description
LAPS Cancellation because of no payment of annual fees