JPS6329757B2

Info

Publication number: JPS6329757B2
Application number: JP56169177A
Authority: JP
Inventors: Norimasa Kishi; Kazunori Noso; Tooru Futami
Original assignee: Nissan Motor Co Ltd
Current assignee: Nissan Motor Co Ltd
Priority date: 1981-10-22
Filing date: 1981-10-22
Publication date: 1988-06-15
Also published as: JPS5870293A

Description

[Detailed description of the invention]

The present invention relates to a voice recognition device for a vehicle. In a device that commands the operation of a load corresponding to a voice command by recognizing the command on the basis of voice data registered in advance by the operator, the invention makes recognition reliable by commanding the load on the basis of recognition processing of two identical voice commands input in sequence.

A conventional voice recognition device of this type is shown, for example, in FIG. 1. In FIG. 1, reference numeral 1 denotes a microphone installed near the driver's seat. To use the device, the operator first registers predetermined voice commands in his or her own voice by operating the registration switch 2. That is, when the registration switch 2 is turned on, the switch input detection circuit 4 detects the switch operation and produces an output on signal line 4a, and the control unit 5 issues a registration-mode processing command to each circuit section. In this registration mode, when a predetermined voice command, for example the command "radio" assigned to turning on the car radio, is spoken toward the microphone 1, the microphone 1 converts it into an electric signal, which the voice input circuit 6 amplifies. The amplified signal is input to the voice detection circuit 7, which detects the start of the time-series voice signal from the rise in signal level and instructs the control unit 5 to start registration processing.

The voice signal from the microphone 1 is then divided into predetermined frequency bands by the band filter group 8. In the parameter extraction unit 9, the signal in each band is squared or rectified to obtain the voice power spectrum, converted to digital form as time-series voice data representing voice power, and stored in the memory 10. Because the registration mode has been set by the registration switch 2, the voice data in the memory 10 is transferred to the registered data storage unit 11 and registered as reference data for voice recognition.

After registration, to operate an on-vehicle load, for example to listen to the radio, the operator turns on the command start switch 3 and speaks the predetermined voice command "radio" toward the microphone 1. In response to the ON operation of the command start switch 3, the switch input detection circuit 4 produces an output on signal line 4b, and the control unit 5 issues a recognition-mode control command to each circuit section. The voice command input from the microphone 1 is converted into voice data in the same way as at registration and written into the memory 10. The similarity comparison processing unit 12 computes the similarity between the voice data stored in the memory 10 and each of the plural registered data read out in sequence from the registered data storage unit 11. For this computation, the voice data and the registered data are normalized along the time axis and in level, after which a Chebyshev distance or another distance is computed to give the similarity. The recognized word judgment processing unit 13 then determines whether the similarity value computed by the similarity comparison processing unit 12 falls within a predetermined threshold range.
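
The time-axis and level normalization mentioned above is not specified further in the text. As a minimal sketch, assuming linear resampling to a fixed number of samples and scaling to unit peak (both are illustrative choices, and the function names are hypothetical), one band's power sequence could be normalized like this:

```python
def normalize_time_axis(samples, target_len=32):
    """Linearly resample one band's power sequence to target_len samples.

    Assumption: linear interpolation; the source only says the time axis
    is normalized, not how.
    """
    n = len(samples)
    if n == 0 or target_len < 2:
        raise ValueError("need at least one sample and target_len >= 2")
    if n == 1:
        return [float(samples[0])] * target_len
    out = []
    for k in range(target_len):
        pos = k * (n - 1) / (target_len - 1)  # position on the original axis
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


def normalize_level(samples):
    """Scale a sequence so that its peak magnitude is 1 (assumed scheme)."""
    peak = max(abs(v) for v in samples)
    if peak == 0:
        return [0.0] * len(samples)
    return [v / peak for v in samples]
```

Applying both functions to every band gives patterns of equal length and comparable level, so that a distance between a spoken command and a registered command does not depend on how fast or how loudly the command was spoken.
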
When the similarity is within that range, a command corresponding to the voice data having that similarity is output to the drive unit 14, and the car radio is turned on.

However, a voice recognition device for a vehicle operates in an environment with a higher ambient noise level than a general voice recognition device. In particular, when the vehicle is in use with the engine started, the noise level in the cabin rises, and noise intrudes at random. The proportion of noise in the voice signal is therefore high when a load is commanded by voice, and when relatively loud noise is mixed in, the voice command may fail to be recognized or, worse, may be misrecognized so that the wrong load is operated.

The present invention addresses these problems of the prior art. In a device that recognizes, on the basis of registered voice data, a voice command input following the ON operation of a command start switch, and that commands the operation of the load corresponding to the voice command on the basis of the recognition output, the object is to make recognition reliable by commanding the load only when two voice commands input in sequence are determined to be identical.

The invention is described below with reference to the drawings. FIG. 2 is a block diagram showing one embodiment of the invention. As to the configuration, the following circuit section is the same as in the conventional device of FIG. 1: the registration switch 2, the command start switch 3, the switch input detection circuit 4, the control unit 5, and the parameter extraction circuit 9. The parameter extraction circuit 9 receives the voice signal from the band filter group 8 (not shown), which divides the microphone signal into frequency bands, extracts features of the power spectrum in each band, and converts them into predetermined time-series data. In addition, the invention provides a multiplexer 20, controlled by the control unit 5, at the output of the parameter extraction circuit 9. At the outputs of the multiplexer 20 are a memory 10a that stores the voice data F₁(X) of the first voice command and a memory 10b that stores the voice data F₂(X) of the second voice command.

The registered data storage unit 11 holds registered data F₀(X), stored in advance, obtained by converting into time-series data the predetermined voice commands spoken by the operator in the registration mode entered by turning on the registration switch 2. The registered data stored in the registered data storage unit 11 is, for example, as shown in Table 1 below.

[Table 1]

Taking "radio" as an example, the registered data shown in Table 1 constitutes time-series data in which i = 1 to 4 is the filter stage and j = 1 to 32 is the time-series sample number. This time-series data assumes, as an example, that the band filter group 8 has four filter stages.

The similarity comparison processing unit 12 computes the similarity used for voice recognition; a Chebyshev distance computation or the like is used for this purpose. It contains similarity calculators 120a and 120b. The similarity calculator 120a computes the similarity between the data F₁(X) stored in the memory 10a and the registered data F₀(X) in the registered data storage unit 11 by the Chebyshev distance computation

$$ l_1(X) = \lvert F_0(X) - F_1(X) \rvert = \left\lvert \sum_{i=1}^{4} \sum_{j=1}^{32} f^{0}_{X}(i,j) - \sum_{i=1}^{4} \sum_{j=1}^{32} f^{1}_{X}(i,j) \right\rvert \qquad (1) $$

and the similarity calculator 120b computes the similarity between the voice data F₁(X) and F₂(X) in the memories 10a and 10b by the Chebyshev distance computation

$$ l_2(X) = \lvert F_1(X) - F_2(X) \rvert = \left\lvert \sum_{i=1}^{4} \sum_{j=1}^{32} f^{1}_{X}(i,j) - \sum_{i=1}^{4} \sum_{j=1}^{32} f^{2}_{X}(i,j) \right\rvert \qquad (2) $$

Because the similarity is expressed as a distance, a smaller value indicates a closer match. A minimum value discriminator 122 is provided at the output of the similarity calculator 120a; it determines and outputs the smallest of the plural similarity values computed by the similarity calculator 120a.

The recognized word judgment processing unit 13 determines whether the similarity computed by the similarity comparison processing unit 12 is within a predetermined threshold range and, when it is, outputs a command to operate the load corresponding to the voice command. Its comparator 130a compares the threshold h₁ stored in the threshold memory 132a with the minimum similarity value extracted by the minimum value discriminator 122 and passes the comparison result to the discriminator 134.
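
The first-stage recognition described above (similarity calculator 120a, minimum value discriminator 122, comparator 130a) can be sketched as follows. This is a minimal illustration, not the circuit itself: the names `distance` and `recognize_first` and the dictionary layout of the registered data are hypothetical, and the distance follows equation (1) literally as printed. Note that the textbook Chebyshev distance is the maximum absolute element-wise difference, which is not what the printed equation computes.

```python
def total_power(pattern):
    """Sum every sample f(i, j) of one pattern (rows = filter stages i,
    columns = time-series samples j; 4 x 32 in the example of Table 1)."""
    return sum(sum(row) for row in pattern)


def distance(a, b):
    """Equations (1) and (2) as printed: |sum of a - sum of b|."""
    return abs(total_power(a) - total_power(b))


def recognize_first(f1, registered, h1):
    """Compare the first utterance f1 with every registered pattern.

    registered: dict mapping a command word to its pattern F0 (assumed layout).
    Returns (word, l1) when the smallest distance l1 is <= h1,
    otherwise (None, l1).
    """
    # Role of the minimum value discriminator 122: keep the closest word.
    word, l1 = min(
        ((w, distance(f0, f1)) for w, f0 in registered.items()),
        key=lambda pair: pair[1],
    )
    # Role of the comparator 130a: accept only at or below threshold h1.
    return (word if l1 <= h1 else None), l1
```

A call such as `recognize_first(f1, registered, h1=10)` returns the best-matching word only if its distance does not exceed `h1`; the threshold value here is purely illustrative.
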
The comparator 130b takes as its reference value the threshold h₂ stored in the threshold memory 132b, compares the similarity computed by the similarity calculator 120b against h₂, and passes the result to the discriminator 134. When the comparator 130a finds a similarity equal to or less than the threshold h₁, the discriminator 134 instructs the display 18 to show a request for a second voice command and instructs the control unit 5 to switch the multiplexer 20 to the memory 10b side. When, after the second voice command has been input, the comparator 130b finds a similarity equal to or less than the threshold h₂, the discriminator 134 sends the control unit 5 a signal commanding the operation of the load corresponding to the voice command.

In the embodiment of FIG. 2, the control unit 5 also receives the output of a vehicle state detection circuit 15. This circuit receives the output of an ignition relay 16, whose contact closes when the ignition switch is turned on, and the alternator output. When both the ON output of the ignition relay 16 and the alternator output are present, the circuit detects and reports that the vehicle is in use; otherwise it detects and reports that the vehicle is stopped. The purpose of the vehicle state detection circuit 15 is to detect the state in which the vehicle is in use, that is, the state in which the engine has been started and the cabin noise level has risen, and to inform the control unit 5. While the circuit detects that the vehicle is in use, the recognition mode in which the same voice command is input twice is selected.

Next, the operation of the embodiment of FIG. 2 is described with reference to the operation flow shown in FIG. 3.

Suppose the operator turns on the command start switch 3 and speaks a predetermined voice command while the vehicle state detection circuit 15 is reporting that the vehicle is in use. The output of the switch input detection circuit 4 caused by the ON operation of the command start switch 3 puts the control unit 5 into the recognition mode in which the same voice command is input twice. The control unit 5 first switches the multiplexer 20 to the memory 10a side and waits for a voice command, as shown in block B. When the predetermined voice command is input through the microphone, the voice signal is divided in frequency by the band filter group 8 and converted by the parameter extraction circuit 9 into time-series data corresponding to the power spectrum of each frequency band. During this conversion the data is normalized along the time axis and in level, and it is stored through the multiplexer 20 in the memory 10a as voice data F₁(X). Once the first voice data is in the memory 10a, the similarity calculator 120a of the similarity comparison processing unit 12 computes, as shown in block D, the similarity between the first voice data F₁(X) and the registered data F₀(X) in the registered data storage unit 11 using equation (1). As shown in block E, the smallest of the computed similarities l₁(X) is selected and passed to the comparator 130a of the recognized word judgment processing unit 13. As shown in decision block F, the comparator 130a compares it with the threshold h₁ from the threshold memory 132a. When the minimum similarity l₁(X) is equal to or less than h₁, the discriminator 134 causes the display 18 to request a second voice command, as shown in block G. At the same time, the discriminator 134 gives the control unit 5 a control command to switch the multiplexer 20 to the memory 10b side. When the operator, prompted by the request on the display 18, speaks the same voice command again, the voice data, converted to time-series data by the parameter extraction circuit 9 as before, is stored through the multiplexer 20 in the memory 10b as voice data F₂(X).

Next, as shown in block I, the similarity calculator 120b of the similarity comparison processing unit 12 computes the similarity l₂(X) between the voice data F₁(X) and F₂(X) stored in the memories 10a and 10b using equation (2) and passes it to the comparator 130b. As shown in decision block J, the comparator 130b compares l₂(X) with the threshold h₂ from the threshold memory 132b. When l₂(X) is equal to or less than h₂, the discriminator 134 gives the control unit 5 the operation command for the load recognized in blocks C through F, that is, the operation command corresponding to X as shown in block K. The control unit 5 then operates a drive unit (not shown) to operate the load corresponding to the voice command, and recognition ends.

If, on the other hand, decision block F or J finds that the similarity l₁(X) or l₂(X) exceeds the threshold h₁ or h₂, the flow proceeds to block L, the display 18 shows a request to repeat the input, and the flow returns to block B to wait for the first voice command.
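
The flow of FIG. 3 as described above can be sketched as a short loop. This is an illustration under stated assumptions rather than the patented circuit: `next_utterance` and `show` are hypothetical callables standing in for the microphone path and the display 18, the prompt strings are invented, and `max_attempts` is added only so that the sketch terminates (the described flow simply returns to block B).

```python
def distance(a, b):
    """Equations (1)/(2) as printed: |sum of a - sum of b|."""
    return abs(sum(map(sum, a)) - sum(map(sum, b)))


def two_utterance_recognition(next_utterance, registered, h1, h2,
                              show=print, max_attempts=3):
    """Return the recognized word, or None if no attempt is confirmed."""
    for _ in range(max_attempts):
        f1 = next_utterance()                    # block B: first command -> memory 10a
        word, l1 = min(                          # blocks D-E: closest registered word
            ((w, distance(f0, f1)) for w, f0 in registered.items()),
            key=lambda pair: pair[1],
        )
        if l1 > h1:                              # block F: first command not recognized
            show("Please repeat the command")    # block L
            continue
        show("Please say the command again")     # block G
        f2 = next_utterance()                    # second command -> memory 10b
        l2 = distance(f1, f2)                    # block I: compare the two utterances
        if l2 <= h2:                             # block J: utterances judged identical
            return word                          # block K: command the load for this word
        show("Please repeat the command")        # block L
    return None
```

The second utterance is compared with the first utterance, not with the registered data, so the confirmation step checks that the operator said the same thing twice under similar noise conditions.
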
The thresholds h₁ and h₂ in the recognized word judgment processing unit 13 are set so that h₁ > h₂; that is, the threshold used to judge the similarity of the second voice command is stricter than that used for the first. If the proportion of noise in the first and second voice commands differs greatly, there is a risk of misrecognition caused by the noise. In that case the similarity exceeds the threshold h₂, so the load is not operated and the operator is made to repeat the voice command, which reliably prevents a wrong load from being operated through misrecognition.

Because the above embodiment has the same voice command input twice to avoid misrecognition, voice operation becomes possible even for functions where a malfunction while driving would be troublesome, such as turning the ignition switch on and off or turning the lights on and off.

In the embodiment of FIG. 2, the first voice command is recognized against the registered data, and the second voice command is judged for similarity to the first; on condition that the two are the same, the load corresponding to the first recognized command is commanded. As another embodiment, each of the first and second voice commands may be recognized against the registered data, and the load corresponding to the voice command commanded only when the two recognition outputs agree.

Also, in the above embodiment the recognition processing with two inputs is performed while the vehicle is detected to be running, but it may instead be performed in all vehicle states, whether the vehicle is running or stopped.

As described above, the invention applies to a device that recognizes a voice command input following the ON operation of a command start switch on the basis of registered voice data stored in advance and commands the operation of the load corresponding to the voice command on the basis of the recognition output. At least one of two identical voice commands input in sequence is recognized on the basis of the registered voice data, and the load corresponding to the voice command is commanded only when the two voice commands are determined to be identical. Therefore, even when a voice command is given while the cabin noise level is high, as during driving, recognition is based on two successive voice commands, so that even if noise is mixed in with a voice command, misrecognition caused by the noise can be reliably prevented. As a result, important operations essential to driving, such as turning the ignition switch on and off or turning the lights on and off, can also be operated by voice command, and the reliability of the voice recognition device is greatly improved.
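
The two variations mentioned above (recognizing both commands against the registered data, and choosing when two inputs are required) can be sketched as follows. All names are hypothetical, the distance follows equation (1) as printed, and using the same threshold for both recognitions is an assumption, since the text does not state one.

```python
def total(pattern):
    """Sum every sample of one pattern."""
    return sum(sum(row) for row in pattern)


def recognize(pattern, registered, threshold):
    """Closest registered word if its distance is within threshold, else None."""
    word, dist = min(
        ((w, abs(total(ref) - total(pattern))) for w, ref in registered.items()),
        key=lambda pair: pair[1],
    )
    return word if dist <= threshold else None


def confirm_by_double_recognition(f1, f2, registered, threshold):
    """Alternative embodiment: accept only if both utterances map to the same word."""
    first = recognize(f1, registered, threshold)
    second = recognize(f2, registered, threshold)
    return first if first is not None and first == second else None


def needs_two_inputs(ignition_relay_on, alternator_output, always=False):
    """Two inputs while the vehicle is in use (ignition relay on AND alternator
    output present), or in every state when always=True."""
    return always or (ignition_relay_on and alternator_output)
```

With `always=True` the sketch corresponds to the variation in which two inputs are required in all vehicle states.
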

[Brief explanation of the drawing]

FIG. 1 is a block diagram showing an example of a conventional device, FIG. 2 is a block diagram showing one embodiment of the present invention, and FIG. 3 is an operation flow diagram of the embodiment of FIG. 2.

1 ... microphone, 2 ... registration switch, 3 ... command start switch, 4 ... switch input detection circuit, 5 ... control unit, 6 ... voice input circuit, 7 ... voice detection circuit, 8 ... band filter group, 9 ... parameter extraction circuit, 10, 10a, 10b ... memory, 12 ... similarity comparison processing unit, 13 ... recognized word judgment processing unit, 14 ... drive unit, 15 ... vehicle state detection circuit, 16 ... ignition relay, 18 ... display, 20 ... multiplexer, 11 ... registered data storage unit, 120a, 120b ... similarity calculator, 122 ... minimum value discriminator, 130a, 130b ... comparator, 132a, 132b ... threshold memory, 134 ... discriminator.

Claims

[Scope of Claims]

1. A voice recognition device for a vehicle, in a device that recognizes a voice command input following the ON operation of a command start switch on the basis of registered voice data and commands the operation of a load corresponding to the voice command on the basis of the recognition output, characterized in that recognition means is provided which recognizes at least one of at least two voice commands input in sequence on the basis of the registered voice data, and which commands the operation of the load corresponding to the voice command only when it determines that the similarity between the voice commands input in sequence is within a predetermined range.

2. The voice recognition device for a vehicle according to claim 1, wherein the recognition means comprises: a storage unit that stores voice data based on a first voice command and voice data based on a second voice command; a recognition calculation unit that calculates the degree of similarity between a plurality of registered voice data and the first voice data stored in the storage unit, and recognizes the first voice data on the basis of a degree of similarity equal to or less than a predetermined threshold; and an output command unit that calculates the degree of similarity between the first and second voice data stored in the storage unit and, when that degree of similarity is equal to or less than a predetermined threshold, determines that the voice commands are the same and commands the operation of the load on the basis of the recognition output of the recognition calculation unit.