JPH0731508B2 - Speech recognition response device - Google Patents

Speech recognition response device

Info

Publication number
JPH0731508B2
Authority
JP
Japan
Prior art keywords
voice
response
input
response device
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
JP4139390A
Other languages
Japanese (ja)
Other versions
JPH05173589A (en)
Inventor
洋一 竹林
英範 篠田
輝彦 浮田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp
Priority to JP4139390A
Publication of JPH05173589A
Publication of JPH0731508B2
Anticipated expiration
Expired - Lifetime

Landscapes

  • Machine Translation (AREA)

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a speech recognition response device used in information processing systems that accept voice input.

[0002]

[Prior Art] Speech recognition and speech synthesis technologies have advanced remarkably in recent years. For example, continuous speech recognition and speaker-independent speech recognition have become possible, as has high-accuracy speech synthesis using linear predictive coding. Synthesis-by-rule methods for converting text into speech are also being actively researched and developed.

[0003] Using these technologies, attempts are being made to develop, for example, telephone voice response service systems that provide various services over the public telephone network, and online business systems for banks and similar institutions. The usefulness of such systems is attracting attention.

[0004]

[Problem to Be Solved by the Invention] However, systems of this kind are used by an unspecified large number of people, including inexperienced users such as the elderly and children as well as people who use the system many times a day. Despite this, in conventional devices the content of the voice response is uniform and its speaking rate is constant, so dialogue between human and machine has not been smooth. That is, the response is either redundant and irritating, or hard to understand.

[0005] The present invention has been made in view of these circumstances. Its object is to provide a highly practical speech recognition response device that enables natural, smooth dialogue between human and machine and thereby enables effective information processing by voice input.

[0006]

[Means for Solving the Problem] The present invention is a speech recognition response device that recognizes input speech and outputs, as speech, a response sentence to the recognition result. It is characterized by comprising measuring means for measuring the speaking rate of the input speech, and control means for which expression forms with different numbers of characters are prepared in advance for response sentences expressing the same meaning, and which controls the expression form of the response sentence according to the speaking rate measured by the measuring means.

[0007]

[Operation] Thus, in the present invention, controlling the expression form of the response sentence according to the speaking rate of the input speech makes it possible to give the speaker an appropriate response. For example, giving a concise response with few characters to a frequent user, and a polite response with many characters to an infrequent user, makes it possible to give appropriate voice input instructions and to sufficiently enhance the naturalness and smoothness of the dialogue.

[0008]

[Embodiment] An embodiment of the present invention is described below with reference to the drawing. FIG. 1 is a schematic block diagram of a speech recognition response device according to an embodiment of the present invention. This device detects the speaking rate of the input speech and changes the content of the response sentence itself. Specifically, the input speech is analyzed by the analyzer 11, and the speech pattern obtained by this analysis is stored in the speech pattern memory 12. The speech recognition unit 13 recognizes this speech pattern by referring to the speech dictionary registered in the dictionary memory 14.

[0009] Meanwhile, the pitch frequency component of the input speech is extracted by the pitch extractor 15. This extraction is performed independently of the recognition processing of the input speech, using, for example, the cepstrum method, the modified correlation method, or the ADM method. From the change in the time-series pattern of this pitch frequency, the speaking rate measuring unit 16 obtains the speaking rate v of the input speech. The voice response control unit 17 receives this speaking rate v and the recognition result of the input speech and determines a response sentence with corresponding content, which is output as speech through the voice response output unit 18.
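The text does not say how the speaking rate v is derived from the pitch time series, so the following Python sketch is only one possible reading. It assumes the pitch extractor reports one pitch value per frame, with 0 for unvoiced frames, and it counts voiced runs per second as a rough proxy for v. The function names, the zero-for-unvoiced convention, and the frame period are all assumptions introduced for illustration.

```python
from typing import Sequence


def count_voiced_runs(pitch_hz: Sequence[float]) -> int:
    """Count maximal runs of consecutive voiced frames (pitch > 0).

    Assumption: the pitch extractor outputs 0.0 for unvoiced frames.
    """
    runs = 0
    previous_voiced = False
    for f0 in pitch_hz:
        voiced = f0 > 0.0
        if voiced and not previous_voiced:
            runs += 1  # a new voiced run starts on this frame
        previous_voiced = voiced
    return runs


def speaking_rate(pitch_hz: Sequence[float], frame_period_s: float) -> float:
    """Return voiced runs per second, a hypothetical stand-in for v."""
    duration_s = len(pitch_hz) * frame_period_s
    if duration_s <= 0.0:
        return 0.0
    return count_voiced_runs(pitch_hz) / duration_s


if __name__ == "__main__":
    # Ten frames at a 0.1 s frame period, with three separate voiced stretches.
    contour = [0.0, 120.0, 125.0, 0.0, 0.0, 130.0, 128.0, 0.0, 110.0, 0.0]
    print(speaking_rate(contour, frame_period_s=0.1))
```

Counting voiced runs ignores pitch height entirely. A measure that also used the slope of the contour would be closer to "the change in the time-series pattern", but the source gives no detail that would justify a specific formula.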

[0010] That is, according to the speech recognition result and the speaking rate, one response is selected from among several that express the same meaning but differ in expression form (number of characters), and is output as speech. Examples are "Thank you" (「ありがとう」), "Thank you very much" (「ありがとうございます」), and "Thank you very much, please come again" (「ありがとうございました、またどうぞ」). In other words, a voice response whose content and speed suit the speaker is produced. Therefore, when the voice response gives the next voice input instruction, the instruction can be given concisely with few characters, or, for an inexperienced person, politely with many characters. This enhances the naturalness of the dialogue and improves processing efficiency.
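The selection among same-meaning responses of different lengths can be sketched as follows in Python. The three phrases are the ones quoted in the text. Everything else is an assumption made for illustration: the function name, the two thresholds, the unit of the rate, and the mapping itself (fast speaker gets the shortest form, slow speaker the longest), which the source implies but does not spell out.

```python
from typing import Sequence

# Same meaning, increasing number of characters (phrases taken from the text).
THANKS_VARIANTS = (
    "ありがとう",
    "ありがとうございます",
    "ありがとうございました、またどうぞ",
)


def select_response(
    variants: Sequence[str],
    rate: float,
    fast_threshold: float = 6.0,  # hypothetical threshold, not from the source
    slow_threshold: float = 3.0,  # hypothetical threshold, not from the source
) -> str:
    """Pick an expression form by character count according to speaking rate."""
    if not variants:
        raise ValueError("at least one response variant is required")
    ordered = sorted(variants, key=len)  # shortest first
    if rate >= fast_threshold:
        return ordered[0]  # fast speaker: concise response
    if rate <= slow_threshold:
        return ordered[-1]  # slow speaker: long, polite response
    return ordered[len(ordered) // 2]  # otherwise a middle-length form


if __name__ == "__main__":
    for v in (7.0, 4.5, 2.0):
        print(v, select_response(THANKS_VARIANTS, v))
```

Sorting by `len` mirrors the claim language, which distinguishes expression forms by number of characters rather than by politeness level or duration.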

[0011] The present invention is not limited to the above embodiment. For example, the speaking rate of the input speech may be measured using vowel similarity or temporal changes in the spectrum, or, when continuously uttered digits are the recognition target, using the length of the pauses between them. In addition, the voice response may be controlled by controlling its speaking rate together with its content (expression form). In short, the present invention can be implemented with various modifications without departing from its gist.
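For the pause-based variant mentioned for digit strings, a minimal Python sketch might look like the following. It assumes a separate, unspecified detector has already labelled each frame as speech or silence; the function name, the frame labelling, and the choice to ignore leading and trailing silence are assumptions, not details from the source.

```python
from typing import List, Sequence


def mean_pause_length(is_speech: Sequence[bool], frame_period_s: float) -> float:
    """Mean duration in seconds of silent gaps between speech segments.

    Silence before the first and after the last speech frame is ignored,
    since it reflects recording margins rather than the speaker's pauses.
    """
    frames = list(is_speech)
    if True not in frames:
        return 0.0
    first = frames.index(True)
    last = len(frames) - 1 - frames[::-1].index(True)

    pauses: List[int] = []  # lengths of interior silent runs, in frames
    run = 0
    for flag in frames[first : last + 1]:
        if flag:
            if run:
                pauses.append(run)
                run = 0
        else:
            run += 1
    if not pauses:
        return 0.0
    return sum(pauses) / len(pauses) * frame_period_s


if __name__ == "__main__":
    # Three digits separated by pauses of 2 and 4 frames, 10 ms per frame.
    flags = [False, True, True, False, False, True,
             False, False, False, False, True, False]
    print(mean_pause_length(flags, frame_period_s=0.01))
```

A shorter mean pause would then be read as a faster speaker; how that value maps onto the rate v used by the response controller is left open here, as it is in the text.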

[0012]

[Effect of the Invention] As described above, according to the present invention, the speaking rate, which reflects the speaker's temperament well, is detected, and the expression form (number of characters) of the response sentence is controlled accordingly, so the naturalness of the dialogue with the speaker can be enhanced. As a result, great practical benefits are obtained, such as no longer irritating the speaker.

[Brief Description of the Drawings]

FIG. 1 is a schematic block diagram of a speech recognition response device according to an embodiment of the present invention.

[Explanation of Reference Numerals]

11 ... Analyzer
12 ... Speech pattern memory
13 ... Speech recognition unit
14 ... Dictionary memory
15 ... Pitch extractor
16 ... Speaking rate measuring unit
17 ... Voice response control unit
18 ... Voice response output unit

Continuation of the front page (51) Int.Cl. 6 Identification code Internal reference number FI Technical display location G06F 17/28

Claims (1)

[Claims]

1. A speech recognition response device that recognizes input speech and outputs, as speech, a response sentence to the recognition result, characterized by comprising:
measuring means for measuring the speaking rate of the input speech; and
control means for which expression forms with different numbers of characters are prepared in advance for response sentences expressing the same meaning, and which controls the expression form of the response sentence according to the speaking rate measured by the measuring means.
JP4139390A 1992-05-29 1992-05-29 Speech recognition response device Expired - Lifetime JPH0731508B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP4139390A JPH0731508B2 (en) 1992-05-29 1992-05-29 Speech recognition response device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP4139390A JPH0731508B2 (en) 1992-05-29 1992-05-29 Speech recognition response device

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
JP58091809A Division JPH0721759B2 (en) 1983-05-25 1983-05-25 Speech recognition response device

Publications (2)

Publication Number Publication Date
JPH05173589A JPH05173589A (en) 1993-07-13
JPH0731508B2 JPH0731508B2 (en) 1995-04-10

Family

ID=15244190

Family Applications (1)

Application Number Title Priority Date Filing Date
JP4139390A Expired - Lifetime JPH0731508B2 (en) 1992-05-29 1992-05-29 Speech recognition response device

Country Status (1)

Country Link
JP (1) JPH0731508B2 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3489772B2 (en) * 1996-11-07 2004-01-26 株式会社リコー Work support system
JP4882152B2 (en) * 2001-01-24 2012-02-22 ヤマハ株式会社 Speech speed detection method and audio signal processing apparatus
JP6738781B2 (en) * 2017-09-14 2020-08-12 日本電信電話株式会社 Pseudo response transmission device, mutual expression expression learning device, information terminal device, communication system, pseudo response transmission method, mutual expression representation learning method and pseudo response transmission program

Also Published As

Publication number Publication date
JPH05173589A (en) 1993-07-13

Similar Documents

Publication Publication Date Title
Averbuch et al. Experiments with the TANGORA 20,000 word speech recognizer
Furui Toward spontaneous speech recognition and understanding
US20040193421A1 (en) Synthetically generated speech responses including prosodic characteristics of speech inputs
WO2007148493A1 (en) Emotion recognizer
CN113539239B (en) Voice conversion method and device, storage medium and electronic equipment
Fellbaum et al. Principles of electronic speech processing with applications for people with disabilities
Gilbert et al. Intelligent virtual agents for contact center automation
JPS6138479B2 (en)
JP4230142B2 (en) Hybrid oriental character recognition technology using keypad / speech in adverse environment
JPH0731508B2 (en) Speech recognition response device
JPH0721759B2 (en) Speech recognition response device
JPH0922296A (en) Sensitivity information input processing device and processing method therefor
Pranjol et al. Bengali speech recognition: An overview
Schramm et al. A brazilian portuguese language corpus development.
JPH05119793A (en) Method and device for speech recognition
Ajayi et al. Acoustic nudging-based model for vocabulary reformulation in continuous yorubá speech recognition
JP3110025B2 (en) Utterance deformation detection device
JPH0634175B2 (en) Text-to-speech device
JPS58186836A (en) Voice input data processor
JP2578771B2 (en) Voice recognition device
JPH0157370B2 (en)
JPS5952378A (en) translation device
JPS58123596A (en) Voice recognition system jointly using auxiliary information
Zadeh Technology of speech for a computer system
JP2888847B2 (en) Text-to-speech apparatus and method, and language processing apparatus and method