JP4535804B2

JP4535804B2 - Spoken dialogue sequence state notation method, program, and spoken dialogue apparatus

Info

Publication number: JP4535804B2
Application number: JP2004236510A
Authority: JP
Inventors: 陽助荒金; 真之高橋; 正次村中; 真理子関口; 伸浩阿部
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2004-08-16
Filing date: 2004-08-16
Publication date: 2010-09-01
Anticipated expiration: 2024-08-16
Also published as: JP2006053470A

Description

本発明は音声対話装置に関し、特に対話シーケンスの状態の表記方法に関する。 The present invention relates to a voice interaction device, and more particularly to a method for expressing a state of a conversation sequence.

従来のＣＴＩ（ＣｏｍｐｕｔｅｒＴｅｌｅｐｈｏｎｙＩｎｔｅｇｒａｔｉｏｎ）などの音声対話システムでは、対話自体をプログラムとして記述する例が多い。各音声入力に対して、応答処理をコードで書き下ろすことにより、様々な記述が可能なのが特徴である。 In a speech dialogue system such as a conventional CTI (Computer Telephony Integration), there are many examples where the dialogue itself is described as a program. The feature is that various descriptions can be made by writing down the response process with a code for each voice input.

また、ＶｏｉｃｅＸＭＬ（ｅＸｔｅｎｓｉｂｌｅＭａｒｋｕｐＬａｎｇｕａｇｅ）では、各状態毎にスクリプトを記述し、ブラウザがそれを解析することで、対話処理を行っている（非特許文献１）。ＶｏｉｃｅＸＭＬでは、可読性の高いＸＭＬを活用することで、コンテンツ生産性の向上を狙っている。
ＦＩＴ（情報科学技術フォーラム）２００３，Ｆ−００６，ＰＰ．２１５−２１６ In VoiceXML (extensible Markup Language), a script is described for each state, and a browser analyzes the script (Non-Patent Document 1). VoiceXML aims to improve content productivity by using highly readable XML.
FIT (Information Science and Technology Forum) 2003, F-006, PP. 215-216

音声ポータルやカーナビゲーション装置などのユビキタス環境からの情報アクセスにおいては、音声対話インタフェースが提供されることが多くなってきている。将来的に様々な音声サービスの追加や状況に応じた音声サービスの修正を適宜行うためには、音声対話シナリオ作成・修正の容易性が必要となる。しかし、従来の状態遷移を記述する技術では、音声入力に対する応答や状態遷移をプログラムとしてコーディングしていたり、ＶｏｉｃｅＸＭＬなどでは一状態毎にシナリオを書き下ろしているため、専門的なプログラミングスキルが必要であったり、一部の修正がプログラム全体に及ぼす影響を推定することが困難（些細なサービスの修正を行うために、プログラム全体の多数箇所の修正が必要）であったりした。 For information access from a ubiquitous environment such as a voice portal or a car navigation device, a voice dialogue interface is often provided. In order to add various voice services in the future and to modify voice services according to the situation as needed, it is necessary to facilitate the creation and correction of voice conversation scenarios. However, with the conventional technology for describing state transitions, response and state transitions to voice input are coded as programs, and VoiceXML and the like write down scenarios for each state, so specialized programming skills are required. , It was difficult to estimate the effect of some modifications on the entire program (in order to make minor service modifications, it was necessary to modify many parts of the entire program).

また、カーナビゲーション装置のように高速移動環境での利用においては、センタサーバとクライアント端末との回線が切断されることがある。従来のｗｅｂを用いたテレマティクスサービスでは、ｃｏｏｋｉｅを利用することでユーザＩＤなどのパラメータ保持を行ってはいるが、状態遷移の管理は行っておらず、再接続時にはメインメニュー（トップメニュー）から再び階層をたどることが必要となっている。また、ＣＴＩなどの音声対話システムにおいても、ＶｏｉｃｅＸＭＬなどを用いることによって記述の容易性は高めているが、依然、状態ごとにＸＭＬファイルを切り替えているために再接続時の状態復帰を十分にサポートしているとは言い難い。 In addition, when used in a high-speed moving environment such as a car navigation device, the line between the center server and the client terminal may be disconnected. In the conventional telematics service using web, parameters such as user ID are retained by using cookie, but state transition is not managed, and when reconnecting, the main menu (top menu) is used again. It is necessary to follow. In addition, in voice dialogue systems such as CTI, the ease of description is enhanced by using VoiceXML, but since the XML file is still switched for each state, state restoration at reconnection is fully supported. It ’s hard to say.

本発明の目的は、シナリオの追加や修正が容易な、音声対話シーケンス状態表記方法および音声対話装置を提供することにある。 An object of the present invention is to provide a spoken dialogue sequence state notation method and a spoken dialogue apparatus in which scenarios can be easily added and modified.

本発明は、音声対話シーケンス（音声対話をある目的に利用するために、音声対話開始時から目的が達成された状態に至るまでの音声のやりとり）の状態を、目的を達成するために必要な情報が格納されているスロットの状態と一意に関連づけて表現するものである。 The present invention requires a state of a voice conversation sequence (speech exchange from the start of a voice conversation to a state where the purpose is achieved in order to use the voice conversation for a certain purpose) to achieve the purpose. The information is uniquely associated with the state of the slot in which the information is stored.

ここで、スロットの状態は、スロットの中に情報が格納されていない状態である「空スロット」、スロットの中に情報は格納されているが、該情報はユーザが確認して確定していない状態である「未確定スロット」、スロットの中に格納された情報がユーザに確認、確定された状態である「確定スロット」、複数のスロットに対してユーザが同時に音声入力を行なった際に、ユーザが認識結果が正しくないと言った場合に、各スロットの情報を確認中である「逐次確認中スロット」の４つの状態をとる。 Here, the state of the slot is “empty slot” in which no information is stored in the slot, and information is stored in the slot, but the information is not confirmed by the user. "Undetermined slot" which is the state, information stored in the slot is confirmed to the user, "confirmed slot" which is the confirmed state, when the user performs voice input simultaneously to a plurality of slots, When the user says that the recognition result is not correct, four states of “sequentially checking slot” in which information of each slot is being checked are taken.

音声対話シーケンスをスロット状態で表すことで、シナリオ追加や修正の容易性と、再接続時の状態復帰が可能となる。 By expressing the voice interaction sequence in the slot state, it is possible to easily add or modify a scenario and to return the state at the time of reconnection.

本発明では、音声対話シーケンスの状態を２ビット（＝４状態）のスロット状態で表すことで、音声入力の受理可否を１ビットで記述可能、かつ遷移先状態もスロット状態で一意に確定する。従って、網羅性に優れ、状態遷移図に記述できる内容はすべて正常系であると共に、記述できない内容は受理しないため基本的に異常系という概念がなく、対話シーケンスの構築およびメンテナンスが非常に容易になると言う効果がある。 In the present invention, by expressing the state of the voice interaction sequence as a slot state of 2 bits (= 4 states), it is possible to describe the acceptability of the voice input by 1 bit, and the transition destination state is uniquely determined by the slot state. Therefore, it is excellent in completeness, and all the contents that can be described in the state transition diagram are normal systems, and contents that cannot be described are not accepted, so there is basically no concept of abnormal systems, and the construction and maintenance of dialog sequences is very easy. There is an effect to say.

また、本対話シーケンスは、スロット状態にのみ対話状態が依存するため、対話シーケンス名とスロット状態を保持するだけで、再開処理が可能となるため、複雑な再開処理やフロー再生を必要とせずに必要十分な再開処理の実装が容易になると言う効果がある。 In addition, since this dialog sequence depends only on the slot state, the restart process can be performed simply by holding the dialog sequence name and the slot state, so there is no need for complicated restart processing or flow playback. There is an effect that it becomes easy to implement necessary and sufficient restart processing.

次に、本発明の実施の形態について図面を参照して説明する。 Next, embodiments of the present invention will be described with reference to the drawings.

図１は本発明の一実施形態の音声対話装置の構成図である。本実施形態の音声対話装置はと音声入出力受付部１とデータ入出力受付部２とシナリオＤＢ３と認識辞書ＤＢ４と音声認識部５とシナリオ解析部６を有している。 FIG. 1 is a block diagram of a voice interactive apparatus according to an embodiment of the present invention. The voice interaction apparatus of this embodiment includes a voice input / output receiving unit 1, a data input / output receiving unit 2, a scenario DB 3, a recognition dictionary DB 4, a voice recognition unit 5, and a scenario analysis unit 6.

音声入出力受付部２は、シナリオ解析部６から出力された音声ガイダンスをユーザの端末装置７に出力し、またユーザが音声ガイダンスに対して端末装置７から入力した音声入力を受付け、シナリオ解析部６に出力する。データ入出力受付部２はシナリオ解析部６から出力された対話シーケンス名とスロット状態をユーザの端末装置７に出力し、また音声対話シーケンスの再開時などに端末装置７から入力された対話シーケンス名とスロット状態をシナリオ解析部６に出力する。シナリオＤＢ３には、シナリオ解析部６が音声対話を制御する際に用いる音声対話シナリオが音声対話シーケンス毎に格納されている、認識辞書ＤＢ４は、音声認識部５が音声認識する際に用いる認識文法を格納している。音声認識部５は、シナリオ解析部６から出力された音声に対して、シナリオ解析部６から指定された音声認識文法を用いて音声認識処理を行い、その認識結果をシナリオ解析部６に出力する。シナリオ解析部６は、音声ガイダンスを音声入出力受付部１に、対話シーケンス名とスロット状態（空スロット、未確定スロット、確定スロット、逐次確認中スロット）をデータ入出力受付部２に出力し、音声入出力受付部１からの該音声ガイダンスに対応するユーザの音声入力に対して、音声認識部５に指示して適切な音声認識文法にて音声認識処理を行わせ、その音声認識結果に応じてシナリオＤＢ３中の音声対話シナリオの状態を遷移させると共に、音声対話再開時にはデータ入出力受付部２より出力された対話シーケンス名およびスロット状態によって音声対話を再開する。 The voice input / output reception unit 2 outputs the voice guidance output from the scenario analysis unit 6 to the user terminal device 7, and receives the voice input input from the terminal device 7 by the user to the voice guidance, and the scenario analysis unit 6 is output. The data input / output accepting unit 2 outputs the dialogue sequence name and slot state output from the scenario analysis unit 6 to the user terminal device 7, and the dialogue sequence name input from the terminal device 7 when the voice dialogue sequence is resumed. And the slot state are output to the scenario analysis unit 6. The scenario DB 3 stores, for each voice conversation sequence, a voice conversation scenario used when the scenario analysis unit 6 controls the voice conversation. The recognition dictionary DB 4 uses a recognition grammar used when the voice recognition unit 5 performs voice recognition. Is stored. The speech recognition unit 5 performs speech recognition processing on the speech output from the scenario analysis unit 6 using the speech recognition grammar specified by the scenario analysis unit 6 and outputs the recognition result to the scenario analysis unit 6. . The scenario analysis unit 6 outputs the voice guidance to the voice input / output receiving unit 1 and the dialog sequence name and the slot state (empty slot, unconfirmed slot, confirmed slot, sequentially confirmed slot) to the data input / output receiving unit 2. In response to the voice input of the user corresponding to the voice guidance from the voice input / output receiving unit 1, the voice recognition unit 5 is instructed to perform voice recognition processing with an appropriate voice recognition grammar, and according to the voice recognition result Then, the state of the voice dialogue scenario in the scenario DB 3 is changed, and at the time of resuming the voice dialogue, the voice dialogue is resumed by the dialogue sequence name and the slot state output from the data input / output accepting unit 2.

［第１の実施形態］
シナリオ記述例
本発明のシナリオ記述例を図２に示す。 [First Embodiment]
Scenario Description Example FIG. 2 shows a scenario description example of the present invention.

以下、流れに沿って説明する。 Hereinafter, it demonstrates along a flow.

初期状態はシナリオが受け付け可能なスロット個数のＸが並んだ状態とする。図２のシナリオでは、店舗の種類スロット数を２としているため、初期状態はＸＸとなる。音声入出力受付部１は、シナリオ記述のＧｕｉｄａｎｃｅに定義された文字列をガイダンスとしてユーザに対し読み上げ処理を行う。この状態で受理可能な入力に対してはシナリオ記述欄に１を、受理不能な入力に対してはシナリオ記述欄に０が記入されている。 The initial state is a state in which the number of slots X that can accept a scenario is arranged. In the scenario of FIG. 2, since the number of store type slots is 2, the initial state is XX. The voice input / output receiving unit 1 performs a reading process for the user using the character string defined in the Guidance of the scenario description as guidance. In this state, 1 is entered in the scenario description column for inputs that can be accepted, and 0 is entered in the scenario description column for inputs that cannot be accepted.

例えば、図２のシナリオにおいて、お店の種類として、「和食」「洋食」「中華」が受理可能であるとする。その場合、このシナリオが受理可能な入力と発声例との比較は下記の通りである。
ＳＬａ：和食
ＳＬａＳＬａ：和食か洋食
ＳＬａＮＯＴＳＬａ：洋食ではなくて中華
ＮＯＴＳＬａ：そうじゃなくて洋食
ＡＮＹ：なんでもよい
ＹＥＳ：はい
ＮＯ：いいえ
初期状態ＸＸ
初期状態ＸＸでは、音声入出力受付部１は「お店の入力は」というガイダンスを流し、Ｓｅｃ１＿ＸＸ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。なお、Ｓｅｃ１＿ＸＸ．ｂｎｆは認識が受理可能な語彙をＢＮＦ文法によって記述したファイルのファイル名であり、例えば図３に示すような記述となる。図２に示すシナリオでは、初期状態ＸＸにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＳＬａ｜ＳＬａＳＬａ｜ＡＮＹ）が受理可能となる。なお、「戻る」という入力については、シナリオ解析部６において前状態をスタック保持するため、シナリオに記述はせず、シナリオ解析部６において解析・処理される。 For example, in the scenario of FIG. 2, it is assumed that “Japanese food”, “Western food”, and “Chinese food” are acceptable as the types of shops. In that case, the comparison between the input that can be accepted by this scenario and the utterance example is as follows.
SLa: Japanese food SLa SLa: Japanese food or Western food SLa NOT SLa: Chinese food, not Western food SLA: Western food, ANY: Anything, YES: Yes NO: No Initial state XX
In the initial state XX, the voice input / output receiving unit 1 plays the guidance “Is the store input?” And Sec1_XX. bnf is set in the voice recognition unit 5 to wait for user input. Note that Sec1_XX. bnf is a file name of a file in which vocabulary that can be recognized is described in BNF grammar, for example, as shown in FIG. In the scenario shown in FIG. 2, the input that can be accepted in the initial state XX is an input in which 1 is entered in the column. In this case, (SLa | SLa SLa | ANY) can be accepted. Note that the input “return” is analyzed and processed in the scenario analysis unit 6 without being described in the scenario because the previous state is stacked and held in the scenario analysis unit 6.

初期状態ＸＸにおいて「和食」と入力された場合、Ｓｅｃ１＿ＸＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿和食”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたとみなす。そして、音声認識部５からの返値の内容（カテゴリ“ＦＯＯＤ＿”を除いたもの）によってスロットＸの前方よりスロットを埋める。この場合、全てのスロットが空のため、一番目のスロットＸに“和食”がセットされる。ここで、スロットＸの“和食”は音声認識の誤認識などを考えるとユーザの確認が行われていない未確認入力の状態である。従って、一番目のスロットＸはスロットＹとなり、二番目のスロットＸは変化がないため、状態ＹＸに遷移することになる。 When “Japanese food” is input in the initial state XX, Sec1_XX. In accordance with bnf, the value {“FOOD_Japanese food”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Then, the slot is filled from the front of the slot X by the content of the return value from the voice recognition unit 5 (excluding the category “FOOD_”). In this case, since all slots are empty, “Japanese food” is set in the first slot X. Here, “Japanese food” in the slot X is an unconfirmed input state in which the user is not confirmed in consideration of misrecognition of voice recognition or the like. Accordingly, the first slot X becomes the slot Y, and the second slot X does not change, so the state transitions to the state YX.

また、初期状態ＸＸにおいて「和食か洋食」と入力された場合、Ｓｅｃ１＿ＸＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿和食”，“ＯＲ”，“ＦＯＯＤ＿洋食”｝という値が返る。シナリオ解析部６は二つのスロットに対して入力があると判断し、［ＳＬａＳＬａ］の入力がなされたとみなす。そして、スロットＸの前方よりスロットを埋める。この場合、全てのスロットが空のため、一番目のスロットＸに“和食”が、二番目のスロットＸに“洋食”がセットされる。ここで、スロットＸの“和食”およびスロットＸの“洋食”はユーザの確認が行われていない未確認入力の状態である。従って、一番目のスロットＸはスロットＹとなり、二番目のスロットＸはスロットＹとなり、状態ＹＹに遷移することになる。 In addition, when “Japanese or Western food” is input in the initial state XX, Sec1_XX. According to bnf, the value {“FOOD_Japanese food”, “OR”, “FOOD_Western food”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there are inputs to the two slots, and considers that [SLa SLa] has been input. Then, the slot is filled from the front of the slot X. In this case, since all slots are empty, “Japanese food” is set in the first slot X, and “Western food” is set in the second slot X. Here, “Japanese food” in slot X and “Western food” in slot X are unconfirmed input states that have not been confirmed by the user. Therefore, the first slot X becomes slot Y, the second slot X becomes slot Y, and the state transitions to YY.

また、初期状態ＸＸにおいて「何でも良い」と入力された場合、Ｓｅｃ１＿ＸＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＡＮＹ”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＡＮＹ］の入力がなされたと見なす。そして、スロットＸの前方よりスロットを埋める。この場合、全てのスロットが空のため、一番目のスロットＸに“ＡＮＹ”がセットされる。ここで、スロットＸの“ＡＮＹ”は未確認入力の状態である。従って、一番目のスロットＸはスロットＹとなり、二番目のスロットＸは変化がないため、状態としてスロットＹＸに遷移することになる。 Further, when “anything” is input in the initial state XX, Sec1_XX. In accordance with bnf, the value {“ANY”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [ANY] has been input. Then, the slot is filled from the front of the slot X. In this case, since all slots are empty, “ANY” is set in the first slot X. Here, “ANY” in the slot X is an unconfirmed input state. Accordingly, the first slot X becomes the slot Y, and the second slot X does not change, so the state transitions to the slot YX.

状態ＹＸ
状態ＹＸでは、一つのスロットが未確認情報で埋まった状態となる。そこで、未確認情報を確認し確定情報とする対話を行う。音声入出力受付部１は「入力は和食でよろしいですか」といったガイダンスを流し、Ｓｅｃ１＿ＹＸ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。図２に示すシナリオでは、状態ＹＸにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＳＬａ｜ＳＬａＳＬａ｜ＳＬａＮＯＴＳＬａ｜ＮＯＴＳＬａ｜ＡＮＹ｜ＹＥＳ｜ＮＯ）が受理可能となる。 State YX
In the state YX, one slot is filled with unconfirmed information. Therefore, a dialogue is performed in which unconfirmed information is confirmed and confirmed. The voice input / output receiving unit 1 plays a guidance such as “Are you sure you want to input Japanese food?”, And Sec1_YX. bnf is set in the voice recognition unit 5 to wait for user input. In the scenario shown in FIG. 2, the input that can be accepted in the state YX is an input in which 1 is entered in the column. In this case, (SLa | SLa SLa | SLa NOT SLa | NOT SLa | ANY | YES | NO) is accepted. It becomes possible.

状態ＹＸで「洋食」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿洋食”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定または空のスロットの前方よりスロットを埋める。この場合、一番目のスロットＹが未確定であり、既にスロットＹに入っている値“和食”と異なった入力“洋食”受理されたため、スロットＹの内容が“洋食”がセットされる。従って、スロット状態はＹＸとなり、同じ状態ＹＸに戻り「入力は洋食でよろしいですか」というガイダンスとなる。 When “Western food” is input in the state YX, Sec1_YX. In accordance with bnf, the value {“FOOD_Western food”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Since the current slot state is YX, the slot is filled from the front of the indeterminate or empty slot. In this case, since the first slot Y is unconfirmed and an input “Western food” different from the value “Japanese food” already in the slot Y is accepted, the content of the slot Y is set to “Western food”. Accordingly, the slot state becomes YX, and the guidance returns to the same state YX as “Are you sure you want to input Western food?”.

なお、状態ＹＸで「和食」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿和食”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定または空のスロットの前方よりスロットを埋める。この場合、一番目のスロットＹが未確定であり、既にスロットＹに入っている値“和食”と同じ入力が受理されたため、スロットＹの内容はそのまま“和食”がセットされ、同じ状態ＹＸに戻り「入力は和食でよろしいですか」というガイダンスが繰り返されることになる。 When “Japanese food” is input in the state YX, Sec1_YX. In accordance with bnf, the value {“FOOD_Japanese food”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Since the current slot state is YX, the slot is filled from the front of the indeterminate or empty slot. In this case, since the first slot Y is unconfirmed and the same input as the value “Japanese food” already in the slot Y is accepted, the contents of the slot Y are set to “Japanese food” as they are, and the same state YX is set. The guidance “Return input is Japanese food?” Will be repeated.

状態ＹＸで「洋食か中華」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿洋食”，“ＯＲ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は、二つのスロットに対して入力があると判断し、［ＳＬａＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定または空のスロットの前方よりスロットを埋める。この場合、一番目のスロットＹが未確定であり、また入力“洋食”、“中華”とは異なった値“和食”がセットされているため、一番目のスロットＹの内容が“洋食”がリセットされ、二番目のスロットＸは空のため、スロットＸの内容が“中華”にセットされる。従って、一番目のスロットＹはスロットＹとなり、二番目のスロットＸはスロットＹとなり、状態ＹＹに遷移することになる。 When “Western food or Chinese” is input in the state YX, Sec1_YX. According to bnf, the value {“FOOD_Western food”, “OR”, “FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there are inputs to the two slots, and considers that [SLa SLa] has been input. Since the current slot state is YX, the slot is filled from the front of the indeterminate or empty slot. In this case, since the first slot Y is unconfirmed, and the value “Japanese food” different from the input “Western food” and “Chinese food” is set, the content of the first slot Y is “Western food”. Since the second slot X is reset, the contents of the slot X are set to “Chinese”. Therefore, the first slot Y becomes slot Y, the second slot X becomes slot Y, and the state transitions to YY.

また、状態ＹＸで「和食か洋食」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿和食”，“ＯＲ”，“ＦＯＯＤ＿洋食”｝という値が返る。シナリオ解析部６は、二つのスロットに対して入力があると判断し、［ＳＬａＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定または空のスロットの前方よりスロットを埋める。この場合、一番目のスロットＹが未確定であり、また入力“和食”と同じ値がセットされているため、スロットＹの内容は“和食”で保持され、二番目のスロットＸは空のため、スロットＸの内容が“洋食”にセットされる。従って、スロットＹはスロットＹとなり、スロットＸはスロットＹとなり、状態ＹＹに遷移することになる。 In addition, when “Japanese or Western food” is input in the state YX, Sec1_YX. According to bnf, the value {“FOOD_Japanese food”, “OR”, “FOOD_Western food”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there are inputs to the two slots, and considers that [SLa SLa] has been input. Since the current slot state is YX, the slot is filled from the front of the indeterminate or empty slot. In this case, since the first slot Y is unconfirmed and the same value as the input “Japanese food” is set, the content of the slot Y is held as “Japanese food”, and the second slot X is empty. , The contents of slot X are set to “Western food”. Therefore, slot Y becomes slot Y, slot X becomes slot Y, and transitions to state YY.

状態ＹＸで、「和食ではなくて洋食」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿和食”，“ＮＯＴ”，“ＦＯＯＤ＿洋食”｝という値が返る。シナリオ解析部６は、一つのスロットに対して修正入力があると判断し、［ＳＬａＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定のスロットＹの値“和食”を参照し、それを修正入力値である“洋食”にリセットする。従って、スロットＹはスロットＹとなり、スロットＸはそのままとなり、状態ＹＸに遷移することになる。 In the state YX, when “Western food instead of Japanese food” is input, Sec1_YX. In accordance with bnf, the values {“FOOD_Japanese food”, “NOT”, “FOOD_Western food”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input for one slot, and considers that [SLa NOT SLa] has been input. Since the current slot state is YX, the value “Japanese food” of the undetermined slot Y is referred to and reset to “Western food” which is the corrected input value. Therefore, the slot Y becomes the slot Y, the slot X remains as it is, and a transition is made to the state YX.

状態ＹＸで、「そうじゃなくて洋食」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯＴ”，“ＦＯＯＤ＿洋食”｝という値が返る。シナリオ解析部６は、一つのスロットに対して修正入力があると判断し、［ＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定のスロットＹの値“和食”を参照し、それを修正入力値である“洋食”にリセットする。従って、スロットＹはスロットＹとなり、スロットＸはそのままとなり、状態ＹＸに遷移することになる。 In the state YX, when “Western food is not so” is entered, Sec1_YX. In accordance with bnf, the values {“NOT”, “FOOD_Western food”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input for one slot, and considers that [NOT SLa] has been input. Since the current slot state is YX, the value “Japanese food” of the undetermined slot Y is referred to and reset to “Western food” which is the corrected input value. Therefore, the slot Y becomes the slot Y, the slot X remains as it is, and a transition is made to the state YX.

状態ＹＸで、「なんでもよい」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＡＮＹ”｝という値が返る。シナリオ解析部６は、一つのスロットに対して入力があると判断し、［ＡＮＹ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定のスロットＹの値“和食”を参照し、入力値“ＡＮＹ”と異なるためそれを修正入力値である“ＡＮＹ”にリセットする。従って、スロットＹはスロットＹとなり、スロットＸはそのままとなり、状態ＹＸに遷移することになる。 When “anything” is input in the state YX, Sec1_YX. In accordance with bnf, the value {“ANY”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [ANY] has been input. Since the current slot state is YX, the value “Japanese food” of the undetermined slot Y is referred to, and since it is different from the input value “ANY”, it is reset to the corrected input value “ANY”. Therefore, the slot Y becomes the slot Y, the slot X remains as it is, and a transition is made to the state YX.

状態ＹＸで、「はい」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＹＥＳ”｝という値が返る。シナリオ解析部６は、確認ガイダンスに対して受諾意志の入力があると判断し、［ＹＥＳ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定のスロットＹの値が確定スロットＸとなり、スロットＸはそのままとなり、スロット状態はＺＸとなる。ここで、スロット数＝２に対して、決定スロット＝１が下回るため、この入力だけでよいか確認するシーケンスに遷移する。これは、ＺＸに遷移して良いか確認する（ｃｏｎｆｉｒｍａｔｉｏｎ）ということから、状態ＺＸＣとする。従って、状態ＹＸで、「はい」と入力された場合、状態ＺＸＣに遷移することになる。但し、受諾されたＹが“ＡＮＹ”の場合は、スロット状態をＺＺ＝｛“ＡＮＹ”，“ＡＮＹ”｝とし、Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔを実行する。 If “Yes” is input in the state YX, Sec1_YX. In accordance with bnf, the value {“YES”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an acceptance intention input for the confirmation guidance, and considers that [YES] has been input. Since the current slot state is YX, the value of the undetermined slot Y becomes the confirmed slot X, the slot X remains as it is, and the slot state becomes ZX. Here, since the decision slot = 1 falls below the number of slots = 2, the sequence shifts to a sequence for confirming whether only this input is necessary. This is a state ZXC because it is confirmed whether or not the transition to ZX is allowed (confirmation). Therefore, when “Yes” is input in the state YX, the state transitions to the state ZXC. However, if the accepted Y is “ANY”, the slot state is set to ZZ = {“ANY”, “ANY”}, and Sec1_ZZ. Run script.

状態ＹＸで、「いいえ」と入力された場合、Ｓｅｃ１＿ＹＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯ”｝という値が返る。シナリオ解析部６は、確認ガイダンスに対して拒否意志の入力があると判断し、［ＮＯ］の入力がなされたと見なす。現状のスロット状態はＹＸであるため、未確定のスロットＹの値が空スロットＸとなり、二番目のスロットＸはそのままとなり、状態ＸＸに遷移することになる。 When “No” is input in the state YX, Sec1_YX. In accordance with bnf, the value {“NO”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a rejection intention input for the confirmation guidance, and considers that [NO] has been input. Since the current slot state is YX, the value of the undetermined slot Y becomes the empty slot X, the second slot X remains as it is, and a transition is made to the state XX.

状態ＹＹ
状態ＹＹでは、二つのスロットが未確認情報で埋まった状態となる。そこで、未確認情報を確認し確定情報とする対話を行う。音声入出力受付部１は「入力は和食と洋食でよろしいですか」といったガイダンスを流し、Ｓｅｃ１＿ＹＹ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。図２に示すシナリオでは、状態ＹＹにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＳＬａ｜ＳＬａＳＬａ｜ＳＬａＮＯＴＳＬａ｜ＮＯＴＳＬａ｜ＡＮＹ｜ＹＥＳ｜ＮＯ）が受理可能となる。ここでは、状態ＹＹの受理可能入力と状態ＹＸの受理可能入力が完全に同一のため、認識文法はＳｅｃ１＿ＹＸ．ｂｎｆと同一となる。 State YY
In the state YY, the two slots are filled with unconfirmed information. Therefore, a dialogue is performed in which unconfirmed information is confirmed and confirmed. The voice input / output reception unit 1 circulates guidance such as “Do you want to input Japanese and Western foods?”, Sec1_YY. bnf is set in the voice recognition unit 5 to wait for user input. In the scenario shown in FIG. 2, the input that can be accepted in the state YY is an input in which 1 is entered in the column. In this case, (SLa | SLa SLa | SLa NOT SLa | NOT SLa | ANY | YES | NO) is accepted. It becomes possible. Here, since the acceptable input in the state YY and the acceptable input in the state YX are completely the same, the recognition grammar is Sec1_YX. It is the same as bnf.

状態ＹＹで「中華」と入力された場合、Ｓｅｃ１＿ＹＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＹであるため、未確定スロットである一番目のＹおよび二番目のＹの値を抹消し、入力値“中華”によって前方よりスロットを埋める。従って、状態ＹＸに遷移することとなる。 When “Chinese” is input in the state YY, Sec1_YY. In accordance with bnf, the value {“FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Since the current slot state is YY, the values of the first Y and the second Y which are unconfirmed slots are deleted, and the slot is filled from the front with the input value “Chinese”. Therefore, the state transitions to the state YX.

状態ＹＹで「洋食か中華」と入力された場合、Ｓｅｃ１＿ＹＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿洋食”，“ＯＲ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は二つのスロットに対して入力があると判断し、［ＳＬａＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＹであるため、未確定スロットである一番目のＹおよび二番目のＹの値を抹消し、入力値“洋食”と“中華”によって前方よりスロットを埋める。従って、同じ状態ＹＹに遷移することとなる。 When “Western food or Chinese” is input in the state YY, Sec1_YY. According to bnf, the value {“FOOD_Western food”, “OR”, “FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there are inputs to the two slots, and considers that [SLa SLa] has been input. Since the current slot state is YY, the values of the first Y and the second Y, which are indeterminate slots, are deleted, and the slots are filled from the front with the input values “Western food” and “Chinese food”. Therefore, the state transitions to the same state YY.

状態ＹＹで「和食ではなくて中華」と入力された場合、Ｓｅｃ１＿ＹＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿和食”，“ＮＯＴ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して修正入力があると判断し、［ＳＬａＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＹであるため、未確定スロットである一番目のＹおよび二番目のＹの値と修正対象入力“和食”の値を比較し、該当する一番目のスロットＹの値を“中華”に変更する。また、二番目のスロットＹの値は変更はない。従って、同じ状態ＹＹに遷移することとなる。 When “Chinese food is not Japanese food” is input in the state YY, Sec1_YY. In accordance with bnf, the values {“FOOD_Japanese food”, “NOT”, “FOOD_Chinese”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input for one slot, and considers that [SLa NOT SLa] has been input. Since the current slot state is YY, the first and second Y values which are unconfirmed slots are compared with the value of the input to be corrected “Japanese food”, and the value of the corresponding first slot Y is set to “ Change to “Chinese”. The value of the second slot Y is not changed. Therefore, the state transitions to the same state YY.

状態ＹＹで「そうではなくて中華」と入力された場合、Ｓｅｃ１＿ＹＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯＴ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は修正入力があると判断し、［ＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＹＹであるため、未確定スロットである一番目のＹおよび二番目のＹの値を抹消し、入力値“中華”によって前方よりスロットを埋める。従って、スロット状態はＹＸとなり、状態ＹＸに遷移することとなる。 If “Chinese but not so” is input in the state YY, Sec1_YY. In accordance with bnf, the values {“NOT”, “FOOD_Chinese”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input and considers that [NOT SLa] has been input. Since the current slot state is YY, the values of the first Y and the second Y which are unconfirmed slots are deleted, and the slot is filled from the front with the input value “Chinese”. Therefore, the slot state becomes YX, and the state changes to YX.

状態ＹＹで「なんでもよい」と入力された場合、Ｓｅｃ１＿ＹＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＡＮＹ”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＡＮＹ］の入力がなされたと見なす。現状のスロット状態はＹＹであるため、未確定スロットである一番目のＹおよび二番目のＹの値を抹消し、入力値“ＡＮＹ”によって前方よりスロットを埋める。従って、スロット状態はＹＸとなり、状態ＹＸに遷移することとなる。 When “anything” is input in the state YY, Sec1_YY. In accordance with bnf, the value {“ANY”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [ANY] has been input. Since the current slot state is YY, the values of the first Y and the second Y which are unconfirmed slots are deleted, and the slot is filled from the front with the input value “ANY”. Therefore, the slot state becomes YX, and the state changes to YX.

状態ＹＹで「はい」と入力された場合、Ｓｅｃ１＿ＹＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＹＥＳ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して受諾意志の入力があると判断し、［ＹＥＳ］の入力がなされたと見なす。現状のスロット状態はＹＹであるため、一番目の未確定スロットＹが確定スロットＺになり、二番目の未確定スロットＹが確定スロットＺとなり、状態としては状態ＺＺに相当する。ここで、スロット数＝２と同数のスロットが確定スロットとなったため、スロットを埋めるという、本対話シーケンスの目的は達成された。そこで、Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔに記述された処理を実行することとなる。 When “Yes” is input in the state YY, Sec1_YY. In accordance with bnf, the value {“YES”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an acceptance intention input for the confirmation guidance, and considers that [YES] has been input. Since the current slot state is YY, the first unconfirmed slot Y becomes the confirmed slot Z, the second unconfirmed slot Y becomes the confirmed slot Z, and the state corresponds to the state ZZ. Here, since the number of slots equal to the number of slots = 2 has become a definite slot, the purpose of this interactive sequence of filling the slot is achieved. Therefore, Sec1_ZZ. The process described in the script is executed.

状態ＹＹで「いいえ」と入力された場合、Ｓｅｃ１＿ＹＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して拒否意志の入力があると判断し、［ＮＯ］の入力がなされたと見なす。現状のスロット状態はＹＹであり、複数の未確定スロットが存在するため、逐次確認処理に遷移する。そこで逐次確認状態をＷと定義し、状態ＷＷに遷移する。 When “No” is input in the state YY, Sec1_YY. In accordance with bnf, the value {“NO”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a rejection intention input for the confirmation guidance, and considers that [NO] has been input. Since the current slot state is YY and there are a plurality of unconfirmed slots, the process proceeds to the sequential confirmation process. Therefore, the sequential confirmation state is defined as W, and the state transitions to the state WW.

状態ＷＷ
状態ＷＷでは、二つのスロットが要逐次確認情報で埋まった状態となる。そこで、未確認情報を逐次確認し確定情報とする対話を行う。音声入出力受付部１は「最初の入力は和食でよろしいですか」といったガイダンスを流し、Ｓｅｃ１＿ＷＷ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。図２に示すシナリオでは、状態ＷＷにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＳＬａ｜ＳＬａＳＬａ｜ＳＬａＮＯＴＳＬａ｜ＮＯＴＳＬａ｜ＡＮＹ｜ＹＥＳ｜ＮＯ）が受理可能となる。ここでは、状態ＷＷの受理可能入力と状態ＹＸの受理可能入力が完全に同一のため、認識文法はＳｅｃ１＿ＹＸ．ｂｎｆと同一となる。 State WW
In the state WW, two slots are filled with necessary sequential confirmation information. Therefore, a dialogue is performed in which unconfirmed information is sequentially confirmed and confirmed. The voice input / output reception unit 1 sends a guidance such as “Are you sure you want to use Japanese food for the first input?”, And Sec1_WW. bnf is set in the voice recognition unit 5 to wait for user input. In the scenario shown in FIG. 2, the input that can be accepted in the state WW is an input in which 1 is entered in the column. In this case, (SLa | SLa SLa | SLa NOT SLa | NOT SLa | ANY | YES | NO) is accepted. It becomes possible. Here, since the acceptable input in the state WW and the acceptable input in the state YX are completely the same, the recognition grammar is Sec1_YX. It is the same as bnf.

状態ＷＷで「中華」と入力された場合、Ｓｅｃ１＿ＷＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたと見なす。現状のスロット状態はＷＷであるため、確認中の一番目の未確定スロットＷの値“和食”を入力値“中華”によってリセットする。“中華”は未確定状態のため、再び確認するシーケンスに移ることとなり、状態ＷＷに遷移することとなる。 When “Chinese” is input in the state WW, Sec1_WW. In accordance with bnf, the value {“FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Since the current slot state is WW, the value “Japanese food” of the first unconfirmed slot W being confirmed is reset by the input value “Chinese food”. Since “Chinese Chinese” is an indeterminate state, the sequence proceeds to the confirmation sequence again, and the state transitions to the state WW.

状態ＷＷで「洋食か中華」と入力された場合、Ｓｅｃ１＿ＷＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿洋食”，“ＯＲ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は二つのスロットに対して入力があると判断し、［ＳＬａＳＬａ］の入力がなされたと見なす。現状のスロット状態はＷＷであるため、逐次確認中の一番目の未確定スロットＷおよび二番目の未確定スロットＷの値を抹消し、入力値“洋食”と“中華”によって前方よりスロットを埋める。従って、状態ＹＹに遷移することとなる。 When “Western food or Chinese” is input in the state WW, Sec1_WW. According to bnf, the value {“FOOD_Western food”, “OR”, “FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there are inputs to the two slots, and considers that [SLa SLa] has been input. Since the current slot state is WW, the values of the first unconfirmed slot W and the second unconfirmed slot W being sequentially confirmed are deleted, and the slots are filled from the front with the input values “Western food” and “Chinese food”. . Accordingly, the state transitions to the state YY.

状態ＷＷで「和食ではなくて中華」と入力された場合、Ｓｅｃ１＿ＷＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿和食”，“ＮＯＴ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して修正入力があると判断し、［ＳＬａＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＷＷであるため、逐次確認中未の一番目の確定スロットＷの値を“中華”に変更する。“中華”は未確定状態のため、再び確認するシーケンスに移ることとなり、状態ＷＷに遷移することとなる。 In the state WW, when “Chinese instead of Japanese” is input, Sec1_WW. In accordance with bnf, the values {“FOOD_Japanese food”, “NOT”, “FOOD_Chinese”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input for one slot, and considers that [SLa NOT SLa] has been input. Since the current slot state is WW, the value of the first confirmed slot W that is not being sequentially confirmed is changed to “Chinese”. Since “Chinese Chinese” is an indeterminate state, the sequence proceeds to the confirmation sequence again, and the state transitions to the state WW.

状態ＷＷで「そうではなくて中華」と入力された場合、Ｓｅｃ１＿ＷＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯＴ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は修正入力があると判断し、［ＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＷＷであるため、逐次確認中の一番目の未確定スロットＷの値を抹消し、入力値“中華”によってスロットＷを埋める。“中華”は未確定状態のため、再び確認するシーケンスに移ることとなり、状態ＷＷに遷移することとなる。 When “Chinese but not so” is input in the state WW, Sec1_WW. In accordance with bnf, the values {“NOT”, “FOOD_Chinese”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input and considers that [NOT SLa] has been input. Since the current slot state is WW, the value of the first unconfirmed slot W being sequentially confirmed is deleted, and the slot W is filled with the input value “Chinese”. Since “Chinese Chinese” is an indeterminate state, the sequence proceeds to the confirmation sequence again, and the state transitions to the state WW.

状態ＷＷで「なんでもよい」と入力された場合、Ｓｅｃ１＿ＷＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＡＮＹ”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＡＮＹ］の入力がなされたと見なす。現状のスロット状態はＷＷであるため、逐次確認中の一番目の未確定スロットＷを入力値“ＡＮＹ”によってリセットする。従って、スロット状態はＷＷとなり、状態ＷＷに遷移することとなる。 When “anything” is input in the state WW, Sec1_WW. In accordance with bnf, the value {“ANY”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [ANY] has been input. Since the current slot state is WW, the first unconfirmed slot W being sequentially confirmed is reset by the input value “ANY”. Therefore, the slot state becomes WW, and a transition is made to the state WW.

状態ＷＷで「はい」と入力された場合、Ｓｅｃ１＿ＷＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＹＥＳ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して受諾意志の入力があると判断し、［ＹＥＳ］の入力がなされたと見なす。現状のスロット状態はＷＷであるため、逐次確認中の一番目の未確定スロットＷが確定スロットＺになり、状態ＺＷに遷移することとなる。 When “Yes” is input in the state WW, Sec1_WW. In accordance with bnf, the value {“YES”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an acceptance intention input for the confirmation guidance, and considers that [YES] has been input. Since the current slot state is WW, the first unconfirmed slot W that is being sequentially confirmed becomes the confirmed slot Z and transitions to the state ZW.

状態ＷＷで「いいえ」と入力された場合、Ｓｅｃ１＿ＷＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して拒否意志の入力があると判断し、［ＮＯ］の入力がなされたと見なす。現状のスロット状態はＷＷであるため、逐次確認中の一番目の未確定スロットＷが空スロットＸになり、スロットの状態はＸＷとなるが、確定情報（Ｚ）→未確認情報（Ｙ，Ｗ）→空（Ｘ）、の順でスロットをソートするというルールに従い、ＸとＷが入れ替わり、スロット状態はＷＸとなり状態ＷＸに遷移することとなる。 When “No” is input in the state WW, Sec1_WW. In accordance with bnf, the value {“NO”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a rejection intention input for the confirmation guidance, and considers that [NO] has been input. Since the current slot state is WW, the first unconfirmed slot W being sequentially confirmed becomes the empty slot X and the slot state becomes XW, but the confirmation information (Z) → unconfirmed information (Y, W) In accordance with the rule that slots are sorted in the order of empty (X), X and W are interchanged, the slot state becomes WX, and the state changes to WX.

状態ＷＸ
状態ＷＸでは、一つのスロットについて逐次確認を行う状態となる。音声入出力受付部１は「次の入力は洋食でよろしいですか」といったガイダンスを流し、Ｓｅｃ１＿ＸＷ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。シナリオでは、状態ＷＸにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＳＬａ｜ＳＬａＳＬａ｜ＳＬａＮＯＴＳＬａ｜ＮＯＴＳＬａ｜ＡＮＹ｜ＹＥＳ｜ＮＯ）が受理可能となる。ここでは、状態ＷＸの受理可能入力と状態ＹＸの受理可能入力が完全に同一のため、認識文法はＳｅｃ１＿ＹＸ．ｂｎｆと同一となる。 State WX
In the state WX, one slot is sequentially confirmed. The voice input / output reception unit 1 plays a guidance such as “Are you sure you want to input Western food?”, And Sec1_XW. bnf is set in the voice recognition unit 5 to wait for user input. In the scenario, the input that can be accepted in the state WX is an input in which 1 is entered in the column. In this case, (SLa | SLa SLa | SLa NOT SLa | NOT SLa | ANY | YES | NO) can be accepted. Here, since the acceptable input in the state WX and the acceptable input in the state YX are completely the same, the recognition grammar is Sec1_YX. It is the same as bnf.

状態ＷＸで「中華」と入力された場合、Ｓｅｃ１＿ＸＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたと見なす。現状のスロット状態はＷＸであるため、確認中の未確定スロットＷの値“洋食”を入力値“中華”に変更する。従って、スロット状態はＹＸとなり、状態ＹＸに遷移することとなる。 When “Chinese” is input in the state WX, Sec1_XW. In accordance with bnf, the value {“FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Since the current slot state is WX, the value “Western food” of the unconfirmed slot W being confirmed is changed to the input value “Chinese”. Therefore, the slot state becomes YX, and the state changes to YX.

状態ＷＸで「洋食か中華」と入力された場合、Ｓｅｃ１＿ＸＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿洋食”，“ＯＲ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は二つのスロットに対して入力があると判断し、［ＳＬａＳＬａ］の入力がなされたと見なす。現状のスロット状態はＷＸであるため、逐次確認中未確定スロットＷの値を抹消し、入力値“洋食”と“中華”によって前方よりスロットを埋める。従って、状態ＹＹに遷移することとなる。 When “Western food or Chinese” is input in the state WX, Sec1_XW. According to bnf, the value {“FOOD_Western food”, “OR”, “FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there are inputs to the two slots, and considers that [SLa SLa] has been input. Since the current slot state is WX, the value of the undetermined slot W during the sequential confirmation is deleted, and the slot is filled from the front with the input values “Western food” and “Chinese”. Accordingly, the state transitions to the state YY.

状態ＷＸで「洋食ではなくて中華」と入力された場合、Ｓｅｃ１＿ＸＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿洋食”，“ＮＯＴ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して修正入力があると判断し、［ＳＬａＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＷＸであるため、逐次確認中未確定スロットＷの値と入力“洋食”を比較し、それを“中華”に変更する。従って、スロット状態はＹＸとなり、状態ＹＸに遷移することとなる。 When “Chinese instead of Western food” is input in the state WX, Sec1_XW. According to bnf, the value {“FOOD_Western food”, “NOT”, “FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input for one slot, and considers that [SLa NOT SLa] has been input. Since the current slot state is WX, the value of the undetermined slot W during the sequential confirmation is compared with the input “Western food” and changed to “Chinese”. Therefore, the slot state becomes YX, and the state changes to YX.

状態ＷＸで「そうではなくて中華」と入力された場合、Ｓｅｃ１＿ＸＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯＴ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は修正入力があると判断し、［ＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＷＸであるため、逐次確認中未確定スロットＷの値を抹消し、入力値“中華”によってスロットＷを埋める。従って、スロット状態はＹＸとなり、状態ＹＸに遷移することとなる。 When “Chinese but not so” is input in the state WX, Sec1_XW. In accordance with bnf, the values {“NOT”, “FOOD_Chinese”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input and considers that [NOT SLa] has been input. Since the current slot state is WX, the value of the unconfirmed slot W during the sequential confirmation is deleted, and the slot W is filled with the input value “Chinese”. Therefore, the slot state becomes YX, and the state changes to YX.

状態ＷＸで「なんでもよい」と入力された場合、Ｓｅｃ１＿ＸＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＡＮＹ”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＡＮＹ］の入力がなされたと見なす。現状のスロット状態はＷＸであるため、逐次確認中の未確定スロットＷの値を抹消し、入力値“ＡＮＹ”によってスロットを埋める。従って、スロット状態はＹＸとなり、状態ＹＸに遷移することとなる。 When “anything” is input in the state WX, Sec1_XW. In accordance with bnf, the value {“ANY”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [ANY] has been input. Since the current slot state is WX, the value of the undetermined slot W being sequentially confirmed is deleted, and the slot is filled with the input value “ANY”. Therefore, the slot state becomes YX, and the state changes to YX.

状態ＷＸで「はい」と入力された場合、Ｓｅｃ１＿ＸＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＹＥＳ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して受諾意志の入力があると判断し、［ＹＥＳ］の入力がなされたと見なす。現状のスロット状態はＷＸであるため、逐次確認中の未確定スロットＷが確定スロットＺになり、スロット状態はＺＸとなる。ここで、二つの入力可能スロット数に対して、それ未満のスロットが確定した状態となる、そこで、状態ＹＸにおいて「はい」と入力された場合と同様に、この入力だけでよいか確認するシーケンスに遷移する。これは、ＺＸに遷移して良いか確認する（ｃｏｎｆｉｒｍａｔｉｏｎ）ということから、状態ＺＸＣとする。従って、状態ＷＸで、「はい」と入力された場合、状態ＺＸＣに遷移することとなる。 When “Yes” is input in the state WX, Sec1_XW. In accordance with bnf, the value {“YES”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an acceptance intention input for the confirmation guidance, and considers that [YES] has been input. Since the current slot state is WX, the unconfirmed slot W being sequentially confirmed becomes the confirmed slot Z, and the slot state becomes ZX. Here, the number of slots that are less than the two slots that can be input is determined, and therefore, a sequence for confirming whether or not only this input is necessary, as in the case of inputting “Yes” in the state YX. Transition to. This is a state ZXC because it is confirmed whether or not the transition to ZX is allowed (confirmation). Therefore, when “Yes” is input in the state WX, the state transitions to the state ZXC.

状態ＷＸで「いいえ」と入力された場合、Ｓｅｃ１＿ＸＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して拒否意志の入力があると判断し、［ＮＯ］の入力がなされたと見なす。現状のスロット状態はＷＸであるため、逐次確認中の未確定スロットＷが空スロットＸになり、状態ＸＸに遷移することとなる。 When “No” is input in the state WX, Sec1_XW. In accordance with bnf, the value {“NO”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a rejection intention input for the confirmation guidance, and considers that [NO] has been input. Since the current slot state is WX, the unconfirmed slot W that is being sequentially confirmed becomes the empty slot X, and transitions to the state XX.

状態ＺＷ
状態ＺＷでは、一つのスロットについて逐次確認を行う状態となる。音声入出力受付部１は「次の入力は洋食でよろしいですか」といったガイダンスを流し、Ｓｅｃ１＿ＺＷ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。図２に示すシナリオでは、状態ＺＷにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＳＬａ｜ＳＬａＮＯＴＳＬａ｜ＮＯＴＳＬａ｜ＡＮＹ｜ＹＥＳ｜ＮＯ）が受理可能となる。 State ZW
In the state ZW, it becomes a state in which confirmation is sequentially performed for one slot. The voice input / output receiving unit 1 circulates guidance such as “Are you sure you want Western food for the next input?”, And Sec1_ZW. bnf is set in the voice recognition unit 5 to wait for user input. In the scenario shown in FIG. 2, the input that can be accepted in the state ZW is an input in which 1 is entered in the column. In this case, (SLa | SLa NOT SLa | NOT SLa | ANY | YES | NO) is acceptable. .

状態ＺＷで「中華」と入力された場合、Ｓｅｃ１＿ＺＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたと見なす。現状のスロット状態はＺＷであるため、確認中の未確定スロットＷの値“洋食”を入力値“中華”に変更する。従って、状態ＺＷに遷移することとなる。 When “Chinese” is input in the state ZW, Sec1_ZW. In accordance with bnf, the value {“FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Since the current slot state is ZW, the value “Western food” of the unconfirmed slot W being confirmed is changed to the input value “Chinese”. Therefore, the state changes to the state ZW.

状態ＺＷで「洋食ではなくて中華」と入力された場合、Ｓｅｃ１＿ＺＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿洋食”，“ＮＯＴ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して修正入力があると判断し、［ＳＬａＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＺＷであるため、逐次確認中未確定スロットＷの値と入力“洋食”を比較し、それを“中華”に変更する。従って、状態ＺＷに遷移することとなる。 When “Chinese instead of Western food” is input in the state ZW, Sec1_ZW. According to bnf, the value {“FOOD_Western food”, “NOT”, “FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input for one slot, and considers that [SLa NOT SLa] has been input. Since the current slot state is ZW, the value of the unconfirmed slot W during the sequential confirmation is compared with the input “Western food” and is changed to “Chinese”. Therefore, the state changes to the state ZW.

状態ＺＷで「そうではなくて中華」と入力された場合、Ｓｅｃ１＿ＺＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯＴ”，“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は修正入力があると判断し、［ＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＺＷであるため、逐次確認中未確定スロットＷの値を抹消し、入力値“中華”によってスロットＷを埋める。従って状態ＺＷに遷移することとなる。 When “Chinese but not so” is input in the state ZW, Sec1_ZW. In accordance with bnf, the values {“NOT”, “FOOD_Chinese”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input and considers that [NOT SLa] has been input. Since the current slot state is ZW, the value of the unconfirmed slot W during the sequential confirmation is deleted, and the slot W is filled with the input value “Chinese”. Therefore, the state is changed to ZW.

状態ＺＷで「なんでもよい」と入力された場合、Ｓｅｃ１＿ＸＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＡＮＹ”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＡＮＹ］の入力がなされたと見なす。現状のスロット状態はＺＷであるため、逐次確認中未確定スロットＷの値を抹消し、入力値“ＡＮＹ”によってスロットＷを埋める。従って状態ＺＷに遷移することとなる。 When “anything” is input in the state ZW, Sec1_XW. In accordance with bnf, the value {“ANY”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [ANY] has been input. Since the current slot state is ZW, the value of the undetermined slot W during the sequential confirmation is deleted, and the slot W is filled with the input value “ANY”. Therefore, the state is changed to ZW.

状態ＺＷで「はい」と入力された場合、Ｓｅｃ１＿ＺＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＹＥＳ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して受諾意志の入力があると判断し、［ＹＥＳ］の入力がなされたと見なす。現状のスロット状態はＺＷであるため、逐次確認中の未確定スロットＷが確定スロットＺになり、スロット状態はＺＺとなり、状態としては状態ＺＺに相当する。ここで、スロット数＝２と同数のスロットが確定スロットとなったため、スロットを埋めるという、本対話シーケンスの目的は達成された。そこで、Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔに記述された処理を実行することとなる。 When “Yes” is input in the state ZW, Sec1_ZW. In accordance with bnf, the value {“YES”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an acceptance intention input for the confirmation guidance, and considers that [YES] has been input. Since the current slot state is ZW, the unconfirmed slot W being sequentially confirmed becomes the confirmed slot Z, the slot state becomes ZZ, and the state corresponds to the state ZZ. Here, since the number of slots equal to the number of slots = 2 has become a definite slot, the purpose of this interactive sequence of filling the slot is achieved. Therefore, Sec1_ZZ. The process described in the script is executed.

状態ＺＷで「いいえ」と入力された場合、Ｓｅｃ１＿ＺＷ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して拒否意志の入力があると判断し、［ＮＯ］の入力がなされたと見なす。現状のスロット状態はＺＷであるため、逐次確認中の未確定スロットＷが空スロットＸになる。従って、スロット状態はＺＸとなり二つの入力可能スロット数に対して、それ未満のスロットが確定した状態となる。そこで、状態ＹＸにおいて「はい」と入力された場合と同様に、この入力だけでよいか確認するシーケンスに遷移する。これは、ＺＸに遷移して良いか確認する（ｃｏｎｆｉｒｍａｔｉｏｎ）ということから、状態ＺＸＣとする。従って、状態ＸＷで、「はい」と入力された場合、状態ＺＸＣに遷移することになる。 When “No” is input in the state ZW, Sec1_ZW. In accordance with bnf, the value {“NO”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a rejection intention input for the confirmation guidance, and considers that [NO] has been input. Since the current slot state is ZW, the unconfirmed slot W that is being sequentially confirmed becomes the empty slot X. Therefore, the slot state is ZX, and the number of slots less than that is determined for two input possible slot numbers. Therefore, as in the case where “Yes” is input in the state YX, the process proceeds to a sequence for confirming whether only this input is necessary. This is a state ZXC because it is confirmed whether or not the transition to ZX is allowed. Therefore, when “Yes” is input in the state XW, the state transitions to the state ZXC.

状態ＺＸＣ
状態ＺＸＣでは、入力可能スロット数未満のスロットが確定された場合に、確定された情報だけで本対話シーケンスを完了するかどうか確認を行う状態となる。音声入出力受付部１は「入力は和食だけでよろしいですか」といったガイダンスを流し、Ｓｅｃ１＿ＺＸＣ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。図２に示すシナリオでは、状態ＺＸＣにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＹＥＳ｜ＮＯ）が受理可能となる。 State ZXC
In the state ZXC, when slots less than the number of slots that can be input are confirmed, it is confirmed whether or not the present dialogue sequence is completed only by the confirmed information. The voice input / output receiving unit 1 circulates guidance such as “Is it OK to input only Japanese food?”, And Sec1_ZXC. bnf is set in the voice recognition unit 5 to wait for user input. In the scenario shown in FIG. 2, the input that can be accepted in the state ZXC is an input in which 1 is entered in the column, and in this case, (YES | NO) is acceptable.

状態ＺＸＣで「はい」と入力された場合、Ｓｅｃ１＿ＺＸＣ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＹＥＳ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して受諾意志の入力があると判断し、［ＹＥＳ］の入力がなされたと見なす。現状のスロット状態はＺＸＣであるため、本対話シーケンスの目的は達成されたと見なし、Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔに記述された処理を実行することとなる。 When “Yes” is input in the state ZXC, Sec1_ZXC. In accordance with bnf, the value {“YES”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an acceptance intention input for the confirmation guidance, and considers that [YES] has been input. Since the current slot state is ZXC, it is considered that the purpose of this dialogue sequence has been achieved, and Sec1_ZZ. The process described in the script is executed.

状態ＺＸＣで「いいえ」と入力された場合、Ｓｅｃ１＿ＺＸＣ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して拒否意志の入力があると判断し、［ＮＯ］の入力がなされたと見なす。現状のスロット状態はＺＸＣであるため、未確定の空スロットＸを確定するシーケンスに移行する。すなわち、状態ＺＸＣで「いいえ」が入力された場合、状態ＺＸに遷移することになる。 When “NO” is input in the state ZXC, Sec1_ZXC. In accordance with bnf, the value {“NO”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a rejection intention input for the confirmation guidance, and considers that [NO] has been input. Since the current slot state is ZXC, the process shifts to a sequence for determining an undetermined empty slot X. That is, when “No” is input in the state ZXC, the state ZX is transitioned to.

状態ＺＸ
状態ＺＸでは、入力可能スロット数未満のスロットが確定された場合に、残りの空スロットに情報を入力する状態となる。音声入出力受付部１は「もう一つのお店の種類は」といったガイダンスを流し、Ｓｅｃ１＿ＺＸ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。図２に示すシナリオでは、状態ＺＸにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＳＬａ｜ＡＮＹ）が受理可能となる。 State ZX
In the state ZX, when slots less than the number of slots that can be input are determined, information is input to the remaining empty slots. The voice input / output receiving unit 1 sends a guidance such as “What is the other shop type?”, And Sec1_ZX. bnf is set in the voice recognition unit 5 to wait for user input. In the scenario shown in FIG. 2, the input that can be accepted in the state ZX is an input in which 1 is entered in the column. In this case, (SLa | ANY) is acceptable.

状態ＺＸで「中華」と入力された場合、Ｓｅｃ１＿ＺＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿中華”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたと見なす。現状のスロット状態はＺＸであるため、空スロットＸに入力値“中華”を格納する。従って、状態ＺＹに遷移することとなる。 When “Chinese” is input in the state ZX, Sec1_ZX. In accordance with bnf, the value {“FOOD_Chinese”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Since the current slot state is ZX, the input value “Chinese” is stored in the empty slot X. Therefore, the state transitions to the state ZY.

状態ＺＸで「なんでもよい」と入力された場合、Ｓｅｃ１＿ＺＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＡＮＹ”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＡＮＹ］の入力がなされたと見なす。現状のスロット状態はＺＸであるため、空スロットＸの値を抹消し、入力値“ＡＮＹ”によってスロットを埋める。従って、状態ＺＹに遷移することとなる。 When “anything” is input in the state ZX, Sec1_ZX. In accordance with bnf, the value {“ANY”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [ANY] has been input. Since the current slot state is ZX, the value of the empty slot X is deleted, and the slot is filled with the input value “ANY”. Therefore, the state transitions to the state ZY.

状態ＺＹ
状態ＺＹでは、入力可能スロット数未満のスロットが確定された場合に、残りの空スロットに入力された情報を確認する状態となる。音声入出力受付部１は「中華でよろしいですか」といったガイダンスを流し、Ｓｅｃ１＿ＺＹ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。図２に示すシナリオでは、状態ＺＹにおいて受理可能な入力は欄に１が記入された入力であり、この場合は（ＳＬａ｜ＳＬａＮＯＴＳＬａ｜ＮＯＴＳＬａ｜ＡＮＹ｜ＹＥＳ｜ＮＯ）が受理可能となる。 State ZY
In the state ZY, when slots less than the number of slots that can be input are determined, the information input to the remaining empty slots is checked. The voice input / output reception unit 1 plays a guidance such as “Are you sure you want to use Chinese?”, And Sec1_ZY. bnf is set in the voice recognition unit 5 to wait for user input. In the scenario shown in FIG. 2, the input that can be accepted in the state ZY is an input in which 1 is entered in the column. In this case, (SLa | SLa NOT SLa | NOT SLa | ANY | YES | NO) is acceptable. .

状態ＺＹで「洋食」と入力された場合、Ｓｅｃ１＿ＺＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿洋食”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたと見なす。現状のスロット状態はＺＹであるため、未確定スロットＹの値を入力値“洋食”に変更する。従って、同じ状態ＺＹに遷移することとなる。 When “Western food” is input in the state ZY, Sec1_ZY. In accordance with bnf, the value {“FOOD_Western food”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [SLa] has been input. Since the current slot state is ZY, the value of the undetermined slot Y is changed to the input value “Western food”. Therefore, the state transitions to the same state ZY.

状態ＺＹで「中華ではなくて洋食」と入力された場合、Ｓｅｃ１＿ＺＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＦＯＯＤ＿中華”，“ＮＯＴ”，“ＦＯＯＤ＿洋食”｝という値が返る。シナリオ解析部６は一つのスロットに対して修正入力があると判断し、［ＳＬａＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＺＹであるため、修正対象“中華”に該当する未確定スロットＹの値を入力値“洋食”に変更する。従って、同じ状態ＺＹに遷移することとなる。 When “Western food instead of Chinese” is input in the state ZY, Sec1_ZY. According to bnf, the value {“FOOD_Chinese”, “NOT”, “FOOD_Western food”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input for one slot, and considers that [SLa NOT SLa] has been input. Since the current slot state is ZY, the value of the unconfirmed slot Y corresponding to the correction object “Chinese Chinese” is changed to the input value “Western food”. Therefore, the state transitions to the same state ZY.

状態ＺＹで「そうではなくて洋食」と入力された場合、Ｓｅｃ１＿ＺＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯＴ”，“ＦＯＯＤ＿洋食”｝という値が返る。シナリオ解析部６は一つのスロットに対して修正入力があると判断し、［ＮＯＴＳＬａ］の入力がなされたと見なす。現状のスロット状態はＺＹであるため、未確定スロットＹの値を入力値“洋食”に変更する。従って、同じ状態ＺＹに遷移することとなる。 When “Western food is not so” is input in the state ZY, Sec1_ZY. In accordance with bnf, the values {“NOT”, “FOOD_Western food”} are returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a correction input for one slot, and considers that [NOT SLa] has been input. Since the current slot state is ZY, the value of the undetermined slot Y is changed to the input value “Western food”. Therefore, the state transitions to the same state ZY.

状態ＺＹで「なんでもよい」と入力された場合、Ｓｅｃ１＿ＺＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＡＮＹ”｝という値が返る。シナリオ解析部６は一つのスロットに対して入力があると判断し、［ＡＮＹ］の入力がなされたと見なす。現状のスロット状態はＺＹであるため、未確定スロットＹの値を抹消し、入力値“ＡＮＹ”によってスロットを埋める。従って、同じ状態ＺＹに遷移することとなる。 When “anything” is input in the state ZY, Sec1_ZY. In accordance with bnf, the value {“ANY”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot, and considers that [ANY] has been input. Since the current slot state is ZY, the value of the indeterminate slot Y is deleted, and the slot is filled with the input value “ANY”. Therefore, the state transitions to the same state ZY.

状態ＺＹで「はい」と入力された場合、Ｓｅｃ１＿ＺＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＹＥＳ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して受諾意志の入力があると判断し、［ＹＥＳ］の入力がなされたと見なす。現状のスロット状態はＺＹであるため、スロット状態はＺＺとなり本対話シーケンスの目的は達成されたと見なし、Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔに記述された処理を実行することとなる。但し、受諾されたＹが以下の場合には異なる処理が行われる。
・Ｙ＝Ｚの場合
スロット内容がマージされるため、スロット状態はＺＸとなる。従って、入力可能スロット数未満のスロットが確定した状態となり、状態ＺＸＣに遷移することとなる。
・Ｙ＝“ＡＮＹ”の場合
確定済みのＺと内容が競合するが、確定済みの内容を優先するとし、Ｚ＝“和食”であった場合、スロット状態をＺＺ＝｛“和食”，''''｝とし、Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔを実行する。 When “Yes” is input in the state ZY, Sec1_ZY. In accordance with bnf, the value {“YES”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an acceptance intention input for the confirmation guidance, and considers that [YES] has been input. Since the current slot state is ZY, the slot state becomes ZZ, and it is considered that the purpose of this dialogue sequence has been achieved, and Sec1_ZZ. The process described in the script is executed. However, different processing is performed when the accepted Y is as follows.
When Y = Z, the slot contents are merged, so the slot state is ZX. Therefore, a slot less than the number of slots that can be input is determined, and the state transits to the state ZXC.
・ When Y = “ANY”, the content of the confirmed Z conflicts with the content of the confirmed Z. However, when the confirmed content is prioritized, and Z = “Japanese food”, the slot state is changed to ZZ = {“Japanese food”, ” ''} And Sec1_ZZ. Run script.

状態ＺＹで「いいえ」と入力された場合、Ｓｅｃ１＿ＺＹ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＮＯ”｝という値が返る。シナリオ解析部６は確認ガイダンスに対して拒否意志の入力があると判断し、［ＮＯ］の入力がなされたと見なす。現状のスロット状態はＺＹであるため、未確定スロットＹが空スロットＸとなる。従って、入力可能スロット数未満のスロットが確定した状態となり、状態ＺＸＣに遷移することとなる。 If “NO” is input in the state ZY, Sec1_ZY. In accordance with bnf, the value {“NO”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is a rejection intention input for the confirmation guidance, and considers that [NO] has been input. Since the current slot state is ZY, the undetermined slot Y becomes the empty slot X. Therefore, a slot less than the number of slots that can be input is determined, and the state transits to the state ZXC.

Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔについて
Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔは、対話シーケンスＳｅｃ１が完了した場合に実行されるスクリプトである。ここには、検索処理実行方法や、検索後に起動する対話シーケンス名が記載される。Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔの記述例を図４に示す。例えば、対話シーケンス（Ｓｅｃ１）にて、“和食”と“洋食”が確定した場合、ＤＢアクセスとして、“ｓｅｌｅｃｔ ^* ｆｒｏｍｔａｂｌｅｗｈｅｒｅＳＬａ＝“和食”｜ＳＬａ＝“洋食””というＳＱＬが呼ばれ、その返値の配列＠Ｒｅｓｕｌｔが次の対話シーケンス（Ｓｅｃ２）に渡され、Ｓｅｃ２が起動される。 Sec1_ZZ. About Sec1_ZZ. The script is a script that is executed when the dialogue sequence Sec1 is completed. Here, a search processing execution method and a dialog sequence name to be started after the search are described. Sec1_ZZ. An example of script description is shown in FIG. For example, when “Japanese food” and “Western food” are confirmed in the dialogue sequence (Sec1), SQL of “select ^* from table where SLa =“ Japanese food ”| SLa =“ Western food ”” is called as DB access, The returned array @Result is passed to the next interactive sequence (Sec2), and Sec2 is activated.

［第２の実施形態］
切断・再開処理の実施形態について説明する。 [Second Embodiment]
An embodiment of the disconnection / resumption process will be described.

本発明の対話シーケンスはスロット状態にのみ依存するため、スロットの状態を指定することで中断復帰の動作は容易に実施可能となる。 Since the dialogue sequence of the present invention depends only on the slot state, the interruption / return operation can be easily performed by designating the slot state.

例えば、状態ＹＹにおいて、｛“和食”，“洋食”｝が入力され、その確認ガイダンス中（「和食と洋食でよろしいですか」）の状態で中断された場合、端末装置７は対話シーケンス名Ｓｅｃ１と状態ＹＹおよび｛“和食”，“洋食”｝を記憶する。接続再開時に、端末装置７は音声対話装置に対し、対話シーケンス名Ｓｅｃ１と状態ＹＹおよび｛“和食”，“洋食”｝を送信する。音声対話装置のシナリオ解析部６は対話シーケンスＳｅｃ１を状態ＹＹおよびスロット値｛“和食”，“洋食”｝で呼び出して実行することで、切断前の状態に復帰可能となる。 For example, in the state YY, when {"Japanese food", "Western food"} is input and the confirmation guidance is interrupted ("Are you sure you want Japanese food and Western food?"), The terminal device 7 displays the dialogue sequence name Sec1. And state YY and {“Japanese food”, “Western food”} are stored. When the connection is resumed, the terminal device 7 transmits the dialogue sequence name Sec1 and the state YY and {“Japanese food”, “Western food”} to the voice interactive device. The scenario analysis unit 6 of the voice interaction device can return to the state before disconnection by calling and executing the interaction sequence Sec1 with the state YY and the slot value {“Japanese food”, “Western food”}.

［第３の実施形態］
複数種別のスロットが存在する場合の実施形態を説明する。 [Third Embodiment]
An embodiment in the case where there are multiple types of slots will be described.

例えば、駐車場検索において、現在地からの距離と料金を用いて「８００ｍ以内で５００円／Ｈ以下」といった検索する場合を考える。この場合、距離と料金は別カテゴリの入力となるため、それぞれに対して別の入力ＩＤ、ＳＬａおよびＳＬｂを付与すると共に、カテゴリ毎に利用可能なスロット数を指定する。図６に示すシナリオ例では、ＳＬａ｛ＤＩＳＴＡＮＣＥ｝＝１，ＳＬｂ｛ＰＲＩＣＥ｝＝１となる。複数カテゴリ、複数スロットであれば、ＳＬａ＝２，ＳＬｂ＝１，ＳＬｃ＝３となり、初期状態はＸＸＸＸＸＸと表記されることとなる。 For example, in the parking lot search, consider a case where a search such as “500 yen / H or less within 800 m” is performed using the distance from the current location and the fee. In this case, since the distance and the charge are input in different categories, different input IDs, SLa and SLb are assigned to the respective categories, and the number of slots available for each category is designated. In the example scenario shown in FIG. 6, SLa {DISTANCE} = 1 and SLb {PRICE} = 1. In the case of a plurality of categories and a plurality of slots, SLa = 2, SLb = 1, and SLc = 3, and the initial state is expressed as XXXXXX.

駐車場検索における状態遷移表例の一部を図６に示す。距離および料金はそれぞれ０以上１未満のスロットに対して入力可能であるため、初期状態はＸＸとなる。 A part of the state transition table example in the parking lot search is shown in FIG. Since the distance and fee can be input to slots of 0 or more and less than 1, respectively, the initial state is XX.

状態ＸＸ
初期状態ＸＸでは、「距離または料金は」というガイダンスを流し、図７に示すＳｅｃ２＿ＸＸ．ｂｎｆを音声認識部５にセットしてユーザの入力を待つ。 State XX
In the initial state XX, the guidance “Distance or fee is” is played, and Sec2_XX. bnf is set in the voice recognition unit 5 to wait for user input.

初期状態ＸＸにおいて「８００ｍ以内」と入力された場合、Ｓｅｃ２＿ＸＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＤＩＳＴＡＮＣＥ＿８００ｍ”｝という値が返る。シナリオ解析部６は距離に関する一つのスロットに対して入力があると判断し、［ＳＬａ］の入力がなされたとみなす。そして、スロットＸの前方よりスロットを埋める。この場合は、距離に関する一番目のスロットＸに“８００ｍ”がセットされ、スロット状態はＹとなる。従って、状態ＹＸに遷移することになる。 When “within 800 m” is input in the initial state XX, Sec2_XX. In accordance with bnf, the value {“DISTANCE_800m”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot related to the distance, and considers that [SLa] has been input. Then, the slot is filled from the front of the slot X. In this case, “800 m” is set in the first slot X related to the distance, and the slot state is Y. Therefore, the state transitions to the state YX.

初期状態ＸＸにおいて、「５００円以下」と入力された場合、Ｓｅｃ２＿ＸＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＰＲＩＣＥ＿５００円”｝という値が返る。シナリオ解析部６は料金に関する一つのスロットに対して入力があると判断し、［ＳＬｂ］の入力がなされたとみなす。そして、料金に関する二番目のスロットＸの前方よりスロットを埋める。この場合は、スロットＸｂに“８００ｍ”がセットされ、スロット状態はＹとなる。従って、状態ＸＹに遷移することになる。 When “500 yen or less” is input in the initial state XX, Sec2_XX. In accordance with bnf, the value {“PRICE_500 yen”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for one slot related to the charge, and considers that [SLb] has been input. Then, the slot is filled from the front of the second slot X related to the fee. In this case, “800 m” is set in the slot Xb, and the slot state is Y. Therefore, the state transitions to the state XY.

初期状態ＸＸにおいて、「８００ｍ以内で５００円以下」と入力された場合、Ｓｅｃ２＿ＸＸ．ｂｎｆに従い、音声認識部５からシナリオ解析部６へは｛“ＤＩＳＴＡＮＣＥ＿８００ｍ”，“ＰＲＩＣＥ＿５００円”｝という値が返る。シナリオ解析部６は距離と料金に関してそれぞれ一つのスロットに対して入力があると判断し、［ＳＬａＳＬｂ］の入力がなされたとみなす。そして、距離に関する一番目のスロットＸの前方より距離に関する入力値“８００ｍ”を埋め、料金に関する二番目のスロットＸの前方より料金に関する入力値“５００円”を埋める。従って、状態ＹＹに遷移することになる。その後の処理ルールは第１の実施形態と同様のルールに従う。 In the initial state XX, when “500 yen or less within 800 m” is input, Sec2_XX. In accordance with bnf, the value {“DISTANCE_800 m”, “PRICE_500 yen”} is returned from the speech recognition unit 5 to the scenario analysis unit 6. The scenario analysis unit 6 determines that there is an input for each slot regarding the distance and the charge, and considers that [SLa SLb] has been input. Then, the input value “800 m” related to the distance is filled from the front of the first slot X related to the distance, and the input value “500 yen” related to the charge is filled from the front of the second slot X related to the charge. Therefore, the state transitions to the state YY. Subsequent processing rules follow the same rules as in the first embodiment.

［第４の実施形態］
受理可能な入力はシナリオ毎に設定されるものであり、図２や図６に示したものの他に、以下のような入力も考えられる。
・［ＳＬａＳＬａＮＯＴＳＬａＳＬａ］
２つのスロットに｛“和食”，“洋食”｝が入っているＹＹ状態において、「和食と洋食ではなくて中華とイタリア料理」という入力がこれに相当する。
・［ＮＯＴＳＬａＳＬａ］
２つのスロットに｛“和食”，“洋食”｝が入っているＹＹ状態において、「そうではなくて中華かイタリア料理」という入力がこれに相当する。 [Fourth Embodiment]
Acceptable inputs are set for each scenario, and in addition to those shown in FIGS. 2 and 6, the following inputs are also conceivable.
・ [SLa SLa NOT SLa SLa]
In the YY state in which {"Japanese food", "Western food"} is in two slots, the input "Chinese and Italian food, not Japanese food and Western food" corresponds to this.
・ [NOT SLa SLa]
In the YY state in which {"Japanese food", "Western food"} is contained in the two slots, the input “Chinese or Italian food instead” corresponds to this.

［第５の実施形態］
図２や図６で示したシナリオ例では、「和食でよろしいですか」や「洋食だけでよろしいですか」といった確認シーケンスが含まれている。しかし、シナリオが使われる状況によっては、これら確認シーケンスをスキップすることが必要となる場合もある。その場合は、図８に示す状態に対してＳｋｉｐ属性を付与する。シナリオ解析部６はＳｋｉｐ属性を持つ状態に対して、スロットＹはＺとして確定させると共に、ＺＸＣのようにＣを持つ状態についてはＺＸに無条件に遷移させる。 [Fifth Embodiment]
The scenario examples shown in FIGS. 2 and 6 include a confirmation sequence such as “Are you sure you want Japanese food?” Or “Are you sure you want only Western food?” However, depending on the situation in which the scenario is used, it may be necessary to skip these confirmation sequences. In that case, a Skip attribute is assigned to the state shown in FIG. The scenario analysis unit 6 determines the slot Y as Z for the state having the Skip attribute, and unconditionally shifts the state having C like ZXC to ZX.

［第６の実施形態］
図２に示したシナリオ例では、入力可能なスロット数より少ないスロット数が確定した状態で、確定スロットだけで検索など次の処理に進むか確認するＺＸＣなどの状態では、「Ｓｌｏｔ［１］だけでＯＫ」というガイダンスにより、［ＹＥＳ］であればシナリオ完了と見なし、［ＮＯ］であれば、不足スロットを入力する状態（ＺＸ）に遷移する。しかし、ガイダンスとして、「もう一つの条件を入力しますか」といったものを設定した場合、［ＹＥＳ］、［ＮＯ］の持つ意味が正反対となる。そこで、［ＹＥＳ］、［ＮＯ］に関してＺＸＣと正反対の遷移先となるＺＸＤを定義する。 [Sixth Embodiment]
In the scenario example shown in FIG. 2, in a state such as ZXC in which the number of slots smaller than the number of slots that can be input is confirmed and whether to proceed to the next processing such as search using only the confirmed slots, only “Slot [1]” According to the guidance “OK”, the scenario is considered to be complete if it is [YES], and transitions to a state (ZX) in which an insufficient slot is input if it is [NO]. However, when a guidance such as “Do you want to input another condition” is set, the meanings of [YES] and [NO] are opposite. Therefore, ZXD, which is the opposite destination of ZXC with respect to [YES] and [NO], is defined.

ＹＸなどで［ＹＥＳ］入力でＹが確定した場合、シナリオ解析部６は最初に状態ＺＸＣを検索し、見つからない場合には状態ＺＸＤを検索する。状態ＺＸＤにおいては、［ＹＥＳ］入力が状態ＺＸへの遷移を意味し、［ＮＯ］入力が状態ＺＺと同様に次の処理に進むことを意味する。 When Y is confirmed by inputting [YES] in YX or the like, the scenario analysis unit 6 first searches for the state ZXC, and if not found, searches for the state ZXD. In the state ZXD, the [YES] input means a transition to the state ZX, and the [NO] input means that the process proceeds to the next process as in the state ZZ.

なお、本発明は専用のハードウェアにより実現されるもの以外に、その機能を実現するためのプログラムを、コンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行するものであってもよい。コンピュータ読み取り可能な記録媒体とは、フレキシブルディスク、光磁気ディスク、ＣＤ−ＲＯＭ等の記録媒体、コンピュータシステムに内蔵されるハードディスク装置等の記憶装置を指す。さらに、コンピュータ読み取り可能な記録媒体は、インターネットを介してプログラムを送信する場合のように、短時間の間、動的にプログラムを保持するもの（伝送媒体もしくは伝送波）、その場合のサーバとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含む。 In addition to what is implemented by dedicated hardware, the present invention records a program for realizing the function on a computer-readable recording medium, and the program recorded on the recording medium is stored in a computer system. It may be read and executed. The computer-readable recording medium refers to a recording medium such as a flexible disk, a magneto-optical disk, and a CD-ROM, and a storage device such as a hard disk device built in a computer system. Furthermore, a computer-readable recording medium is a server that dynamically holds a program (transmission medium or transmission wave) for a short period of time, as in the case of transmitting a program via the Internet, and a server in that case. Some of them hold programs for a certain period of time, such as volatile memory inside computer systems.

本発明の一実施形態の音声対話装置の構成図である。It is a block diagram of the voice interactive apparatus of one Embodiment of this invention. 食事検索の状態遷移表の例を示す図である。It is a figure which shows the example of the state transition table of a meal search. Ｓｅｃ１＿ＸＸ．ｂｎｆの記述例を示す図である。Sec1_XX. It is a figure which shows the example of description of bnf. Ｓｅｃ１＿ＹＸ．ｂｎｆの記述例を示す図である。Sec1_YX. It is a figure which shows the example of description of bnf. Ｓｅｃ１＿ＺＺ．ｓｃｒｉｐｔの記述例を示す図である。Sec1_ZZ. It is a figure which shows the example of description of script. 駐車場検索の状態遷移表の例を示す図である。It is a figure which shows the example of the state transition table of a parking lot search. Ｓｅｃ２＿ＸＸ．ｂｎｆの記述例を示す図である。Sec2_XX. It is a figure which shows the example of description of bnf. スキップを含むシナリオの例を示す図である。It is a figure which shows the example of the scenario containing a skip.

Explanation of symbols

１音声入出力受付部
２データ入出力受付部
３シナリオＤＢ
４認識辞書ＤＢ
５音声認識部
６シナリオ解析部
７端末装置 1 Voice input / output reception part 2 Data input / output reception part 3 Scenario DB
4 recognition dictionary DB
5 Speech recognition unit 6 Scenario analysis unit 7 Terminal device

Claims

A voice dialogue sequence state expression method performed in a voice dialogue device,
Assign slots for each piece of information needed to achieve a given goal in a voice conversation with the user,
Information is stored in each slot according to each voice input from the start of voice dialogue to the achievement of a predetermined purpose,
Information is empty slots is a state that is not stored, but information is stored, the information is unconfirmed slot is in a state that is not determined to confirm the user information is stored,該情 When there is a confirmed slot and a plurality of unconfirmed slots in which the information has been confirmed and confirmed to the user, and the user confirmation result of whether or not the information stored in each unconfirmed slot is correct , is set for each of the undetermined slot, by managing the status of each slot in the four states of the sequential verify the slot is being confirmed information of the undetermined slot, the voice interaction sequence status the expressed in association with the unique and slot state,
For each sequentially confirmed slot set for each unconfirmed slot, if the information of the sequentially confirmed slot is confirmed and confirmed to the user, the status from the sequentially confirmed slot to the confirmed slot When the correction information of the information of the slot being sequentially confirmed is input and the correction information is a plurality of other information, a plurality of slots being sequentially confirmed in which the other information is stored separately , Each state is changed to the unconfirmed slot, and when the correction information is one piece of other information, the other information is stored in the sequential confirmation slot, and the state waiting for confirmation by the user is maintained.
Spoken dialogue sequence state notation method.

The method according to claim 1, wherein the slot state further includes a control state code for confirming to the user whether or not to proceed to the next sequence with only the confirmed slots when slots less than the number of slots that can be input are confirmed. .

The state of the voice interaction sequence is represented by a plurality of states defined by combinations of two or more slot states among the four slot states of the empty slot, the confirmed slot, the unconfirmed slot, and the sequentially confirmed slot. For each of the states, according to the state transition table in which acceptable or unacceptable is set for a plurality of defined inputs that respectively define a plurality of voice input patterns that can be input by the user in the voice interaction sequence. The method according to claim 1 or 2, wherein the state of the voice interaction sequence is expressed in association with the slot state .

The method according to claim 3, wherein the predefined input is a single input, an OR input, an AND input, a partial designation correction input, a correction input, an ANY input, a YES input, and a NO input for the same type of slot.

The method according to claim 3, wherein the predefined input is a single input, an OR input, an AND input, a partial designation correction input, a correction input, an ANY input, a YES input, and a NO input for different types of slots.

The method according to any one of claims 1 to 5, wherein a transition destination of a voice interaction sequence is uniquely determined according to a slot state changed with respect to an acceptable input.

When processing according to the voice interaction sequence is interrupted, a slot state indicating the state of the voice interaction sequence at the time of the interruption is held, and when the processing is resumed, the state at the time of interruption is referred to the held slot state The method according to claim 1, wherein the method returns to

A voice interaction device,
A voice input / output receiving unit that outputs voice guidance to the user and receives voice input in response to the voice guidance from the user;
A data input / output receiving unit for outputting the input dialog sequence name and slot state to the user and receiving the dialog sequence name and slot state input by the user when the voice dialog sequence is resumed;
A scenario DB that stores voice conversation scenarios used for controlling the voice conversation for each voice conversation sequence;
A recognition dictionary DB that stores speech recognition grammar;
A speech recognition unit that performs speech recognition processing on the input speech using the designated speech recognition grammar stored in the recognition dictionary DB and outputs the speech recognition result;
Interaction sequence name and slot status output to the data input receiving unit, the input to the voice output unit in response to the voice guidance, the voice recognition unit on the speech input of the user, and instructs the voice The speech recognition process is performed with the recognition grammar, and the state of the speech dialogue scenario stored in the scenario DB is changed according to the speech recognition result, and is output from the data input / output receiving unit when the speech dialogue is resumed. resuming the scenario analyzer voice conversation by interaction sequence name and slot state, has,
The scenario analysis unit
Slots are assigned to each piece of information necessary to achieve a predetermined purpose in a voice dialogue with the user, and information is stored in each slot according to each voice input from the start of the voice dialogue to the achievement of the predetermined goal. And
Information is empty slots is a state that is not stored, but information is stored, the information is unconfirmed slot is in a state that is not determined to confirm the user information is stored,該情 When there is a confirmed slot and a plurality of unconfirmed slots in which the information has been confirmed and confirmed to the user, and the user confirmation result of whether or not the information stored in each unconfirmed slot is correct By managing the state of each of the slots in the four states of the sequentially confirming slot, which is set for each of the unconfirmed slots and confirming the information of the unconfirmed slot , Express the state uniquely associated with the slot state ,
For each sequentially confirmed slot set for each unconfirmed slot, if the information of the sequentially confirmed slot is confirmed and confirmed to the user, the status from the sequentially confirmed slot to the confirmed slot When the correction information of the information of the slot being sequentially confirmed is input and the correction information is a plurality of other information, a plurality of slots being sequentially confirmed in which the other information is stored separately , Each state is changed to the unconfirmed slot, and when the correction information is one piece of other information, the other information is stored in the sequential confirmation slot, and the state waiting for confirmation by the user is maintained. Spoken dialogue device.

A program for executing the spoken dialogue sequence state notation method according to any one of claims 1 to 7 on a computer.