JP2601302B2

JP2601302B2 - Pitch frequency generator in speech synthesizer

Info

Publication number: JP2601302B2
Application number: JP63052654A
Authority: JP
Inventors: 隆之大山
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1988-03-08
Filing date: 1988-03-08
Publication date: 1997-04-16
Anticipated expiration: 2012-04-16
Also published as: JPH01227199A

Description

【発明の詳細な説明】〔概要〕音声合成装置におけるピッチ周波数生成装置に係り、
特に文章に関する情報を入力して該文章を読み上げる音
声情報に変換して出力する音声合成装置の出力音声の時
間的に変化する音声のピッチ周波数を生成するに際し
て、文章の特徴に応じて時系列的に入力され、ピーク値
から単調減少する特性を有する複数のピッチパターンを
各時刻において重ぬ合わて合成する音声合成装置におけ
るピッチ周波数生成装置に関し、ピッチパターンの生成をできるだけ短時間で行なうこ
とができるようにすることを目的とし、文章に関する情報を入力して該文章を読み上げる音声
情報に変換して出力する音声合成装置の出力音声の時間
的に変化する音声のピッチ周波数を生成するに際して、
文章の特徴に応じて時系列的に入力され、ピーク値から
単調減少する特性を有する複数のピッチパターンを各時
刻において重ね合わて合成重ね合わせ手段を有する音声
合成装置におけるピッチ周波数生成装置において、上記
ピッチパターンが単調減少傾向であるとき減少傾向信号
を発生する傾向検出手段と、該ピッチパターンが所定の
閾値以下であるとき閾値信号を発生する閾値検出手段
と、減少傾向信号と閾値信号とを受けたとき、該ピッチ
パターンの生成を停止する重ね合せ停止手段とを有する
ように構成する。DETAILED DESCRIPTION OF THE INVENTION [Summary] The present invention relates to a pitch frequency generator in a speech synthesizer,
In particular, when generating a pitch frequency of a time-varying voice of an output voice of a voice synthesizer that inputs information about a text, converts the text into voice information to be read, and outputs the voice information, a time-series The present invention relates to a pitch frequency generator in a speech synthesizer that synthesizes a plurality of pitch patterns having characteristics that monotonically decrease from a peak value at each time, and can generate a pitch pattern in as short a time as possible. In order to generate a pitch frequency of a time-varying voice of a voice output from a voice synthesizer that inputs information about a text, converts the text into voice information to be read, and outputs the voice,
A pitch frequency generation device in a speech synthesis device having a plurality of pitch patterns that are input in a time series according to the characteristics of a sentence and have a characteristic of monotonically decreasing from a peak value at each time, and that are synthesized. A tendency detecting means for generating a decreasing tendency signal when the pattern is monotonically decreasing, a threshold detecting means for producing a threshold signal when the pitch pattern is equal to or less than a predetermined threshold, and receiving the decreasing tendency signal and the threshold signal. At this time, the apparatus is configured to have an overlay stopping means for stopping the generation of the pitch pattern.

[Industrial applications]

本発明は、音声合成装置におけるピッチ周波数生成装
置に係り、特に文章に関する情報を入力して該文章を読
み上げる音声情報に変換し、出力する音声合成装置の出
力音声の時間的に変化する音声のピッチ周波数を生成す
るに際して、文章の特徴に応じて時系列的に入力され、
ピーク値から単調減少する特性を有する複数のピッチパ
ターンを各時刻において重ね合わせて合成する重ね合わ
せ手段を有する音声合成装置におけるピッチ周波数生成
装置に関する。The present invention relates to a pitch frequency generation device in a speech synthesizer, and in particular, to input information related to a sentence, convert the sentence into speech information to be read out, and output the time-varying pitch of the output speech of the speech synthesis device. When generating frequencies, they are input in chronological order according to the features of the text,
The present invention relates to a pitch frequency generation device in a voice synthesis device having a superposition unit that superposes and synthesizes a plurality of pitch patterns having a characteristic that monotonously decreases from a peak value at each time.

[Conventional technology]

一般に文章に関する情報を入力して、この情報を該文
章を読み上げる音声情報に変換して出力する音声合成装
置としてつぎのようなものがある。これは、印刷物、例
えば新聞の校正装置で、原稿に基づいて、一旦紙面構成
装置に入力された文章が、原稿と一致しているかを校正
するものである。即ち、一旦紙面構成装置に入力された
漢字仮名混じり文をこの音声合成装置で読み上げ、この
読み上げられた文章を聞き、原稿と照合して入力された
文章に誤りがないかを校正するものである。In general, there are the following speech synthesizers for inputting information about a sentence, converting the information into speech information for reading out the sentence, and outputting the speech information. This is a device for proofreading a printed matter, for example, a newspaper, and proofreading whether a sentence once input to the paper-sheet forming device matches the document based on the document. That is, a sentence mixed with kanji and kana, which has been input to the paper composition device once, is read out by the voice synthesizing device. .

このような音声合成装置として第７図に示すようなも
のがある。同図において11は漢字仮名混じり文である文
字列が入力されると辞書12を参照して文章の解析及び単
語を同定して文章を音韻部と韻律部に分解する文章解析
部、13は分解された韻律を解析して韻律を指定するピッ
チ指令を発生する韻律解析部、14はこのピッチ指令に基
づきピッチパターンを生成すピッチ信号生成装置、15は
上記の音韻部に基づいて実際の発音を指定する発音記号
を生成する発音記号生成部、16はこの発音記号から音片
を格納した音片メモリ17から音片を取り出して結合して
実際の音韻信号を発生する音片結合部、18は上記のピッ
チパターンと音韻信号を合成してスピーカ19から実際の
合成音声を出力する音声合成部を示している。FIG. 7 shows an example of such a speech synthesizer. In the figure, reference numeral 11 denotes a sentence analysis unit that analyzes a sentence and identifies words by referring to a dictionary 12 when a character string that is a mixture of kanji and kana is input, and decomposes the sentence into a phonological part and a prosodic part. A prosody analyzer that analyzes the prosody and generates a pitch command that specifies the prosody, a pitch signal generator that generates a pitch pattern based on the pitch command, and an actual pronunciation based on the above phoneme. A phonetic symbol generation unit for generating a specified phonetic symbol, a speech unit combining unit for taking out a voice unit from a voice unit memory 17 storing a voice unit from the phonetic symbols and combining them to generate an actual phoneme signal, 18 A voice synthesis unit that synthesizes the pitch pattern and the phoneme signal and outputs an actual synthesized voice from the speaker 19 is shown.

そしてこの例における音韻解析部において解析された
韻律（ピッチ周波数）は所謂藤崎モデル（Ｆモデル）を
用いて表示され、また合成される。この藤崎モデルは、
ピッチ周波数を文章のイントネーションを表示するフレ
ーズ成分と単語のアクセントを表示するアクセント成分
とに分けて夫々記述して、これらの成分をあわせるもの
であり、以下の式によって表示される。ここで時刻ｔに
おけるピッチ周波数をF₀（ｔ）は、と表示される。但し G_pi＝α_i ²t exp（−α_it） …ｔ≧０のとき G_pi＝０ …ｔ＜０のとき G_pj＝_Min［１−（１＋β_jt）exp（−β_jt），θ］ …ｔ
≧０のとき G_pj＝０ …ｔ＜０のときここでＩはフレーズ指令の数Ｊはアクセント指令の数 A_piはｉ番目のフレーム指令の大きさ A_ajはｉ番目のアクセント指令の大きさ T_oiはｉ番目のフレーズ指令の開始時点 T_1jはｊ番目のアクセント指令の開始時点 T_2jはｊ番目のアクセント指令の終了時点 α_ｉはｉ番目のフレーズ指令に対するフレーズ制御機構
の固有角周波数 β_ｊはｊ番目のアクセント指令に対するアクセント制御
機構の固有角周波数 θはアクセント制御機構のステップ応答関数の上限位である。Then, the prosody (pitch frequency) analyzed by the phoneme analysis unit in this example is displayed using a so-called Fujisaki model (F model) and synthesized. This Fujisaki model is
The pitch frequency is described separately for a phrase component for displaying the intonation of a sentence and an accent component for displaying the accent of a word, and these components are combined, and are represented by the following formula. Here, the pitch frequency at time t, F ₀ (t), is Is displayed. Here, G _pi = α _i ² t exp (−α _i t)... When t ≧ 0, G _pi = 0... When t <0, G _pj = _Min [1- (1 + β _j t) exp (−β _j t) , Θ] ... t
When ≧ 0, G _pj = 0... When t <0, where I is the number of phrase commands J is the number of accent commands A _pi is the size of the i-th frame command A _aj is the size of the i-th accent command T _oi is the start time of the i-th phrase command T _1j is the start time of the j-th accent command T _2j is the end time of the j-th accent command α _i is the natural angular frequency β of the phrase control mechanism for the i-th phrase command _j is the natural angular frequency of the accent control mechanism for the j-th accent command. θ is the upper limit of the step response function of the accent control mechanism.

この藤崎モデルのピッチ周波数は、第４図に示すよう
に、フレーズパターンとアクセントパターンとを時系列
的に複数重ね合わせたものである。ここで一つのフレー
ズパターンを見ると第５図に示すように、パルス状のフ
レーズ指令に対して一旦立ち上がり、最大値からなだら
かに単調減少する波形のフレーズパターンを有してい
る。また、アクセントパターンを見ると第６図に示すよ
うに、矩形波状のアクセント指令に対して時間的後にず
れ、且つなだらかに上昇し、最大値からなだらかに単調
減少する波形のパターンを有するアクセントパターンを
形成するものである。そして、第４図に示すように、複
数のフレーズ指令及びアクセント指令（この例において
はフレーズ指令は３、アクセント指令は４である）で特
定される複数のフレーズパターン及びアクセントパター
ンを重ね合わせることにより実際のピッチパターンを生
成するようにしている。As shown in FIG. 4, the pitch frequency of the Fujisaki model is obtained by superimposing a plurality of phrase patterns and accent patterns in time series. Looking at one phrase pattern, as shown in FIG. 5, it has a waveform phrase pattern that rises once in response to a pulse-like phrase command and monotonically decreases from the maximum value. Looking at the accent pattern, as shown in FIG. 6, an accent pattern having a waveform pattern shifted in time with respect to a rectangular wave-like accent command, gradually rising, and gradually decreasing monotonously from the maximum value is obtained. To form. Then, as shown in FIG. 4, a plurality of phrase patterns and accent patterns specified by a plurality of phrase commands and accent commands (in this example, the phrase command is 3 and the accent command is 4) are overlapped. An actual pitch pattern is generated.

[Problems to be solved by the invention]

ところで、実際のピッチ信号生成装置は入力されたピ
ッチ指令に従って、上述した式に各定数を当てはめ、各
時刻におけるピッチ周波数の値を計算するものであるか
ら、時間が経過するに従って計算する項数が増加し（例
えば第４図中ｔ＝Ｔにおいては２つのフレーズパターン
と４つのアクセントパターンの全てについて計算を終了
しなければピッチパターンの結果を算出することはでき
ない）、この計算に時間がかかり、全体としてピッチ周
波数の生成に時間がかかるという問題がある。By the way, the actual pitch signal generation device calculates the value of the pitch frequency at each time by applying each constant to the above equation according to the input pitch command. (For example, at t = T in FIG. 4, it is not possible to calculate the result of the pitch pattern unless the calculation is completed for all of the two phrase patterns and the four accent patterns), and this calculation takes time. There is a problem that it takes time to generate the pitch frequency as a whole.

そこで本発明は、ピッチパターンの生成をできるだけ
短時間で行なうことができる音声合成装置におけるピッ
チパターンの生成装置を提供することを目的とする。Therefore, an object of the present invention is to provide a pitch pattern generation device in a speech synthesis device that can generate a pitch pattern in as short a time as possible.

[Means for solving the problem]

本発明にあって、上記の課題を解決するための手段
は、文章に関する情報を入力して該文章を読み上げる音
声情報に変換して出力する音声合成装置の出力音声の時
間的に変化する音声のピッチ周波数を生成するに際し
て、文章の特徴に応じて時系列的に入力され、ピーク値
から単調減少する特性を有する複数のピッチパターンを
各時刻において重ね合わせて合成する重ね合わせ手段を
有するピッチ周波数生成装置において、上記ピッチパタ
ーンにおけるピッチ周波数が単調減少傾向であるとき減
少傾向信号を発生する傾向検出手段と、該ピッチパター
ンにおけるピッチ周波数が所定の閾値以下であるとき閾
値信号を発生する閾値検出手段と、減少傾向信号と閾値
信号とを受けたとき、ピッチ周波数が単調減少傾向であ
り、かつピッチ周波数が所定の閾値以下であるピッチパ
ターンの生成を停止する重ね合わせ停止手段とを有する
ようにしたことである。In the present invention, a means for solving the above-mentioned problem is a method of inputting information relating to a sentence, converting the sentence into speech information to be read out, and outputting the speech information. When generating a pitch frequency, the pitch frequency generating unit includes a superimposing unit that superimposes and synthesizes a plurality of pitch patterns having a characteristic of monotonically decreasing from a peak value at each time in a time-series manner according to the characteristics of a sentence. In the apparatus, a tendency detecting means for generating a decreasing tendency signal when the pitch frequency in the pitch pattern is monotonically decreasing, and a threshold detecting means for generating a threshold signal when the pitch frequency in the pitch pattern is equal to or less than a predetermined threshold When receiving the decreasing trend signal and the threshold signal, the pitch frequency is monotonically decreasing and the pitch frequency is It is that you have a superposition stopping means stops generating the pitch pattern is less than a constant threshold value.

[Action]

本発明によれば、ピッチパターンにおけるピッチ周波
数が単調傾向であり、かつピッチパターンにおけるピッ
チ周波数が所定の閾値以下であるピッチパターンのピッ
チパターンの生成を停止するようにしたから、必要とな
る計算の項数が増加せず、計算時間が短いものとなる。
このため、ピッチ周波数の生成を全体として短時間で行
なうことができる。According to the present invention, the pitch frequency in the pitch pattern is monotonic, and the generation of the pitch pattern of the pitch pattern in which the pitch frequency in the pitch pattern is equal to or less than a predetermined threshold is stopped. The number of terms does not increase and the calculation time is short.
Therefore, the pitch frequency can be generated in a short time as a whole.

〔Example〕

以下本発明に係る音声合成装置におけるピッチ周波数
の生成装置の実施例を図面に基づいて説明する。Hereinafter, an embodiment of a pitch frequency generation device in a speech synthesis device according to the present invention will be described with reference to the drawings.

第２図及び第３図は本発明に係るの第一の実施例を示
すものである。本実施例において、ピッチ周波数生成装
置20は、第２図に示すように、ピッチ周波数の１成分で
あるフレーズパターンを生成するフレーズパターン生成
部21と、ピッチ周波数の他の成分であるアクセントパタ
ーンを生成するアクセントパターン生成部22とを併設し
て、これらの信号を加算器24で加え合わせている。そし
てまたこの加算器24はフレーズパターンとアクセントパ
ターンとを加えあわせる他、上述した式の第１項である
F_minを加え合わせるようにしている。2 and 3 show a first embodiment according to the present invention. In the present embodiment, as shown in FIG. 2, the pitch frequency generation device 20 includes a phrase pattern generation unit 21 that generates a phrase pattern that is one component of the pitch frequency, and an accent pattern that is another component of the pitch frequency. An accent pattern generation unit 22 to generate the signals is provided, and these signals are added by an adder 24. The adder 24 adds the phrase pattern and the accent pattern, and is the first term of the above equation.
F _min is added.

本実施例においてフレーズパターン生成部は第３図に
示すような構成を有している。同図において30は韻律解
析部からのフレーズ指令を格納するバッファメモリで上
式のフレーズを指定する項（第２項）の重ね合わせ項数
をＩを十分格納することができるだけのＮ段の記憶領域
を有し、各記憶領域にはフレーズ指令として大きさA_pi
時刻T_oi、及び固有各周波数α_ｉを格納している。また3
1は制御部32の操作によりバッファメモリ30に格納した
フレーズ指令を次々と取り出すセレクタ、33は上記のフ
レーズ指令に基いて時刻ｔにおけるフレーズパターンを
計算するフレーズパターン計算部、34はフレーズパター
ン計算部33の計算の結果を各時刻毎に重ね合わせて格納
する累積部である。In this embodiment, the phrase pattern generator has a configuration as shown in FIG. In the figure, reference numeral 30 denotes a buffer memory for storing a phrase command from the prosody analysis unit, and N-stage storage which can sufficiently store the number of superimposed terms I of the term (second term) specifying the phrase in the above equation. Each storage area has a size A _pi as a phrase command.
Time _Toi and each unique frequency α _i are stored. Also 3
1 is a selector for successively extracting the phrase commands stored in the buffer memory 30 by the operation of the control unit 32, 33 is a phrase pattern calculation unit that calculates a phrase pattern at time t based on the above-mentioned phrase commands, and 34 is a phrase pattern calculation unit This is an accumulating unit that stores the results of the calculations of 33 superimposed at each time.

上記のフレーズパターン計算機は、制御部32から送ら
れた現時刻ｔに対して以下の式に従ってフレーズパター
ンＰ（ｔ）を計算する。The above-mentioned phrase pattern calculator calculates a phrase pattern P (t) according to the following equation for the current time t sent from the control unit 32.

Ｐ（ｔ）＝A_piG_pi（ｔ−T_oi）但しG_pi（ｔ）＝α_i ²t exp（−α_it）…ｔ≧０ G_pi（ｔ）＝０ …ｔ＜０または傾向検出手段と閾値検出手段と重ね合わせ停止
手段として作動する閾値判定部である。この閾値判定部
35は、上記のフレーズパターン計算部33の計算した時刻
ｔに於けるフレーズパターンの値を受け取りその時刻に
おける第Ｍ番目フレーズ指令に対応する計算値が減少状
態であり（前回の計算値より小さい）、且つ予め定めた
所定の閾値より小さいときには、次回以降この第Ｍ番フ
レーズ指令に対応するフレーズパターンの計算を実行し
ないように指令部32に指令を送るものである。P (t) = A _pi G _pi (t−T _oi ) where G _pi (t) = α _i ² t exp (−α _i t)... T ≧ 0 G _pi (t) = 0. A threshold determination unit that operates as a detection unit, a threshold detection unit, and an overlay stop unit. This threshold judgment unit
Reference numeral 35 denotes a state in which the value of the phrase pattern at the time t calculated by the phrase pattern calculation unit 33 is received, and the calculated value corresponding to the Mth phrase command at that time is in a decreasing state (smaller than the previous calculated value). If it is smaller than a predetermined threshold, a command is sent to the command unit 32 so as not to execute the calculation of the phrase pattern corresponding to the M-th phrase command from the next time.

従ってこの実施例に係るフレーズパターン生成部22に
よれば各フレーズ指令に基くフレーズパターンの計算
は、フレーズ指令に対応する計算値が減少状態であり、
且つ予め定めた所定の閾値より小さいときには実行しな
いようにしたから、例えば第４図において、第１番フレ
ーズ指令に対するフレーズパターンの計算は、フレーズ
パターンの値が減少傾向で且つ閾値S₁以下となる時刻T
_s1以降の計算を実行する必要がなくなり、以下同様に第
２番以降のフレーズ指令に対しても時刻T_s2以降以下同
様にフレーズパターンの計算を実行する必要がなくなる
からフレーズパターンの計算時間を大幅に短縮できるこ
ととなる。Therefore, according to the phrase pattern generation unit 22 according to this embodiment, the calculation of the phrase pattern based on each phrase command is such that the calculated value corresponding to the phrase command is in a reduced state,
Because and was prevented from being performed when predetermined smaller than the predetermined threshold has, for example, in FIG. 4, the calculation of a phrase pattern for No.1 phrase command, the value of the phrase pattern and the threshold value S ₁ or less decline Time T
_It is not necessary to execute the calculation after _s1. Similarly, the calculation of the phrase pattern after time T _s2 is not necessary for the second and subsequent phrase commands. Can be shortened.

次に本実施例に係るピッチ周波数生成装置に設けられ
るアクセントパターン生成部について説明する。この実
施例におけるアクセントパターン生成部はバッファメモ
リに格納されるアクセント指令の内容とフレーズパター
ン計算部をアクセントパターン計算部としこのアクセン
トパターン計算部においてアクセントパターンを計算す
る計算式が上述したフレーズパターン生成部と異なるの
みであるので、その構成については第３図のフレーズパ
ターン計算部をアクセントパターン計算部と置換えるだ
けで足りるので、その詳細な説明は省略する。Next, an accent pattern generation unit provided in the pitch frequency generation device according to the present embodiment will be described. The accent pattern generation unit in this embodiment uses the contents of the accent command stored in the buffer memory and the phrase pattern calculation unit as an accent pattern calculation unit, and the expression for calculating the accent pattern in the accent pattern calculation unit is the phrase pattern generation unit described above. Since only the phrase pattern calculator of FIG. 3 is replaced with the accent pattern calculator, the detailed description thereof is omitted.

このアクセントパターン生成部において、バッファメ
モリには、立ちアクセント指令として、大きさA_aj立ち
上がり時刻T_1j、立ち上がり時刻T_2j、固有角周波数β_ｊ
を格納している。また、上記のアクセントパターン計算
部においては以下の式に従って時刻ｔにおけるアクセン
トパターンＡ（ｔ）が計算される。In this accent pattern generation unit, the buffer memory stores the magnitude A _aj rising time T _1j , rising time T _2j , natural angular frequency β _j in the buffer memory as a rising accent command.
Is stored. The accent pattern calculating section calculates the accent pattern A (t) at time t according to the following equation.

Ａ（ｔ）＝A_aj｛G_aj（ｔ−T_1j） −G_aj（ｔ−T_2j）｝但し G_aj（ｔ）＝_Min［１（１＋β_jt）exp（−β_jt），θ］
…ｔ≧０ G_aj（ｔ）＝０ …ｔ＜０となる。A (t) = _Aaj { _Gaj (t- _T1j ) _-Gaj (t- _T2j )} where _Gaj (t) = _Min [1 (1 + _.beta.jt ) exp (-. _Beta.jt ), .theta.]
... _t≥0 G _aj (t) = 0 ... t <0.

そして、このアクセントパターン生成部によれば上述
したフレーズパターン生成部と同様に、アクセントパタ
ーンの値が減少傾向で且つ所定の閾値以下である場合
は、それ以降の計算を実行しないようにしたから、アク
セントパターン生成の為の計算時間を短いものとするこ
とができる。According to this accent pattern generation unit, similarly to the above-described phrase pattern generation unit, when the value of the accent pattern is decreasing and is equal to or less than a predetermined threshold, the subsequent calculation is not performed. The calculation time for generating the accent pattern can be shortened.

従って本実施例に係るピッチ周波数生成部によれば、
ピッチ周波数を形成するフレーズパターン及びアクセン
トパターンを短時間で生成することができるから、全体
として高速にピッチ周波数を生成することができる。Therefore, according to the pitch frequency generation unit according to the present embodiment,
Since the phrase pattern and the accent pattern forming the pitch frequency can be generated in a short time, the pitch frequency can be generated at high speed as a whole.

なお上記の実施例において、ピッチ周波数生成装置は
フレーズパターンを生成するフレーズパターン生成部
と、ピッチパターンを生成するパターン生成部とから構
成したが、本発明は必ずしも両生成部を有する必要はた
く、どちらか一方の生成部を有するピッチ周波数生成装
置にも適用されることはいうまでもない。In the above-described embodiment, the pitch frequency generation device includes a phrase pattern generation unit that generates a phrase pattern and a pattern generation unit that generates a pitch pattern. However, the present invention does not necessarily need to include both generation units. It goes without saying that the present invention is also applied to a pitch frequency generator having one of the generators.

〔The invention's effect〕

以上説明したように、本発明によればピッチ周波数の
生成装置にピッチパターンが単調減少傾向であるとき減
少傾向信号を発生する傾向検出手段と、該ピッチパター
ンが所定の閾値以下であるとき閾値信号を発生する閾値
検出手段と、減少傾向信号と閾値信号とを受けたとき、
該ピッチパターンの生成を停止する重ね合せ停止手段と
を設けるようにしたから、ピッチ周波数の生成に影響を
与えない程度に小さくなった部分については計算を実行
しないようになり、ピッチ周波数の計算時間を短縮する
ことができ全体としてピッチ周波数の生成時間を短縮す
ることができるという効果を奏する。As described above, according to the present invention, a pitch frequency generating device generates a decreasing tendency signal when a pitch pattern is monotonically decreasing, and a threshold signal when the pitch pattern is equal to or less than a predetermined threshold. When a threshold detecting means for generating a signal and a decreasing tendency signal and a threshold signal are received,
Since the superposition stopping means for stopping the generation of the pitch pattern is provided, the calculation is not executed for a portion which is small enough not to affect the generation of the pitch frequency. And the time required for generating the pitch frequency as a whole can be shortened.

[Brief description of the drawings]

第１図は本発明の原理図、第２図は本発明に係るピッチ
周波数生成装置の実施例を示すブロック図、第３図は第
２図に示したピッチ周波数生成装置のフレーズパターン
生成部を示すブロック図、第４図はピッチ周波数の生成
状態を示す図、第５図はフレーズパターンの状態を示す
図、第６図はアクセントパターンの状態を示す図、第７
図は音声合成装置の構成を示す図である。１……ピッチ周波数生成装置２……重ね合わせ手段３……傾向検出手段４……閾値検出手段５……重ね合わせ停止手段FIG. 1 is a principle diagram of the present invention, FIG. 2 is a block diagram showing an embodiment of a pitch frequency generating device according to the present invention, and FIG. 3 is a diagram showing a phrase pattern generating unit of the pitch frequency generating device shown in FIG. FIG. 4 is a diagram showing a state of generating a pitch frequency, FIG. 5 is a diagram showing a state of a phrase pattern, FIG. 6 is a diagram showing a state of an accent pattern, FIG.
The figure shows the configuration of the speech synthesizer. DESCRIPTION OF SYMBOLS 1 ... Pitch frequency generation apparatus 2 ... Overlapping means 3 ... Trend detection means 4 ... Threshold detection means 5 ... Overlap stop means

Claims

(57) [Claims]

When generating a pitch frequency of a time-varying voice of an output voice of a voice synthesizing apparatus which inputs information relating to a text, converts the text into voice information to be read out, and outputs the voice information, the pitch depends on the characteristics of the text. A pitch frequency generation device having a superposition means for superimposing and synthesizing a plurality of pitch patterns having a characteristic of being monotonically decreasing from a peak value at each time, wherein the pitch frequency in the pitch pattern is monotonically decreasing. A tendency detecting means for generating a decreasing tendency signal when the tendency is detected, a threshold detecting means for generating a threshold signal when the pitch frequency in the pitch pattern is equal to or less than a predetermined threshold, and when receiving the decreasing tendency signal and the threshold signal The pitch frequency is monotonically decreasing, and the pitch frequency is equal to or less than a predetermined threshold. Superimposing stop formed pitch frequency generator in the speech synthesis apparatus and a stop means.