JPH0769718B2

JPH0769718B2 - Speech synthesis method

Info

Publication number: JPH0769718B2
Application number: JP63067826A
Authority: JP
Inventors: 祐之東福
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1988-03-22
Filing date: 1988-03-22
Publication date: 1995-07-31
Anticipated expiration: 2010-07-31
Also published as: JPH01239599A

Description

【発明の詳細な説明】［産業上の利用分野］本発明は適応型差分パルス符号変調方式（以下、単にAD
PCM方式）を用いた音声合成に係わり、特に１つの代表
波形素片を合成し、この代表波形素片をくり返し使用す
ることにより音声を合成する波形素片合成方式に関す
る。DETAILED DESCRIPTION OF THE INVENTION [Industrial field of application] The present invention relates to an adaptive differential pulse code modulation method (hereinafter, simply referred to as AD
The present invention relates to speech synthesis using the PCM method), and more particularly, to a waveform segment synthesis method in which one representative waveform segment is synthesized and voices are synthesized by repeatedly using this representative waveform segment.

［従来の技術］従来、音声合成方式にはPCM方式、ADPCM方式などの公知
の波形符号化方式が採用されており、特にADPCM方式はP
CM方式に比べて1/2以下のデータ量で波形を合成できる
ためデータ量の圧縮という面では非常に有効な方式であ
る。ADPCM方式というのは音声の隣接したサンプリング
間の相関の強さを利用したデータ圧縮方式であり、連続
したサンプリングポイントでの振幅値の差を符号化・量
子化する際に、その量子化幅を適応的に変化させて信号
中のノイズを軽減させる方式である。量子化に使用する
量子化幅は、現在の量子化幅とADPCM符号とにより次の
量子化幅を予測して使用する。この予測計算の一例を示
すと、Δｎ＋１＝Δｎ×Ｍ（Ln）となり、Lnはｎ番目の
ADPCM符号データ、Δｎはｎ番目のサンプリングポイン
トに対する量子化幅の大きさを表しＭ（Ln）は予測係数
であり、上式にしたがって現在の量子化幅Δｎに予測係
数Ｍ（Ln）を乗じた値を次のサンプリングポイントの量
子化幅Δｎ＋１として使用する。音声レベルが小さい時
は予測係数Ｍ（Ln）は１より小さくなり量子化幅Δｎ＋
１も小さくなる。逆に音声レベルが大きくなると予測係
数Ｍ（Ln）は１より大きくなり、その結果量子化幅Δｎ
＋１は大きくなる。但し量子化幅が大きくなりすぎると
逆にノイズが多くなる為、量子化幅の上限の値を決めて
おく必要があり、また量子化幅の値が０になってしまう
と次にいかなる予測係数をかけても０のままとなってし
まうため量子化幅の下限もあらかじめ設定しておく必要
がある。このような予測係数Ｍ（Ln）との乗算を含む処
理をし、読み出し専用メモリ（以下、ROM）に量子化幅
Δｎのデータをテーブル化して入れておき、上式に示さ
れた予測演算をROMから読みだされる量子化幅Δｎを使
用して行うようにする方式が高集積回路（以下、LSI）
化するのに適している。上記LSIにはADPCM符号と量子化
幅ポインタでROMのアドレスを指定可能なようにして差
分値データのテーブルを作成している。[Prior Art] Conventionally, known waveform coding methods such as PCM and ADPCM have been adopted as a voice synthesis method.
This is a very effective method in terms of data volume compression, since waveforms can be synthesized with less than half the data volume compared to the CM system. The ADPCM method is a data compression method that uses the strength of the correlation between adjacent samplings of voice, and when encoding and quantizing the difference in amplitude value at consecutive sampling points, the quantization width is set to This is a method of adaptively changing to reduce noise in the signal. As the quantization width used for quantization, the next quantization width is predicted and used based on the current quantization width and the ADPCM code. An example of this prediction calculation is Δn + 1 = Δn × M (Ln), where Ln is the nth
ADPCM code data, Δn represents the size of the quantization width for the nth sampling point, M (Ln) is a prediction coefficient, and the current quantization width Δn is multiplied by the prediction coefficient M (Ln) according to the above formula. Use the value as the quantization width Δn + 1 for the next sampling point. When the voice level is low, the prediction coefficient M (Ln) becomes smaller than 1 and the quantization width Δn +
1 will be smaller. On the contrary, when the voice level increases, the prediction coefficient M (Ln) becomes larger than 1, and as a result, the quantization width Δn
+1 becomes larger. However, if the quantization width becomes too large, noise will increase. Therefore, it is necessary to determine the upper limit value of the quantization width. If the quantization width becomes 0, any prediction coefficient Since it remains 0 even when multiplied, it is necessary to set the lower limit of the quantization width in advance. A process including multiplication with such a prediction coefficient M (Ln) is performed, data of the quantization width Δn is stored in a table in a read-only memory (hereinafter, ROM), and the prediction calculation shown in the above equation is performed. Highly integrated circuit (hereinafter referred to as LSI) is a method that uses the quantization width Δn read from ROM.
Suitable for converting. In the LSI, a table of difference value data is created so that the ROM address can be designated by the ADPCM code and the quantization width pointer.

量子化幅ポインタに応じてROMに入れる差分値を一定の
比率で増大するような値にしておけば量子化幅ポインタ
の値を増加させることは量子化幅に１より大きい予測係
数をかけることを意味し、予測係数の乗算を量子化幅ポ
インタに対する加減算に置き換えることができる。Increasing the value of the quantization width pointer means multiplying the quantization width by a prediction coefficient larger than 1 if the difference value to be inserted into the ROM according to the quantization width pointer is set to a value that increases at a constant rate. Meaning, the multiplication of the prediction coefficient can be replaced by addition and subtraction for the quantization width pointer.

［発明が解決しようとする問題点］一般に音声波形は子音部の音声ノイズと母音部のトーン
波形とその間のつなぎの部分に分けられ、特に母音部の
トーン波形はほとんど同じ周期で同じ形の波形が連続的
に少しずつ形を変化させながらエンベロープをつけて並
んでいるのが普通である。その中で連続する２ないし３
波形について見ればほとんど同じ形になっている為、波
形素片合成方式では代表波形として１波形選びそれをく
り返し使用することにより音声合成のデータ量を圧縮し
ている。[Problems to be Solved by the Invention] Generally, a voice waveform is divided into a voice noise of a consonant part, a tone waveform of a vowel part, and a connecting part between them. In particular, a tone waveform of a vowel part has almost the same period and the same shape. Are usually lined up with an envelope while gradually changing their shapes little by little. 2 to 3 consecutive in that
Since the waveforms have almost the same shape, the waveform segment synthesis method compresses the data amount of speech synthesis by selecting one waveform as a representative waveform and repeatedly using it.

ADPCM方式でこのような波形のくり返しを使って音声合
成のデータ量を圧縮しようとする場合、ADPCM方式では
音声をナイキスト周波数でサンプリングし隣接したサン
プリングポイント間での音声波形の振幅値の差分値を適
当な量子化幅で符号化するもので、しかも量子化幅を各
サンプリングポイントの差分値の大きさに応じて適応的
に変化させる方式である。従ってADPCM方式で代表波形
素片を合成する場合量子化幅は１波形内で一定ではな
く、また各サンプリングポイントでの量子化幅はそれぞ
れ直前のADPCMデータに依存することになる。またADPCM
方式においてはサンプリング周波数は通常音質とビット
レートの関係により4KHZ〜8KHZが使用されている。以上
説明したようなADPCM方式を用いて代表波形素片を合成
する場合実際の原波形のピッチ周期と合成した波形のピ
ッチ周期は完全に一致させることはできず、ピッチ周期
の誤差の影響により合成波形は最後のサンプリングポイ
ントの振幅は０にならない。また原音のピッチ周期と合
成波形のピッチ周期が完全に一致した場合でも最後のサ
ンプリングポイントでの振幅値は直前のサンプリングポ
イントでの振幅値に量子化幅とADPCM符号により決まる
差分値を加えたものであるため直前のサンプリングポイ
ントでの量子化幅の値によっては１波形内の最後のサン
プリングポイントの振幅値を０にできない場合が生じ
る。When trying to compress the amount of data for speech synthesis by repeating such waveforms in the ADPCM method, the ADPCM method samples the speech at the Nyquist frequency and determines the difference value of the amplitude value of the speech waveform between adjacent sampling points. This is a method of encoding with an appropriate quantization width, and is a method of adaptively changing the quantization width according to the magnitude of the difference value at each sampling point. Therefore, when the representative waveform segment is synthesized by the ADPCM method, the quantization width is not constant within one waveform, and the quantization width at each sampling point depends on the immediately preceding ADPCM data. Also ADPCM
In the method, the sampling frequency is usually 4KHZ to 8KHZ depending on the relationship between sound quality and bit rate. When synthesizing the representative waveform segment using the ADPCM method as described above, the pitch period of the actual original waveform and the pitch period of the synthesized waveform cannot be perfectly matched, and they are synthesized due to the influence of the pitch period error. The waveform does not have zero amplitude at the last sampling point. Even when the pitch period of the original sound and the pitch period of the synthesized waveform completely match, the amplitude value at the last sampling point is the amplitude value at the previous sampling point plus the difference value determined by the quantization width and ADPCM code. Therefore, depending on the value of the quantization width at the immediately previous sampling point, the amplitude value at the last sampling point in one waveform may not be 0.

以上説明したように従来のADPCM方式をそのまま用いて
代表波形素片を合成する場合、代表波形素片の最後のサ
ンプリングポイントでの振幅値を０にできない為、この
ような代表波形素片を用いて代表波形素片の合成を複数
回くり返しを行うと合成波形の振幅中心が変化してしま
うことになる。これはADPCM方式が基本的に各サンプリ
ングポイントの振幅値に差分値を加えて次のサンプリン
グポイントの振幅値を得るという差分符号化方式である
ため、１波形の最終振幅値が０でないとその最終振幅値
の値が誤差となり、これらの誤差が複数回のくり返しに
より累積されてゆき、合成波形の振幅中心が変化してし
まうことになるからである。合成波形の振幅中心が変化
すると波形データを音声に変換するD/Aコンバータで振
幅中心が変化した分だけオーバーフローする可能性があ
り、オーバーフローした場合には正常な音声波形が合成
できなくなるという重大な欠点が生じる。As described above, when a typical waveform segment is combined using the conventional ADPCM method as it is, since the amplitude value at the last sampling point of the representative waveform segment cannot be set to 0, such a representative waveform segment is used. If the synthesis of the representative waveform segment is repeated a plurality of times, the amplitude center of the synthesized waveform will change. This is a differential encoding method in which the ADPCM method basically adds the difference value to the amplitude value of each sampling point to obtain the amplitude value of the next sampling point, so if the final amplitude value of one waveform is not 0 This is because the value of the amplitude value becomes an error, and these errors are accumulated by repeating a plurality of times, and the amplitude center of the composite waveform changes. If the amplitude center of the synthesized waveform changes, the D / A converter that converts the waveform data to voice may overflow by the amount of the change in the amplitude center, and if it overflows, a normal voice waveform cannot be synthesized. There are drawbacks.

［問題点を解決するための手段、作用及び効果］本発明は、まず、第１の記憶手段と第２の記憶手段とに
それぞれの初期値を設定した後、第１のサンプリングポ
イントのADPCM符号データの供給を受け差分値供給手段
が、該ADPCM符号データと第１の記憶手段の初期値とに
基づき差分値を出力すると、出力波形振幅データ更新手
段が第２の記憶装置の初期値に差分値を加えて第１のサ
ンプリングポイントの出力波形振幅データを形成する。
この出力波形振幅データは第２の記憶手段に記憶される
と共に出力される。一方、量子化幅ポインタ更新手段も
ADPCM符号データと第１の記憶手段の初期値に基づき新
たな量子化幅ポインタの値を形成し、これを第１の記憶
手段に記憶させる。続く、第２のサンプリングポイント
では、そのADPCM符号データと更新された量子化幅ポイ
ンタの値とに基づき差分値が出力され、出力波形振幅デ
ータ更新手段がこの差分値を更新された出力波形振幅デ
ータに加算して第２のサンプリングポイントの出力波形
振幅データを形成し、これを第２の記憶手段に記憶させ
ると共に第２サンプリングポイントの出力波形振幅デー
タとして出力する。一方、量子化幅ポインタの値も更新
される。こうして複数の出力波形振幅データが出力され
ると、第１の記憶手段と第２の記憶手段とがそれぞれの
初期値に再び設定され、後続する繰り返し波形について
も上記と同様の手順で複数の出力波形振幅データが形成
される。[Means, Actions and Effects for Solving Problems] According to the present invention, first, respective initial values are set in the first storage means and the second storage means, and then the ADPCM code of the first sampling point is set. When the difference value supplying means receives the data and outputs the difference value based on the ADPCM code data and the initial value of the first storage means, the output waveform amplitude data updating means makes a difference to the initial value of the second storage device. The values are added to form output waveform amplitude data at the first sampling point.
The output waveform amplitude data is stored and output in the second storage means. On the other hand, the quantization width pointer updating means
A new quantization width pointer value is formed based on the ADPCM code data and the initial value of the first storage means, and this is stored in the first storage means. At the subsequent second sampling point, a difference value is output based on the ADPCM code data and the updated value of the quantization width pointer, and the output waveform amplitude data updating means updates the difference value of the output waveform amplitude data. To form output waveform amplitude data at the second sampling point, store this in the second storage means, and output it as output waveform amplitude data at the second sampling point. On the other hand, the value of the quantization width pointer is also updated. When a plurality of output waveform amplitude data are output in this way, the first storage means and the second storage means are reset to their respective initial values, and the subsequent repeated waveforms are output in the same manner as described above. Waveform amplitude data is formed.

したがって、代表波形素片を合成する場合でも、各代表
波形素片の最後のサンプリングポイントにおける出力波
形振幅データを初期値設定手段により初期値、例えば
「０」に設定することができ、複数の代表波形素片を連
続的に処理しても、各合成波形の振幅中心を一定の初期
値に保つことができ、オーバーフローなどの欠点を完全
に解決することができる。Therefore, even when the representative waveform pieces are combined, the output waveform amplitude data at the last sampling point of each representative waveform piece can be set to an initial value, for example, "0" by the initial value setting means, and a plurality of representatives can be set. Even if the waveform pieces are continuously processed, the amplitude center of each combined waveform can be maintained at a constant initial value, and a defect such as overflow can be completely solved.

さらに本発明の一実施例では、複数の差分値の組を有し
ており、これらの組の一つを選択して使用するので、同
じADPCM符号データをくり返し使用しても、同一の振幅
データのくり返しとはならず、１波形ごとにエンベロー
ブを負荷することができるという利点を有している。Furthermore, in one embodiment of the present invention, a set of a plurality of difference values is provided, and since one of these sets is selected and used, even when the same ADPCM code data is repeatedly used, the same amplitude data is used. It is not repeated, and has an advantage that the envelope can be loaded for each waveform.

［実施例］次に本発明について図面を参照して説明する。EXAMPLES Next, the present invention will be described with reference to the drawings.

第１図は本発明の第１実施例の回路構成図であり、第１
図に示された音声合成方式ではADPCM符号データがＬレ
ジスタ101にラッチされ、Ｌレジスタ101の内容に基づき
差分値データテーブルROM106の第１アドレスを指定す
る。Ｌレジスタ101の内容をデコーダ102でデコードした
値と、現在の量子化幅ポインタの値を保持しているＡレ
ジスタ104の値とを加算して次のサンプリングポイント
の量子化幅ポインタの値を定め、これをＡレジスタ104
に再び入れ、このＡレジスタ104の内容で差分値データ
テーブルROM106の第２アドレスを指定する。量子化幅ポ
インタの値が指定外の値にならないように加算器とＡレ
ジスタ105との間にリミッタ103を入れておく。差分値デ
ータテーブルROM106に供給されるアドレスの上位２ビッ
トでROM出力を切り替えている。量子化幅ポインタの値
が入っているＡレジスタ104の内容をＡ′レジスタ105に
転送して予め記憶させておき必要な時Ａ′レジスタ105
の内容をＡレジスタ104に読み出せるようにしておく。
差分値データテーブルROM106からは差分値データが出力
され、サンプリングポイントごとに１つ前の波形振幅値
が入っているＸレジスタ107の内容と差分値データを加
算し、次の波形振幅値をＸレジスタ107に入れている。
Ｘレジスタ107の内容はリセット信号で「０」にするこ
とができるようになっている。音声合成のフローチャー
トを第３図に示している。音声合成が開始されるとまず
最初はＸレジスタ107とＡレジスタ104とAD1・AD2にそれ
ぞれ初期値「０」をセットし（ステップＩ）、サンプリ
ングポイントごとにADPCM符号データをＬレジスタ101に
ラッチし（ステップIIのＬ←DATA）、Ｌレジスタ101の
値と量子化幅ポインタのＡレジスタ104の値とでROMアド
レスを指定して差分値データテーブルROM106の出力とＸ
レジスタ107内の値とを加算し、その結果をＸレジスタ1
07に格納し（ステップIIのＸ←Ｘ＋（L,A））、Ｘレジ
スタ107の内容をD/Aコンバータから出力する。この時Ｌ
レジスタの内容をデコーダ102でデコードした値（Ｌ）
と、Ａレジスタ104の値とを加算し次の量子化幅ポイン
タとしてＡレジスタ104に格納する（ステップIIのＡ←
Ａ＋（Ｌ））。以上が通常のADPCM方式による音声合成
時のフローであり、次にくり返される波形の合成時のフ
ローを説明すると、最初のくり返し波形の始めにＸレジ
スタ107とAD1・AD2を「０」にリセットし、Ａレジスタ1
04の内容をＡ′レジスタ105に記憶させてから（ステッ
プIII）最初のサンプリングポイントS1についてＸレジ
スタ107にセットすべき値Ｘを演算して求める。以下、
サンプリングポイント（S1〜S8、第５図参照）ごとに通
常の演算処理をくり返しＸレジスタ107の値をD/Aコンバ
ータから出力する（ステップIV）。２回目のくり返し波
形の始めにＸレジスタ107を「０」にリセットし（ステ
ップＶのＸ←０）、Ａ′レジスタ105の内容をＡレジス
タ104に読み出し（ステップＶのＡ←Ａ′）、差分値デ
ータテーブルROM106の内容としては、ROM上位アドレス
のAD1＝0,AD2＝０の場合に通常の差分値データが選ば
れ、AD1＝0,AD2＝１の場合に全体的に1.2倍に大きくさ
れた差分値データが選ばれ、AD1＝1,AD2＝１の場合全体
的に0.8倍に小さくした差分値データが選ばれるように
しておき、差分値データテーブルROM106の上位アドレス
AD1・AD2をエンベロープの増加、減少に合わせて適当な
値に設定してから（ステップＶのAD1,2←設定値）各サ
ンプリングポイントごとに通常の演算処理を行いＸレジ
スタ107の値をD/Aコンバータから出力する（ステップV
I）。上記ステップＶとVIとは必要なくり返し回数RPだ
けくり返せば第５図のS9〜S24に示すような同じADPCM符
号データを使ってエンベロープのついたくり返し波形を
出力させることができる。FIG. 1 is a circuit configuration diagram of a first embodiment of the present invention.
In the voice synthesis method shown in the figure, ADPCM code data is latched in the L register 101, and the first address of the difference value data table ROM 106 is designated based on the contents of the L register 101. The value of the L register 101 decoded by the decoder 102 and the value of the A register 104 holding the current value of the quantization width pointer are added to determine the value of the quantization width pointer of the next sampling point. , This is A register 104
And the second address of the difference value data table ROM 106 is designated by the contents of the A register 104. A limiter 103 is inserted between the adder and the A register 105 so that the value of the quantization width pointer does not become a value other than the specified value. The ROM output is switched by the upper 2 bits of the address supplied to the difference value data table ROM 106. The contents of the A register 104 containing the value of the quantization width pointer are transferred to the A'register 105 and stored in advance, and when necessary, the A'register 105
The contents of are read out to the A register 104.
The difference value data is output from the difference value data table ROM 106, the contents of the X register 107 containing the previous waveform amplitude value at each sampling point are added to the difference value data, and the next waveform amplitude value is added to the X register. I put it in 107.
The contents of the X register 107 can be set to "0" by a reset signal. A flow chart of voice synthesis is shown in FIG. When voice synthesis is started, first, initial values “0” are set in the X register 107, the A register 104, and AD1 and AD2 (step I), and ADPCM code data is latched in the L register 101 at each sampling point. (L ← DATA in step II), the ROM address is specified by the value of the L register 101 and the value of the A register 104 of the quantization width pointer, and the output of the difference value data table ROM 106 and X
The value in register 107 is added, and the result is added to X register 1
It is stored in 07 (X ← X + (L, A) in step II), and the contents of the X register 107 are output from the D / A converter. At this time L
Value (L) obtained by decoding the contents of the register with the decoder 102
And the value in the A register 104 are added and stored in the A register 104 as the next quantization width pointer (A ← in step II).
A + (L)). The above is the flow when synthesizing voice by the normal ADPCM method. To explain the flow when synthesizing the waveform to be repeated next, reset the X register 107 and AD1 and AD2 to "0" at the beginning of the first repeated waveform. A register 1
After storing the contents of 04 in the A'register 105 (step III), the value X to be set in the X register 107 is calculated for the first sampling point S1. Less than,
Normal calculation processing is repeated for each sampling point (S1 to S8, see FIG. 5) and the value of the X register 107 is output from the D / A converter (step IV). At the beginning of the second repeated waveform, the X register 107 is reset to “0” (X ← 0 in step V), the contents of the A ′ register 105 are read to the A register 104 (A ← A ′ in step V), and the difference is calculated. As the contents of the value data table ROM 106, normal difference value data is selected when the ROM upper address is AD1 = 0, AD2 = 0, and is enlarged 1.2 times as a whole when AD1 = 0, AD2 = 1. The difference value data is selected, and when AD1 = 1 and AD2 = 1, the difference value data that is 0.8 times smaller is selected as a whole.
After setting AD1 and AD2 to appropriate values according to the increase and decrease of the envelope (AD1,2 in step V ← set value), normal arithmetic processing is performed at each sampling point and the value of X register 107 is set to D / Output from A converter (Step V
I). By repeating steps V and VI as many times as necessary, it is possible to output an enveloped repeating waveform using the same ADPCM code data as shown in S9 to S24 of FIG.

第１実施例では、Ａレジスタ104とＸレジスタ107がそれ
ぞれ第１、第２の記憶手段を、差分値データテーブルRO
M106が差分値供給手段を、Ｌレジスタ101、デコーダ10
2、加算器が量子化幅ポインタ更新手段を、差分値デー
タテーブルROMの出力側加算器が出力波形振幅データ更
新手段を、Ａ′レジスタ、Ｘレジスタ107のリセット信
号が初期値設定手段をそれぞれ構成している。In the first embodiment, the A register 104 and the X register 107 respectively serve as the first and second storage means and the difference value data table RO.
M106 is a difference value supply means, L register 101, decoder 10
2. The adder constitutes the quantization width pointer updating means, the output side adder of the difference value data table ROM constitutes the output waveform amplitude data updating means, and the reset signal of the A'register and the X register 107 constitutes the initial value setting means. is doing.

第２図は本発明の第２実施例の回路構成図である。ADPC
M符号データがＬレジスタ201にラッチされ、Ｌレジスタ
の201内容で差分値データテーブルROM206〜208のアドレ
スを指定する。Ｌレジスタ201の内容をデコーダ202でデ
コードした値と、現在の量子化幅ポインタの値を保持し
ているＡレジスタ204の値とを加算して得られる次のサ
ンプリングポイントの量子化幅ポインタの値をＡレジス
タ204に入れ、Ａレジスタ204の内容で差分値データテー
ブルROM206〜208のアドレスを指定する。量子化幅ポイ
ンタの値が指定外の値にならないようリミッタ203を入
れておく。差分値データテーブルROM206〜208は通常の
差分値データ群を記憶しているROM2（207）と全体的に
1.2倍に大きくした差分値データ群を記憶しているROM3
（208）と全体的に0.8倍小さくした差分値データ群を記
憶しているROM1（206）とで構成されており、スイッチ2
09でROMの出力を選択的に切り替えている。FIG. 2 is a circuit configuration diagram of the second embodiment of the present invention. ADPC
The M code data is latched in the L register 201, and the address of the difference value data table ROM 206 to 208 is designated by the content of 201 of the L register. The value of the quantization width pointer at the next sampling point obtained by adding the value obtained by decoding the contents of the L register 201 with the decoder 202 and the value of the A register 204 holding the current value of the quantization width pointer In the A register 204, and the addresses of the difference value data tables ROM 206 to 208 are designated by the contents of the A register 204. The limiter 203 is inserted so that the value of the quantization width pointer does not exceed the specified value. The difference value data table ROMs 206 to 208 are generally the same as the ROM 2 (207) that stores a normal difference value data group.
ROM3 that stores the difference value data group that is 1.2 times larger
(208) and the ROM1 (206) that stores the difference value data group that is 0.8 times smaller than that of the switch 2
In 09, the output of ROM is selectively switched.

音声合成のフローチャートを第４図に示しているが、第
１実施例に比べてスイッチ209で差分値データテーブルR
OM206〜208をエンベロープの増加・減少に合わせて選択
的に切り替えること以外は全く同じフローであり、第５
図に示すエンベロープのついたくり返し波形を第１実施
例と同様に出力させることができる。A flow chart of voice synthesis is shown in FIG. 4, but the difference value data table R is used by the switch 209 as compared with the first embodiment.
The flow is exactly the same except that OM206 to 208 are selectively switched according to the increase / decrease of the envelope.
The repeated waveform with the envelope shown in the figure can be output as in the first embodiment.

上記第２実施例ではＡレジスタ204、Ｘレジスタ210が第
１の記憶手段と第２の記憶手段とをそれぞれ構成してお
り、差分値データテーブルROM1〜ROM3（206〜208）は差
分値供給手段をＬレジスタ201、デコーダ202、加算器は
量子化幅ポインタ更新手段を、差分値データテーブルRO
M1〜ROM3の出力側加算器が出力波形振幅データ更新手段
を、Ａ′レジスタ205、Ｘレジスタ210のリセット信号が
初期値設定手段をそれぞれ構成している。In the second embodiment, the A register 204 and the X register 210 constitute the first storage means and the second storage means, respectively, and the difference value data tables ROM1 to ROM3 (206 to 208) are the difference value supply means. The L register 201, the decoder 202, and the adder are quantization width pointer updating means, and the difference value data table RO
The output side adders of M1 to ROM3 constitute output waveform amplitude data updating means, and the reset signals of the A'register 205 and X register 210 constitute initial value setting means.

以上説明したように本発明の実施例ではくり返し波形の
最初のサンプリングポイントの差分値として振幅０を基
準にし、２回目以降のくり返し波形の最初のサンプリン
グポイントの量子化幅ポインタの値は１回目の値を使用
してくり返し波形の初期値を同じにしてADPCM方式で代
表波形素片のADPCM符号データをそのまま使用し、同一
波形をくり返し合成することができ、しかも差分値デー
タテーブルROMにエンベロープの増加・減少に合わせた
差分値データをあらかじめ入れておくことにより１波形
ごとにエンベロープの変化するくり返し波形を合成する
ことが可能であり、音質を劣化させずに合成データ量の
圧縮を行うのに非常に有効である。なお上記第１、第２
実施例では通常の差分値データと全体的に1.2倍大きく
した差分値データと全体的に0.8倍に小さくした差分値
データの３種類を使用しているがエンベロープの振幅変
化の精度に合わせて差分値データテーブルROMを任意に
設定して良いことは明らかである。As described above, in the embodiment of the present invention, the value of the quantization width pointer at the first sampling point of the second and subsequent repeated waveforms is set to the value of the first sampling point based on the amplitude 0 as the difference value of the first sampling point of the repeated waveform. By using the same value as the initial value of the repeated waveform and using the ADPCM code data of the representative waveform segment as it is in the ADPCM method, it is possible to repeatedly synthesize the same waveform and increase the envelope in the difference value data table ROM.・ By inserting the difference value data according to the decrease in advance, it is possible to synthesize a repeating waveform whose envelope changes for each waveform, which is very useful for compressing the amount of synthesized data without degrading the sound quality. Is effective for. The above first and second
In the embodiment, three types of difference value data, which are normal difference value data, 1.2 times larger overall difference value data, and 0.8 times smaller difference value data are used. It is obvious that the value data table ROM may be set arbitrarily.

[Brief description of drawings]

第１図は本発明の音声合成方式の第１実施例を示す回路
構成図、第２図は本発明の音声合成方式の第２実施例を
示す回路構成図、第３図は第１図の第１実施例で音声合
成を行うためのフローチャート、第４図は第２図の第２
実施例で音声合成を行うためのフローチャート、第５図
は合成した波形のサンプリングポイントを示す波形図で
ある。 101,201……Ｌレジスタ、 102,202……デコーダ、 103,203……リミッタ、 104,204……Ａレジスタ、 105,205……Ａ′レジスタ、 106,206,207,208……差分値データテーブルROM、 209……スイッチ、 107,210……Ｘレジスタ。FIG. 1 is a circuit configuration diagram showing a first embodiment of the speech synthesis system of the present invention, FIG. 2 is a circuit configuration diagram showing a second embodiment of the speech synthesis system of the present invention, and FIG. 3 is a diagram of FIG. A flow chart for performing voice synthesis in the first embodiment, FIG. 4 is a second flow chart of FIG.
FIG. 5 is a waveform diagram showing sampling points of the synthesized waveforms, which is a flow chart for performing voice synthesis in the embodiment. 101,201 ... L register, 102,202 ... decoder, 103,203 ... limiter, 104,204 ... A register, 105,205 ... A 'register, 106,206,207,208 ... difference value data table ROM, 209 ... switch, 107,210 ... X register.

Claims

[Claims]

1. In a speech synthesis method for forming and outputting output waveform amplitude data for each of a plurality of sampling points, an initial value or a value of a quantization width pointer of a sampling point preceding each sampling point is temporarily stored. A first storage means, a difference value that outputs a difference value based on the ADPCM code data at each sampling point and the initial value or the value of the quantization width pointer at the preceding sampling point stored in the first storage means. Supply means and at each sampling point
Forming a quantization width pointer value for a subsequent sampling point based on the ADPCM code data and the initial value or the quantization width pointer value at the preceding sampling point stored in the first storage means; Quantization width pointer updating means for storing the value of the quantization width pointer for the subsequent sampling point in the first storage means, and temporarily storing the initial value or the output waveform amplitude data at each sampling point, and Second storage means for outputting output waveform amplitude data, and each sampling by adding the difference value to an initial value temporarily stored in the second storage means or output waveform amplitude data at a preceding sampling point. Output waveform amplitude data at the point, and the output waveform amplitude data is stored in the second storage means. Output waveform amplitude data updating means to be stored; and initial value setting means for setting respective initial values in the first storage means and the second storage means each time a plurality of output waveform amplitude data is output. Characteristic speech synthesis method.

2. The difference value supplying means has a plurality of sets of difference values, selects one of the plurality of sets of difference values, and outputs the difference value from the selected set. The speech synthesis system according to claim 1.