JPH0679239B2

JPH0679239B2 - Speech synthesizer

Info

Publication number: JPH0679239B2
Application number: JP60093812A
Authority: JP
Inventors: 隆矢頭; 誠森戸; 敬三木
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1985-05-02
Filing date: 1985-05-02
Publication date: 1994-10-05
Anticipated expiration: 2009-10-05
Also published as: JPS61252597A

Description

DETAILED DESCRIPTION OF THE INVENTION (Industrial field of application) The present invention relates to an ADPCM decoder that converts differential codes, in particular ADPCM codes, into ordinary PCM codes. It is intended for use in a speech synthesizer that synthesizes speech by reading waveform-domain information on a speech waveform from a storage area.

(Prior art) ADPCM is one method of compressing the bandwidth of speech. For adjacent speech samples (times $T_i$ and $T_{i+1}$), the encoder takes the difference between the predicted value calculated at time $T_i$ and the speech signal at $T_{i+1}$ and encodes it as an ADPCM code. The decoder obtains a quantized value of the difference signal from each code and adds these values successively to reproduce speech in ordinary PCM format. A characteristic of the method is that the quantization step width needed to obtain the quantized difference is varied according to the ADPCM code. A speech sample that requires about 12 bits in ordinary PCM format can be represented in 3 to 4 bits, giving an information compression ratio of 3 to 4.

Another way to compress information is to synthesize speech by repeating the first waveform, as shown in FIG. 4(b), in place of the actual waveform of FIG. 4(a). In general, the transfer function of the vocal tract changes slowly and can be regarded as nearly stationary over 20 to 30 ms. In voiced portions, waveforms that can be regarded as nearly identical appear repeatedly at every pitch period. Calling each of these periodic waveforms a segment waveform, one segment waveform can therefore stand in for several of the segment waveforms that follow it. Repeating each segment waveform 3 to 4 times on average over the whole utterance does not impair the quality of the synthesized speech, and the compression gain is very large. If the number of repetitions becomes too large, however, the repetition outlasts the stationarity of the speech waveform and the sound quality deteriorates. Moreover, as FIG. 4(c) shows for the average power, the power of the actual speech waveform changes smoothly, whereas the synthesized speech exhibits discontinuities in average power caused by the waveform repetition.
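The power discontinuity described above can be illustrated with a small numerical sketch. It is not taken from the patent: the sine-shaped segment, the period of 40 samples and the linearly decaying amplitudes are assumptions chosen only to make the effect visible.

```python
import math

PERIOD = 40  # assumed number of samples in one pitch period


def segment(amplitude):
    """One pitch period of a toy segment waveform (a sine, by assumption)."""
    return [amplitude * math.sin(2 * math.pi * n / PERIOD) for n in range(PERIOD)]


def mean_power(samples):
    """Average power of a list of samples."""
    return sum(s * s for s in samples) / len(samples)


# "Actual" speech: the amplitude decays smoothly from one period to the next.
amplitudes = [1.0 - 0.05 * k for k in range(6)]
actual = [segment(a) for a in amplitudes]

# Compressed speech: every third segment is stored and reused three times.
repeated = [segment(amplitudes[3 * (k // 3)]) for k in range(6)]

actual_power = [mean_power(s) for s in actual]
repeated_power = [mean_power(s) for s in repeated]

for k in range(6):
    print(k, round(actual_power[k], 4), round(repeated_power[k], 4))
```

In the printed table the power of the actual waveform falls a little at every period, while the repeated waveform holds its power for three periods and then drops in one step.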

A waveform repetition method in which the average power changes smoothly is therefore needed. Since the single speech waveform being repeated is of course stored in encoded form, what is required is a method that reproduces waveforms of different power by repeating the same encoded data.

Prior art on this point is described, for example, in Japanese Examined Patent Publication No. 59-14760. The ADPCM decoder disclosed there uses a simple circuit configuration to reproduce waveforms of different power from one ADPCM-encoded segment waveform while using the same ADPCM codes.

FIG. 2 shows the ADPCM decoder disclosed in that publication. To make the following description easier, the circuit has been slightly simplified relative to the one disclosed there.

Its configuration and operation are described below.

First, the decoding of an ADPCM code in FIG. 2 is described. A bias of 0.5 is added to the input code $L_i$, and the result is multiplied by the quantization step width $\Delta_i$ to obtain the differential decoded value $q_i$ (equation (1)).

$$q_i = (L_i + 0.5)\cdot\Delta_i \qquad (1)$$

The differential decoded value $q_i$ is added to the previous reproduced value $\hat{x}_{i-1}$ held in register 55, and the result $\hat{x}_i$ is stored back in register 55 (equation (2)).

$$\hat{x}_i = \hat{x}_{i-1} + q_i \qquad (2)$$
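As a minimal sketch of one decoding step, the bias, multiplication and accumulation of equations (1) and (2) can be written as follows. The function name and the use of floating-point values are illustrative assumptions; the circuit itself works on fixed-width register contents.

```python
def decode_step(previous_value, code, step_width):
    """Apply equations (1) and (2) to one ADPCM code.

    previous_value: reproduced value held in the register before this step
    code:           signed ADPCM code L_i
    step_width:     quantization step width for this code
    """
    q = (code + 0.5) * step_width  # equation (1): differential decoded value
    return previous_value + q      # equation (2): new register contents


print(decode_step(10.0, 3, 2.0))   # 10 + 3.5 * 2
print(decode_step(0.0, -1, 4.0))   # 0 - 0.5 * 4
```

Because of the 0.5 bias, the codes 0 and -1 decode to differences of equal magnitude and opposite sign, so no code decodes to a difference of exactly zero.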

The reproduced value $\hat{x}_i$ obtained in equation (2) is the PCM code of the speech. Next, the quantization step width $\Delta_{i+1}$ to be used for decoding the next ADPCM code $L_{i+1}$ is calculated, which completes the decoding of one ADPCM code $L_i$. The quantization step width $\Delta_i$ is obtained by looking up the quantization ROM 63 with the pointer $P_i$. As shown in Table 1, the quantization ROM 63 stores values that increase exponentially with the address, so that a pointer $P_i$ yields the quantization step width

$$\Delta_i = \Delta_{\min}\cdot A^{P_i} \qquad (3)$$

The step-width update block therefore changes the quantization step width by moving $P_i$ by the pointer movement amount $D_i$:

$$P_{i+1} = P_i + D_i \qquad (4)$$

$$\Delta_{i+1} = \Delta_{\min}\cdot A^{P_{i+1}} \qquad (5)$$

The pointer movement amount $D_i$, which is the output of ROM 60, is obtained by looking up the pointer movement ROM 60 with the ADPCM code $L_i$ as the address. Table 2 shows the contents of ROM 60. The operation of equation (4) is performed by the pointer 61.
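The two lookup tables can be sketched as below. Tables 1 and 2 are not reproduced in this text, so every number here is an assumption: the ratio `A = 1.1`, the smallest step `DELTA_MIN`, the highest address `P_MAX`, and the pointer movement values for signed 4-bit codes are all hypothetical stand-ins that merely have the exponential shape equation (3) requires.

```python
A = 1.1          # assumed ratio between adjacent step widths
DELTA_MIN = 1.0  # assumed smallest step width
P_MAX = 48       # assumed highest ROM address

# Quantization ROM: entry p holds DELTA_MIN * A**p, as in equation (3).
QUANT_ROM = [DELTA_MIN * A ** p for p in range(P_MAX + 1)]

# Hypothetical pointer movement ROM for signed codes -8..7. Codes of small
# magnitude shrink the step width; codes of large magnitude enlarge it.
_MOVE_BY_MAGNITUDE = [-1, -1, -1, -1, 2, 4, 6, 8]
POINTER_MOVE_ROM = {
    code: _MOVE_BY_MAGNITUDE[code if code >= 0 else -code - 1]
    for code in range(-8, 8)
}


def step_width(pointer):
    """Equation (3): look up the step width for a pointer value."""
    return QUANT_ROM[pointer]


def next_pointer(pointer, code):
    """Equation (4): move the pointer by the amount selected by the code."""
    return pointer + POINTER_MOVE_ROM[code]


p = 10
for code in (7, -1, 2):
    print(code, p, round(step_width(p), 3))
    p = next_pointer(p, code)
```

Storing the step widths on an exponential grid is what later makes amplitude scaling cheap: moving the pointer by a fixed amount multiplies every step width by the same factor.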

The above is the ADPCM decoding operation of the circuit in FIG. 2. Because ADPCM decoding is a sequential computation, starting waveform reproduction requires initial values for $\Delta_i$ in equation (1) and for the reproduced value $\hat{x}_i$ in equation (2).

In the circuit of FIG. 2, the initial value $\hat{x}_0$ for equation (2) is given by the output of the quantization ROM 63. That is, using equation (3) and Table 1, the quantization ROM 63 is read with a pointer value $P_{INT}$ that satisfies

$$\hat{x}_0 = \Delta_{\min}\cdot A^{P_{INT}} \qquad (6)$$

and the value read is used as the initial value $\hat{x}_0$. Similarly, the initial value of the quantization step width $\Delta_i$ in equation (1) is obtained from equation (3) by supplying the pointer initial value $P_1$:

$$\Delta_1 = \Delta_{\min}\cdot A^{P_1} \qquad (7)$$

This pointer initial value also serves as the initial value of $P_i$ in equation (4). From equations (1), (2), (3), (6) and (7), the reproduced value for the ADPCM code string $L_1, L_2, \ldots, L_i, \ldots$ of the segment waveform is

$$\hat{x}_i = \Delta_{\min}\cdot A^{P_{INT}} + \sum_{k=1}^{i} (L_k + 0.5)\cdot\Delta_{\min}\cdot A^{P_k} \qquad (8)$$
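Putting the initial values together with the per-code steps gives a complete decoder for one segment. The sketch below follows equations (1) to (4) with the initial reproduced value read at the pointer value P_INT and the first step width read at P_1. The constants and the pointer movement values are hypothetical, the pointer limiter is left out, and the example codes are chosen so that the pointer stays in a sensible range.

```python
A = 1.1          # assumed ratio between adjacent step widths
DELTA_MIN = 1.0  # assumed smallest step width
MOVE_BY_MAGNITUDE = [-1, -1, -1, -1, 2, 4, 6, 8]  # hypothetical Table 2


def pointer_move(code):
    """Pointer movement amount D_i for a signed code in -8..7."""
    return MOVE_BY_MAGNITUDE[code if code >= 0 else -code - 1]


def decode_segment(codes, p_int, p_1):
    """Decode one segment waveform.

    p_int: pointer value used to read the initial reproduced value
    p_1:   initial pointer value, which selects the first step width
    """
    x = DELTA_MIN * A ** p_int           # initial reproduced value
    pointer = p_1
    samples = [x]
    for code in codes:
        step = DELTA_MIN * A ** pointer  # equation (3)
        x = x + (code + 0.5) * step      # equations (1) and (2)
        samples.append(x)
        pointer = pointer + pointer_move(code)  # equation (4)
    return samples


samples = decode_segment([3, -2, 5, 0], p_int=12, p_1=10)
print([round(s, 3) for s in samples])
```

The returned list has one more entry than there are codes, because the initial value is itself output as the first sample.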

Next, the operation of using the ADPCM data $L_1, L_2, \ldots$ to reproduce a waveform of the same shape with only its amplitude multiplied by a constant is described. If a pointer increase/decrease value $S$ is added to the pointer value $P_{INT}$ that gives the initial reproduced value $\hat{x}_0$ in equation (6), and the sum is taken as the new $P_{INT}$, the initial value becomes

$$\hat{x}_0' = \Delta_{\min}\cdot A^{P_{INT}+S} = A^{S}\cdot\hat{x}_0 \qquad (9)$$

that is, $\hat{x}_0$ multiplied by $A^{S}$. In the same way, if the constant $S$ is added to every pointer value $P_i$ in the ADPCM decoding process described above, equation (8) gives

$$\hat{x}_i' = A^{S}\cdot\hat{x}_i \qquad (10)$$

In FIG. 2, the adder 81, register 82 and adder 83 accumulate the value $S'$ added to the pointer each time the segment waveform is repeated, as $S' = S$, $S'' = S' + S$, $S''' = S'' + S$. The segment waveform thus keeps its shape while only its amplitude changes gradually to $A^{S}$, $A^{2S}$, $A^{3S}$, ... times that of the first reproduced waveform, which reduces the power discontinuity of the repeated waveform with a simple circuit.
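The scaling property can be checked numerically. In the sketch below the same codes are decoded twice, once with the original pointer values and once with a constant S added to both starting pointers, and every output sample comes out scaled by A to the power S. The constants and pointer movement values are hypothetical, and no limiter is modelled, so this shows only the ideal behavior.

```python
A = 1.1          # assumed ratio between adjacent step widths
DELTA_MIN = 1.0  # assumed smallest step width
MOVE_BY_MAGNITUDE = [-1, -1, -1, -1, 2, 4, 6, 8]  # hypothetical Table 2


def pointer_move(code):
    return MOVE_BY_MAGNITUDE[code if code >= 0 else -code - 1]


def decode_segment(codes, p_int, p_1):
    """Ideal decoder without a pointer limiter."""
    x = DELTA_MIN * A ** p_int
    pointer = p_1
    samples = [x]
    for code in codes:
        x += (code + 0.5) * DELTA_MIN * A ** pointer
        samples.append(x)
        pointer += pointer_move(code)
    return samples


codes = [3, -2, 5, 0]
S = 3  # pointer increase/decrease value

base = decode_segment(codes, p_int=12, p_1=10)
scaled = decode_segment(codes, p_int=12 + S, p_1=10 + S)
ratios = [s / b for s, b in zip(scaled, base)]

print([round(r, 6) for r in ratios], round(A ** S, 6))

# Offsets accumulated over successive repetitions: 0, S, 2S, 3S.
offsets = [k * S for k in range(4)]
print([round(A ** o, 3) for o in offsets])
```

The pointer movement depends only on the codes, so adding S once at the start keeps the two pointer sequences exactly S apart for the whole segment.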

(Problems to be solved by the invention) With the circuit configuration of the conventional speech synthesizer described above, however, the waveform is not necessarily scaled by a constant factor for every value of the pointer increase/decrease value $S$ or the pointer value $P_i$, so the circuit may fall short in applications that demand particularly accurate amplitude control. For the relations of equations (9) and (10) to hold, the relation

$$P_i' = P_i + S \qquad (11)$$

must hold at all times, and a quantization step width corresponding to $P_i'$ must exist in the quantization ROM 63. In practice, the pointer value actually input to the quantization ROM 63 is restricted by the pointer limiter 62, so depending on the values of $S$ or $P_i'$ the condition of equation (11) may not be satisfied.

The function of the pointer limiter 62 is as follows. As noted above, the quantization ROM 63 stores the quantization step widths $\Delta_{\min}A^{0}$ to $\Delta_{\min}A^{P_{\max}}$ at addresses $P_{\min}$ ($= 0$) to $P_{\max}$, so the pointer value input to the quantization ROM 63 must lie in the range 0 to $P_{\max}$. Depending on the values of the ADPCM codes $L_i$, however, the pointer value $P_i$ can leave this range, so a circuit that restricts the pointer to values from 0 to $P_{\max}$ is needed. This is the pointer limiter 62, which is indispensable in this type of ADPCM decoder.

FIG. 3(a) shows the behavior of the pointer 61 that causes the problem in the conventional scheme. In the interval $T_a$ to $T_b$, the pointer limiter 62 clamps $P_i'$ to $P_{\max}$, so equation (11) does not hold. In the interval $T_c$ to $T_d$, $P_i$ is clamped at $P_{\min}$ while $P_i'$ is not affected by the limiter, so equation (11) again does not hold. In this example equation (11) fails throughout $T_a$ to $T_b$ and $T_c$ to $T_d$, but because actual ADPCM decoding is a cumulative addition, even a single point that violates equation (11) breaks the relation of equation (8) at every subsequent point.
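The failure can be reproduced with a simplified model of the conventional circuit. The model is an assumption, not the circuit of FIG. 2 itself: the offset S is added to the starting pointers, and one limiter clamps the resulting pointer to the original ROM range 0 to P_MAX at every step. The constants, the pointer movement values and the code sequence are hypothetical and chosen so that the pointer runs into the upper limit.

```python
A = 1.1                # assumed ratio between adjacent step widths
DELTA_MIN = 1.0        # assumed smallest step width
P_MIN, P_MAX = 0, 48   # assumed limiter range
MOVE_BY_MAGNITUDE = [-1, -1, -1, -1, 2, 4, 6, 8]  # hypothetical Table 2


def pointer_move(code):
    return MOVE_BY_MAGNITUDE[code if code >= 0 else -code - 1]


def clamp(pointer):
    """Pointer limiter: restrict the pointer to the ROM address range."""
    return max(P_MIN, min(P_MAX, pointer))


def decode_conventional(codes, p_int, p_1, offset):
    """Simplified conventional scheme: offset first, then the limiter."""
    x = DELTA_MIN * A ** clamp(p_int + offset)
    pointer = clamp(p_1 + offset)
    samples = [x]
    for code in codes:
        x += (code + 0.5) * DELTA_MIN * A ** pointer
        samples.append(x)
        pointer = clamp(pointer + pointer_move(code))
    return samples


codes = [7] * 6  # large codes push the pointer up quickly
S = 4
base = decode_conventional(codes, p_int=20, p_1=30, offset=0)
scaled = decode_conventional(codes, p_int=20, p_1=30, offset=S)
ratios = [s / b for s, b in zip(scaled, base)]

print("target ratio:", round(A ** S, 4))
print([round(r, 4) for r in ratios])
```

The first few ratios match the target, but once the offset pointer reaches P_MAX ahead of the original pointer the ratio drifts away and does not recover, which is the loss of similarity the text describes.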

The object of the present invention is to eliminate this loss of waveform similarity under amplitude control, to control amplitude with high accuracy, and to provide a speech synthesizer that is also well suited to applications such as synthesis by rule, in which a single segment is used at various amplitudes.

(Means for solving the problems) The present invention concerns a speech synthesizer having: a first memory that stores quantities corresponding to a plurality of predetermined quantization step widths; a pointer that designates an address of the first memory so that one of the quantities corresponding to the quantization step widths is output from the first memory; a second memory that moves the output of the pointer according to the ADPCM code; a pointer limiter that confines the output of the pointer between the maximum and minimum quantization step widths used when the ADPCM codes were encoded; and a register that can store the reproduced PCM code obtained from the quantization step width output from the first memory.

In such a speech synthesizer, the invention first provides a means that receives information including an amplitude increase/decrease value for a phoneme waveform, is set to zero while the first phoneme waveform is reproduced, and accumulates and stores the amplitude increase/decrease value at each of the second and subsequent phoneme waveform reproductions.

Second, it provides an adding means that designates the address of the first memory by the sum of the output of the storage means and the output of the pointer limiter.

Third, the address range of the first memory is extended beyond the range of quantization step widths used when the ADPCM codes were encoded, toward higher addresses by the amount the adding means can increase the address and toward lower addresses by the amount it can decrease it, and the quantization step widths corresponding to these extended addresses are stored in advance.

(Operation) The output of the pointer, which selects one of the quantities corresponding to the quantization step widths, is supplied to the adding means via the pointer limiter. The pointer limiter confines this output between the maximum and minimum quantization step widths used when the ADPCM codes were encoded. The adder also receives the amplitude increase/decrease value accumulated at each of the second and subsequent waveform reproductions (0 is supplied while the first phoneme waveform is reproduced). As a result, the address of the first memory is the sum of the pointer limiter output and the accumulated amplitude increase/decrease value; in other words, the addresses extend above and below the limit values of the pointer limiter. The quantization step widths corresponding to these extended addresses are stored in the first memory in advance. The PCM code reproduced from the quantization step width output from the first memory is then stored in the register.

Because the limit values of the pointer limiter are thus shifted by the amplitude increase/decrease value, an appropriate quantization step width can always be set, and the problem described above is solved.

(Embodiments) Before an embodiment of the invention is described, the principle of the invention is explained.

The problem is that in the conventional circuit the relation of equation (11) cannot always hold. One remedy would be to remove the pointer limiter 62, in which case the pointer values, at least, always satisfy equation (11). With that approach, however, the range over which the pointer value varies for the various speech data to be ADPCM-encoded is not known, so it is unclear how large a quantization ROM must be provided, and its capacity increases considerably.

Instead, for the pointer values of FIG. 3(a), the limiter operation is changed to that shown in FIG. 3(b): both the upper and lower limit values of the pointer are shifted by the pointer increase/decrease value $S$. Equation (11) then always holds, and the range of pointer movement is known exactly, so it is enough to extend the quantization ROM upward and downward by the range over which the amplitude must be changed. In real speech the amplitudes of adjacent segments do not differ greatly, so the extension of the quantization ROM is small.
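A minimal sketch of the modified limiter operation follows. The limiter clamps the unshifted pointer to the original range, the offset is added afterwards, and the sum addresses a ROM that has been extended by `EXT` entries at each end. The constants, the pointer movement values and the extension of 8 entries are assumptions made for the example.

```python
A = 1.1                # assumed ratio between adjacent step widths
DELTA_MIN = 1.0        # assumed smallest step width
P_MIN, P_MAX = 0, 48   # assumed limiter range of the original ROM
EXT = 8                # assumed number of extra entries at each end
MOVE_BY_MAGNITUDE = [-1, -1, -1, -1, 2, 4, 6, 8]  # hypothetical Table 2

# Extended ROM: list index 0 corresponds to pointer value -EXT.
EXTENDED_ROM = [DELTA_MIN * A ** (address - EXT)
                for address in range(P_MAX + 2 * EXT + 1)]


def pointer_move(code):
    return MOVE_BY_MAGNITUDE[code if code >= 0 else -code - 1]


def clamp(pointer):
    return max(P_MIN, min(P_MAX, pointer))


def decode_proposed(codes, p_int, p_1, offset):
    """Limiter first, then the offset, then the extended ROM lookup."""
    if not -EXT <= offset <= EXT:
        raise ValueError("offset outside the extended ROM range")

    def rom(pointer):
        return EXTENDED_ROM[clamp(pointer) + offset + EXT]

    x = rom(p_int)
    pointer = clamp(p_1)
    samples = [x]
    for code in codes:
        x += (code + 0.5) * rom(pointer)
        samples.append(x)
        pointer = clamp(pointer + pointer_move(code))
    return samples


codes = [7] * 6  # same stress case: the pointer reaches P_MAX
S = 4
base = decode_proposed(codes, p_int=20, p_1=30, offset=0)
scaled = decode_proposed(codes, p_int=20, p_1=30, offset=S)
ratios = [s / b for s, b in zip(scaled, base)]

print("target ratio:", round(A ** S, 4))
print([round(r, 4) for r in ratios])
```

Because the limiter acts on the same pointer sequence in both runs, the ROM addresses always differ by exactly S, and every ratio equals the target even while the pointer is clamped.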

An embodiment of the invention is described in detail below with reference to the drawings.

FIG. 1 is a block diagram of an embodiment of the invention. In the figure, 130 is a control unit that controls the operation of each part; 140 is a multiplexer; 141 is a register that stores the phoneme segment length; 142 is a register that stores the repetition count; 143 is a register that stores the initial-value-setting pointer value; 144 is a register that stores the pointer initial value; 145 is a register that stores the pointer increase/decrease value; 146 is a memory that stores the ADPCM codes; 147 is a counter that is loaded with the phoneme segment length from register 141 and counts down at a predetermined timing; 148 is a counter that is loaded with the repetition count from register 142 and counts down each time the count of counter 147 reaches 0; and 149 is a multiplexer that selects between register 143 and register 144. 155 is a register, 159 is a pointer movement ROM, 160 is a pointer, 161 is a pointer limiter, 162 is a quantization ROM, 163 is a multiplier, 164 and 165 are adders, 166 is a register, and 167 is the output terminal of register 166. 181 is an adder; 182 is a register that stores the sum, output by adder 181, of the pointer increase/decrease value in register 145 and the output of register 182 itself; and 183 is an adder that adds the output of the pointer limiter 161 and the output of register 182.

FIG. 5 shows the format of the data sent through a buffer memory (not shown in FIG. 1) to the speech synthesizer (ADPCM decoder) of FIG. 1. The data for one frame period consist, in order from the start, of: the phoneme segment length, which is the number of ADPCM codes; the repetition count of the ADPCM code string; the initial-value-setting pointer value, used as the pointer value when the initial value for waveform reproduction is read from ROM 162 in each repetition within the frame period; the pointer initial value, used as the initial value of the pointer in each repetition within the frame period; the pointer increase/decrease value; and the ADPCM codes $L_1, L_2, \ldots, L_m$. Data in this format are sent for every frame period.
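The frame layout can be mirrored by a small record type. This is only a sketch of the field order described above: the field names are hypothetical, and the bit widths and binary encoding of each field are not given in the text, so none are assumed here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Frame:
    """One frame period of input data, in the order described for FIG. 5."""
    segment_length: int      # number of ADPCM codes in the segment
    repeat_count: int        # how many times the code string is played
    init_value_pointer: int  # pointer value used to read the initial value
    pointer_init: int        # initial pointer value for each repetition
    pointer_delta: int       # pointer increase/decrease value S
    codes: List[int]         # ADPCM codes L_1 ... L_m

    def __post_init__(self):
        # The segment length is defined as the number of codes, so the two
        # must agree.
        if len(self.codes) != self.segment_length:
            raise ValueError("segment_length must equal the number of codes")


frame = Frame(segment_length=4, repeat_count=3, init_value_pointer=12,
              pointer_init=10, pointer_delta=-1, codes=[3, -2, 5, 0])
print(frame)
```

Keeping the per-frame control values separate from the code string reflects how the circuit loads them into separate registers before playback starts.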

FIG. 6 shows the waveform reproduced when the circuit of FIG. 1 reproduces the data shown in FIG. 5.

The operation of the embodiment of FIG. 1 is described below with reference to FIGS. 5 and 6.

First, data in the format of FIG. 5, input from a transmission system or a speech file, are stored sequentially in the buffer memory (not shown). The data in the buffer memory are then taken in one frame period at a time. By switching the multiplexer 140 in sequence, the data taken from the buffer memory are stored as follows: the phoneme segment length in register 141, the initial-value-setting pointer value in register 143, the pointer initial value in register 144, the pointer increase/decrease value in register 145, and the ADPCM codes in memory 146. Counter 147 is loaded with the phoneme segment length output from register 141, and counter 148 is loaded with the repetition count.

The initial value $\hat{x}_0$ of register 166 is given by the output of ROM 162. The initial-value-setting pointer value $P_{INT}$ stored in register 143 is loaded into pointer 160 through multiplexer 149, ROM 162 is read through the pointer limiter 161 and adder 183 (register 182 is initially set to 0, so the addition does not change the value), and the value read is stored in register 166. The initial value $\hat{x}_0$ of the phoneme waveform obtained in this way is output from output terminal 167 at time point $T_1$.

Next, before the waveform from time point $T_2$ onward is reproduced, the initial value of pointer 160 is set. Multiplexer 149 is switched so that the pointer initial value $P_1$ stored in register 144 is loaded into pointer 160. This value becomes the initial value of $P_i$ in equation (4) and, through the pointer limiter 161 and adder 183, reads ROM 162 to determine the quantization step width $\Delta_1$ for $i = 1$ in equation (1).

Waveform reproduction at time point $T_2$ proceeds as follows. The first ADPCM code $L_1$ input from memory 146 has the bias 0.5 added by adder 168 and is then multiplied by the quantization step width $\Delta_1$ in multiplier 163 (equation (1)). Adder 165 adds the product to the reproduced value $\hat{x}_0$ of the previous time point (equation (2)); the result $\hat{x}_1$ is stored in register 166 and output from output terminal 167 at time point $T_2$. In parallel, ROM 159 is read with the ADPCM code $L_1$ as the address, and pointer 160 performs the operation of equation (4) to give the pointer value for the next time point $T_3$. In the same way, from time point $T_3$ onward the ADPCM codes $L_2, \ldots, L_{29}$ are used to reproduce the speech waveform shown in FIG. 6.

In this way the reproduced value $\hat{x}_i$ at each time point is obtained. Each time the control unit 130 takes in and processes one ADPCM code, it decrements the phoneme segment length in counter 147 by one. The count of counter 147 is monitored continuously, and when it reaches 0 the processing of all ADPCM codes for one phoneme period is judged to be complete. The repetition count from register 142 is held in counter 148 and likewise monitored continuously; when the phoneme segment length count reaches 0, the repetition count in counter 148 is decremented by one.

If the repetition count is then not 0, the phoneme segment length is loaded again from register 141 into counter 147, and multiplexer 149 is switched to the register 143 side so that the initial-value-setting pointer value $P_{INT}$ stored in register 143 is loaded into pointer 160. At this point the pointer increase/decrease value $S$ stored in register 145 is input to adder 181 and added to the value held in register 182 (which is 0 at the end of the first phoneme period), and the sum is stored in register 182 as the pointer increase/decrease value $S'$.
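The control flow of the two counters and the accumulating register can be sketched as a pair of nested loops. The mapping of variables to counters 147 and 148 and to register 182 follows the description above. Everything numeric is an assumption, and the function `rom` simply evaluates the exponential formula as a stand-in for the extended quantization ROM 162.

```python
A = 1.1                # assumed ratio between adjacent step widths
DELTA_MIN = 1.0        # assumed smallest step width
P_MIN, P_MAX = 0, 48   # assumed limiter range
MOVE_BY_MAGNITUDE = [-1, -1, -1, -1, 2, 4, 6, 8]  # hypothetical Table 2


def pointer_move(code):
    return MOVE_BY_MAGNITUDE[code if code >= 0 else -code - 1]


def clamp(pointer):
    return max(P_MIN, min(P_MAX, pointer))


def rom(address):
    """Stand-in for the extended quantization ROM."""
    return DELTA_MIN * A ** address


def play_frame(codes, repeat_count, p_int, p_1, s):
    """Play one frame and return one list of samples per repetition."""
    passes = []
    accumulated = 0              # register 182: zero during the first pass
    repeats_left = repeat_count  # counter 148
    while repeats_left > 0:
        x = rom(clamp(p_int) + accumulated)  # initial value for this pass
        pointer = clamp(p_1)
        samples = [x]
        codes_left = len(codes)              # counter 147
        while codes_left > 0:
            code = codes[len(codes) - codes_left]
            x += (code + 0.5) * rom(pointer + accumulated)
            samples.append(x)
            pointer = clamp(pointer + pointer_move(code))
            codes_left -= 1
        passes.append(samples)
        repeats_left -= 1
        if repeats_left > 0:
            accumulated += s                 # adder 181 updates register 182
    return passes


passes = play_frame([3, -2, 5, 0], repeat_count=3, p_int=12, p_1=10, s=-2)
for k, samples in enumerate(passes):
    print(k, [round(v, 3) for v in samples])
```

With a negative S each repetition is a scaled-down copy of the first, so the amplitude decays in equal ratios from one repetition to the next.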

If the output of the pointer limiter 161 has already reached the limit value $P_{\min}$ or $P_{\max}$, however, using the same quantization ROM as in the conventional ADPCM decoder of FIG. 2 would cause an address underflow or overflow. In the quantization ROM 162 of this embodiment, the addresses are therefore extended above and below the limiter limit values as shown in FIG. 7, and the corresponding quantization step widths are stored. To compensate for the shift in the correspondence between addresses and quantization step widths caused by this extension, the embodiment of FIG. 1 adds an offset equal to the lower address extension to the initial value of pointer 160 and to the limit values of the pointer limiter 161. As a result, equations (9) and (10) always hold, so when a phoneme is repeated the circuit outputs a waveform whose shape is unchanged and whose amplitude is multiplied by a constant. How far the quantization ROM should be extended then becomes the question. The size of the extension, that is, the magnitude of the accumulated value $S'$ of the amplitude increase/decrease value $S$, corresponds to the ratio of the average powers of adjacent frames. Since the amplitude changes relatively little over an interval in which the phoneme shape can be regarded as nearly constant, a range of 1/2 to 2 in amplitude, or about 8 steps above and below in terms of the quantization step widths of FIG. 7, is sufficient. Seen against the circuit as a whole, a very small increase yields a sufficient effect.
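The figures quoted above also indicate the size of the ROM ratio $A$ in equation (3). As a rough check, assume that 8 steps correspond to exactly a factor of 2 in amplitude (the text says only "about 8 steps"):

```latex
A^{8} = 2
\quad\Longrightarrow\quad
A = 2^{1/8} \approx 1.09,
\qquad
A^{-8} = \tfrac{1}{2}
```

Under that assumption each ROM step changes the amplitude by roughly 9 percent, and an accumulated offset $S'$ between $-8$ and $+8$ covers the stated amplitude range of 1/2 to 2.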

The first embodiment of the invention has been described above. It addresses the compression of speech information by segment repetition, but the technique is even more effective when the ADPCM method is used for synthesis by rule, whose aim is to synthesize arbitrary words. FIG. 8 shows the structure of the quantization ROM in the second embodiment. In synthesis by rule, small speech units such as syllables, phonemes and segments are joined to form continuous speech. A technique is therefore needed that can easily set the amplitude of each speech unit to various values according to the context in which it is used.

In this case too, the circuit of FIG. 1 can be used as it stands as the synthesis section for synthesis by rule. Here, however, to make amplitude control easy it is advantageous, also from the standpoint of waveform accuracy, to normalize all of the segment data from which speech is synthesized, treat the normalized amplitude as the maximum, and control the amplitude only in the decreasing direction. The quantization ROM 162 is therefore extended with the necessary quantization step widths only in the lower direction, as shown in FIG. 8 (an extension of 31 steps, corresponding to about 1/20 in amplitude, is sufficient).
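The pairing of 31 steps with a factor of about 1/20 can be checked against equation (3). Assuming, for the sake of the arithmetic, that 31 steps correspond to exactly 1/20:

```latex
A^{-31} = \tfrac{1}{20}
\quad\Longrightarrow\quad
A = 20^{1/31} \approx 1.10
```

so a downward extension of 31 entries is consistent with a ROM in which adjacent step widths differ by about 10 percent.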

(Effects of the invention) As described above, the invention can output a waveform in which the amplitude of the same ADPCM data is multiplied by a constant more accurately than the conventional configuration can. It can therefore further reduce the discontinuity in average power of synthesized speech whose information has been compressed by repeating the same waveform.

[Brief description of drawings]

FIG. 1 is a block diagram of the first embodiment of the invention. FIG. 2 is a block diagram of a conventional ADPCM decoder. FIG. 3(a) shows an example of the behavior of the pointer in the ADPCM decoder of FIG. 2, and FIG. 3(b) shows the limiter operation in the invention. FIG. 4(a) shows an example of an actual speech waveform, FIG. 4(b) shows the waveform of synthesized speech in which the same waveform is repeated three times, and FIG. 4(c) shows the average power of each of FIGS. 4(a) and 4(b). FIG. 5 shows the data format in the first embodiment. FIGS. 6(a) and 6(b) show an example of waveforms reproduced by the first embodiment. FIG. 7 shows the contents of ROM 162 in the first embodiment, and FIG. 8 shows the contents of ROM 162 in the second embodiment.

130: control unit; 140: multiplexer; 141 to 145: registers; 146: memory; 147, 148: counters; 149: multiplexer; 155: register; 159: ROM; 160: pointer; 161: pointer limiter; 162: ROM; 163, 165, 181, 183: adders; 166, 182: registers.

Claims

1. A speech synthesizer having: a first memory that stores quantities corresponding to a plurality of predetermined quantization step widths; a pointer that designates an address of the first memory so that one of the quantities corresponding to the quantization step widths is output from the first memory; a second memory that moves the output of the pointer according to an ADPCM code; a pointer limiter that confines the output of the pointer between the maximum and minimum quantization step widths used when the ADPCM codes were encoded; and a register that can store a reproduced PCM code obtained from the quantization step width output from the first memory, the speech synthesizer being characterized by comprising: means that receives information including an amplitude increase/decrease value for a phoneme waveform, is set to zero while the first phoneme waveform is reproduced, and accumulates and stores the amplitude increase/decrease value at each of the second and subsequent phoneme waveform reproductions; and adding means that designates the address of the first memory by the sum of the output of the storage means and the output of the pointer limiter; wherein the address range of the first memory is extended, relative to the range of quantization step widths used when the ADPCM codes were encoded, to higher or lower addresses by the amount of increase or decrease produced by the adding means, and the quantization step widths corresponding to these extended addresses are stored in advance.